Introduction to Data-Centric AI

Friday, January 27, 2023 at 1:00pm to 2:00pm

Building 35, 35-225
127 MASSACHUSETTS AVE, Cambridge, MA 02139

Typical machine learning classes teach techniques to produce effective models for a given dataset. In real-world applications, however, data is messy, and improving the model is not the only way to get better performance: you can also improve the dataset itself rather than treating it as fixed. Data-Centric AI (DCAI) is an emerging science that studies techniques for improving datasets, which is often the best way to improve performance in practical ML applications. While good data scientists have long practiced this manually through ad hoc trial and error and intuition, DCAI treats the improvement of data as a systematic engineering discipline.

This is the first-ever course on DCAI. It covers algorithms to find and fix common issues in ML data and to construct better datasets, concentrating on data used in supervised learning tasks such as classification. All material taught in this course is highly practical, focusing on impactful aspects of real-world ML applications rather than the mathematical details of how particular models work. Take this course to learn practical techniques not covered in most ML classes, which will help mitigate the “garbage in, garbage out” problem that plagues many real-world ML applications.

Register at 

Events By Audience

Public, MIT Community, Students, Alumni, Faculty, Staff


data, data science, artificial intelligence, machine learning

