Schedule
We will aim to fill in lecture topics at least 1 week in advance. Assignment due dates are final, unless there are exceptional unforeseen circumstances.
-
EventDateDescriptionCourse Material
-
Lecture08/21
MondayLecture 1 - Course Introduction[slides] -
Lecture08/23
WednesdayLecture 2 - Data challenges[slides]Suggested Readings:
- Lessons from measurement [only need to read measurement section]
- When You Hear the Margin of Error Is Plus or Minus 3 Percent, Think 7 Instead
-
Assignment08/23
WednesdayHomework #1 - Polling and Data Collection released! -
Lecture08/28
MondayLecture 3 - Survey weighting[slides] -
Lecture08/30
WednesdayLecture 4 - Weighting 2[slides] -
Lecture09/06
WednesdayLecture 5 - Other aspects of data collection[slides] -
Lecture09/11
MondayLecture 6 - Recommendations introduction[slides]Suggested Readings:
-
Lecture09/13
WednesdayLecture 7 - Recommendations, from predictions to decisions[slides] -
Due09/12 23:59 ET
TuesdayHomework #1 due -
Assignment09/13
WednesdayHomework #2 - Recommendations released! -
Lecture09/18
MondayLecture 8 - Algorithmic Pricing basics[slides] -
Lecture09/20
WednesdayGuest - Lily Xu Guest LectureTitle: High-stakes decisions with low-quality data: Learning and planning under uncertainty for conservation
Abstract: Wildlife poaching pushes countless species to the brink of extinction, with animal population sizes declining by an average of 70% since 1970. To aid rangers in preventing poaching in protected areas around the world, we have developed the Protection Assistant for Wildlife Security (PAWS). We present technical advances in multi-armed bandits, robust reinforcement learning, and causal inference, guided by research questions that emerged from on-the-ground challenges in deploying PAWS. We also discuss bridging the gap between research and practice, from field tests in Uganda and Cambodia to large-scale deployment through integration with SMART, the leading software system for protected area management used by over 1,000 wildlife parks worldwide.
Bio: Lily Xu is a computer science PhD student at Harvard developing AI techniques to address environmental planning challenges. She focuses on advancing methods in machine learning, sequential decision-making, and game theory for biodiversity conservation through preventing wildlife poaching. Her work building the PAWS system to predict poaching hotspots has been deployed across Africa, southeast Asia, and Latin America and is being scaled globally through integration with SMART conservation software. Lily co-organizes the Mechanism Design for Social Good (MD4SG) research initiative and serves as AI Lead for the SMART Partnership. Her research has been recognized with best paper runner-up at AAAI, the INFORMS Doing Good with Good OR award, and a Google PhD Fellowship.
-
Lecture09/25
MondayLecture 9 - Algorithmic Pricing complications[slides] -
Assignment09/26
TuesdayHomework #3 - Dynamic and Personalized Pricing released! -
Due09/26 23:59 ET
TuesdayHomework #2 due -
Lecture09/27
WednesdayLecture 10 - Algorithmic Pricing complications 2[slides] -
Lecture10/02
MondayLecture 11 - Algorithmic Pricing practice -- ride-hailing[slides] -
Lecture10/04
WednesdayIn class discussion - Ethics in (Algorithmic) PricingSuggested Readings:
-
Lecture10/09
MondayNO CLASS -- FALL BREAK -
Lecture10/11
WednesdayLecture 12 - Experimentation -- Introduction[slides] -
Assignment10/15
SundayHomework #4 - Experimentation released! -
Lecture10/16
MondayGuest Lecture -- Emily AikenVIRTUAL GUEST LECTURE
Talk title: Algorithmic allocation of humanitarian aid with machine learning and digital data
Abstract: The vast majority of humanitarian aid and social protection programs globally are targeted, providing assistance to individuals or communities identified to be poorest or most in need. In low and middle-income countries, the targeting of aid programs is often limited by low-quality, out-of-date, or missing data on poverty and vulnerability. Novel “big” digital data sources, such as those captured by satellites, mobile phones, and financial services providers – when combined with advances in machine learning – can improve the accuracy of aid program targeting. In this talk, I will cover empirical results on the accuracy of these new data-driven and algorithmic approaches to aid allocation, as well as the potential and limitations of additional use cases in impact evaluation and poverty measurement.
Speaker bio: Emily is a PhD candidate at the UC Berkeley School of Information, where she studies the application of novel algorithms and digital data sources for social protection programs. Her work has been published in venues including Nature and Science Advances, and she is a recipient of a Microsoft Research PhD fellowship. Prior to Berkeley, Emily received her bachelor’s degree in computer science from Harvard.
-
Due10/17 23:59 ET
TuesdayHomework #3 due -
Lecture10/18
WednesdayCANCELLED CLASS -
Lecture10/23
MondayLecture 13 - Experimentation -- Peeking and Interference[slides] -
Lecture10/25
WednesdayLecture 14 - Experimentation in marketplaces[slides] -
Lecture10/30
MondayGuest Lecture -- Allison KoeneckeVIRTUAL GUEST LECTURE
Title: Equitable Decision-Making in Public Resource Allocation
Abstract: Algorithmically guided decisions are becoming increasingly prevalent and, if left unchecked, can amplify pre-existing societal biases. In this talk, I audit the equity of decision-making in two public resource allocation settings. First, I present a methodological framework for online advertisers to determine a demographically equitable allocation of individuals being shown ads for SNAP (food stamp) benefits – specifically, considering budget-constrained trade-offs between ad conversions for English-speaking and Spanish-speaking SNAP applicants. Second, I discuss sensitivity analyses on public funding allocation algorithms such as CalEnviroScreen, an algorithm used to promote environmental justice by aiding disadvantaged census tracts – which we find to encode bias against tracts with high immigrant populations. In both case studies, we will discuss methods to mitigate allocative harm and to foster equitable outcomes using accountability mechanisms.
Paper links: SNAP, CalEnviroScreen
Bio: Allison Koenecke is an Assistant Professor of Information Science at Cornell University. Her research applies computational methods, such as machine learning and causal inference, to study societal inequities in domains from online services to public health. Koenecke is regularly quoted as an expert on disparities in automated speech-to-text systems. She previously held a postdoctoral researcher role at Microsoft Research and received her PhD from Stanford’s Institute for Computational and Mathematical Engineering.
-
Lecture11/01
WednesdaySpecial Lecture (by Gabriel and Zhi) -- Heterogeneous reporting in NYC311 systems -
Due10/31 23:59 ET
TuesdayHomework #4 due -
Lecture11/06
MondayGuest - Emma Pierson Guest LectureTitle: Data Science for Social Equality
Abstract: Our society remains profoundly unequal. This talk discusses how data science and machine learning can be used to facilitate more equitable decision-making by presenting case studies around policing and COVID-19.
Bio: Emma Pierson is an assistant professor of computer science at the Jacobs Technion-Cornell Institute at Cornell Tech and the Technion, and a computer science field member at Cornell University. She holds a secondary joint appointment as an Assistant Professor of Population Health Sciences at Weill Cornell Medical College. She develops data science and machine learning methods to study inequality and healthcare. Her work has been recognized by best paper, poster, and talk awards, an NSF CAREER award, a Rhodes Scholarship, Hertz Fellowship, Rising Star in EECS, MIT Technology Review 35 Innovators Under 35, and Forbes 30 Under 30 in Science. Her research has been published at venues including ICML, KDD, WWW, Nature, and Nature Medicine, and she has also written for The New York Times, FiveThirtyEight, Wired, and various other publications.
-
Lecture11/08
WednesdayGuest - Smitha Milli Guest Lecture -
Lecture11/13
MondayLecture 15 - Project description and game theory[slides] -
Lecture11/15
WednesdayLecture 16 - Differential Privacy[slides] -
Lecture11/20
MondayLecture 17 - Limits to prediction[slides] -
Lecture11/27
MondayLecture 18 - Discrimination in Platforms[slides] -
Lecture11/29
WednesdayGuest - Serina Chang Guest Lecture