Schedule

We will aim to fill in lecture topics at least 1 week in advance. Assignment due dates are final, unless there are exceptional unforeseen circumstances.

Event

Date

Description

Course Material
Lecture

08/21
Monday

Lecture 1 - Course Introduction
[slides]
Lecture

08/23
Wednesday

Lecture 2 - Data challenges
[slides]
Suggested Readings:
- Lessons from measurement [only need to read measurement section]
- When You Hear the Margin of Error Is Plus or Minus 3 Percent, Think 7 Instead
Assignment

08/23
Wednesday

Homework #1 - Polling and Data Collection released!

[Homework #1 - Polling and Data Collection]
Lecture

08/28
Monday

Lecture 3 - Survey weighting
[slides]
Suggested Readings:
Lecture

08/30
Wednesday

Lecture 4 - Weighting 2
[slides]
Lecture

09/06
Wednesday

Lecture 5 - Other aspects of data collection
[slides]
Optional Readings:
- Designing Informative Rating Systems: Evidence from an Online Labor Market
- Reputation Inflation
Lecture

09/11
Monday

Lecture 6 - Recommendations introduction
[slides]
Suggested Readings:
- Mining Massive Datasets, Ch 9, Recommendations
- Stitchfix Algorithms tour
Lecture

09/13
Wednesday

Lecture 7 - Recommendations, from predictions to decisions
[slides]
Suggested Readings:
- The Engagement-Diversity Connection: Evidence from a Field Experiment on Spotify
- Algorithmic Effects on the Diversity of Consumption on Spotify
Due

09/12 23:59 ET
Tuesday

Homework #1 due
Assignment

09/13
Wednesday

Homework #2 - Recommendations released!

[Homework #2 - Recommendations]
Lecture

09/18
Monday

Lecture 8 - Algorithmic Pricing basics
[slides]
Lecture

09/20
Wednesday

Guest - Lily Xu Guest Lecture

Title: High-stakes decisions with low-quality data: Learning and planning under uncertainty for conservation

Abstract: Wildlife poaching pushes countless species to the brink of extinction, with animal population sizes declining by an average of 70% since 1970. To aid rangers in preventing poaching in protected areas around the world, we have developed the Protection Assistant for Wildlife Security (PAWS). We present technical advances in multi-armed bandits, robust reinforcement learning, and causal inference, guided by research questions that emerged from on-the-ground challenges in deploying PAWS. We also discuss bridging the gap between research and practice, from field tests in Uganda and Cambodia to large-scale deployment through integration with SMART, the leading software system for protected area management used by over 1,000 wildlife parks worldwide.

Bio: Lily Xu is a computer science PhD student at Harvard developing AI techniques to address environmental planning challenges. She focuses on advancing methods in machine learning, sequential decision-making, and game theory for biodiversity conservation through preventing wildlife poaching. Her work building the PAWS system to predict poaching hotspots has been deployed across Africa, southeast Asia, and Latin America and is being scaled globally through integration with SMART conservation software. Lily co-organizes the Mechanism Design for Social Good (MD4SG) research initiative and serves as AI Lead for the SMART Partnership. Her research has been recognized with best paper runner-up at AAAI, the INFORMS Doing Good with Good OR award, and a Google PhD Fellowship.
Lecture

09/25
Monday

Lecture 9 - Algorithmic Pricing complications
[slides]
Suggested Readings:
- The Algorithm behind Plane Ticket Prices and How to Get the Best Deal
- Book chapter on Dynamic Programming
Notes:
- Handwritten notes backward induction 1
- Handwritten notes backward induction 2
Assignment

09/26
Tuesday

Homework #3 - Dynamic and Personalized Pricing released!

[Homework #3 - Dynamic and Personalized Pricing]
Due

09/26 23:59 ET
Tuesday

Homework #2 due
Lecture

09/27
Wednesday

Lecture 10 - Algorithmic Pricing complications 2
[slides]
Lecture

10/02
Monday

Lecture 11 - Algorithmic Pricing practice -- ride-hailing
[slides]
Lecture

10/04
Wednesday

In class discussion - Ethics in (Algorithmic) Pricing
[Pre-class Homework]
Suggested Readings:
- Fairness as a Constraint on Profit Seeking: Entitlements in the Market
Lecture

10/09
Monday

NO CLASS -- FALL BREAK
Lecture

10/11
Wednesday

Lecture 12 - Experimentation -- Introduction
[slides]
Suggested Readings:
Assignment

10/15
Sunday

Homework #4 - Experimentation released!

[Homework #4 - Experimentation]
Lecture

10/16
Monday

Guest Lecture -- Emily Aiken

VIRTUAL GUEST LECTURE

Talk title: Algorithmic allocation of humanitarian aid with machine learning and digital data

Abstract: The vast majority of humanitarian aid and social protection programs globally are targeted, providing assistance to individuals or communities identified to be poorest or most in need. In low and middle-income countries, the targeting of aid programs is often limited by low-quality, out-of-date, or missing data on poverty and vulnerability. Novel “big” digital data sources, such as those captured by satellites, mobile phones, and financial services providers – when combined with advances in machine learning – can improve the accuracy of aid program targeting. In this talk, I will cover empirical results on the accuracy of these new data-driven and algorithmic approaches to aid allocation, as well as the potential and limitations of additional use cases in impact evaluation and poverty measurement.

Speaker bio: Emily is a PhD candidate at the UC Berkeley School of Information, where she studies the application of novel algorithms and digital data sources for social protection programs. Her work has been published in venues including Nature and Science Advances, and she is a recipient of a Microsoft Research PhD fellowship. Prior to Berkeley, Emily received her bachelor’s degree in computer science from Harvard.
Due

10/17 23:59 ET
Tuesday

Homework #3 due
Lecture

10/18
Wednesday

CANCELLED CLASS
Lecture

10/23
Monday

Lecture 13 - Experimentation -- Peeking and Interference
[slides]
Suggested Readings:
- Lyft – experimentation in marketplaces (Part 1), (Part 2), (Part 3)
- Toward Causal Inference With Interference
Lecture

10/25
Wednesday

Lecture 14 - Experimentation in marketplaces
[slides]
Suggested Readings:
- Switchback Tests and Randomized Experimentation Under Network Effects at DoorDash
- A/B testing at marketplaces
Lecture

10/30
Monday

Guest Lecture -- Allison Koenecke

VIRTUAL GUEST LECTURE

Title: Equitable Decision-Making in Public Resource Allocation

Abstract: Algorithmically guided decisions are becoming increasingly prevalent and, if left unchecked, can amplify pre-existing societal biases. In this talk, I audit the equity of decision-making in two public resource allocation settings. First, I present a methodological framework for online advertisers to determine a demographically equitable allocation of individuals being shown ads for SNAP (food stamp) benefits – specifically, considering budget-constrained trade-offs between ad conversions for English-speaking and Spanish-speaking SNAP applicants. Second, I discuss sensitivity analyses on public funding allocation algorithms such as CalEnviroScreen, an algorithm used to promote environmental justice by aiding disadvantaged census tracts – which we find to encode bias against tracts with high immigrant populations. In both case studies, we will discuss methods to mitigate allocative harm and to foster equitable outcomes using accountability mechanisms.

Paper links: SNAP, CalEnviroScreen

Bio: Allison Koenecke is an Assistant Professor of Information Science at Cornell University. Her research applies computational methods, such as machine learning and causal inference, to study societal inequities in domains from online services to public health. Koenecke is regularly quoted as an expert on disparities in automated speech-to-text systems. She previously held a postdoctoral researcher role at Microsoft Research and received her PhD from Stanford’s Institute for Computational and Mathematical Engineering.
Lecture

11/01
Wednesday

Special Lecture (by Gabriel and Zhi) -- Heterogeneous reporting in NYC311 systems
Optional Readings:
- Quantifying Spatial Under-reporting Disparities in Resident Crowdsourcing
- A Bayesian Spatial Model to Correct Under-Reporting in Urban Crowdsourcing
Due

10/31 23:59 ET
Tuesday

Homework #4 due
Lecture

11/06
Monday

Guest - Emma Pierson Guest Lecture

Title: Data Science for Social Equality

Abstract: Our society remains profoundly unequal. This talk discusses how data science and machine learning can be used to facilitate more equitable decision-making by presenting case studies around policing and COVID-19.

Bio: Emma Pierson is an assistant professor of computer science at the Jacobs Technion-Cornell Institute at Cornell Tech and the Technion, and a computer science field member at Cornell University. She holds a secondary joint appointment as an Assistant Professor of Population Health Sciences at Weill Cornell Medical College. She develops data science and machine learning methods to study inequality and healthcare. Her work has been recognized by best paper, poster, and talk awards, an NSF CAREER award, a Rhodes Scholarship, Hertz Fellowship, Rising Star in EECS, MIT Technology Review 35 Innovators Under 35, and Forbes 30 Under 30 in Science. Her research has been published at venues including ICML, KDD, WWW, Nature, and Nature Medicine, and she has also written for The New York Times, FiveThirtyEight, Wired, and various other publications.
Lecture

11/08
Wednesday

Guest - Smitha Milli Guest Lecture
Lecture

11/13
Monday

Lecture 15 - Project description and game theory
[slides]
Lecture

11/15
Wednesday

Lecture 16 - Differential Privacy
[slides]
Suggested Readings:
Lecture

11/20
Monday

Lecture 17 - Limits to prediction
[slides]
Suggested Readings:
Lecture

11/27
Monday

Lecture 18 - Discrimination in Platforms
[slides]
Lecture

11/29
Wednesday

Guest - Serina Chang Guest Lecture