18-667: Algorithms for Large-scale Distributed Machine Learning and Optimization (Fall 2025)

Instructor:

Prof. Gauri Joshi (gaurij)

Teaching Assistants:

Anupam Nayak(anupamn)
Jong-Ik Park(jongikp)
Shivam Patel(shivamap)
He Wang(hew2)
Tong Yang(tongyang)

Lecture:

Mon/Wed 3:30pm-4:50pm ET, Posner Hall A35

Recitation:

Friday 9:00 am-9:50am ET, GHC 4307

Office Hours: Posted on Piazza


Course Overview

The objective of this course is to introduce students to state-of-the-art algorithms in large-scale machine learning and distributed optimization, in particular, the emerging field of federated learning. Topics to be covered include but are not limited to:

Prerequisites


Comparison with Related Courses

Textbooks

Students are expected to read the research paper discussed in each lecture and review the lecture slides to prepare for the quizzes and homework assignments. Material covered in the first part of the class is also in Prof. Joshi's book on Optimization Algorithms for Distributed Machine Learning, available through CMU libraries

Piazza

We will use Piazza for class discussions. We strongly encourage students to post on this forum rather than emailing the course staff directly (this will be more efficient for both students and staff). Students should use Piazza to:

The course Academic Integrity Policy must be followed on the message boards at all times. Do not post or request homework solutions! Also, please be polite.

Tentative Grading Policy

Grades will be based on the following components:

Collaboration Policy

Group studying and collaborating on problem sets are encouraged, as working together is a great way to understand new material. Students are free to discuss the homework problems with anyone under the following conditions: Students are encouraged to read CMU's Policy on Cheating and Plagiarism.

Schedule (subject to change)

Date Lecture/Recitation Readings Announcements
08/25 Intro and Logistics [Slides] [Annotated]

08/27 SGD in Machine Learning [Slides] [Annotated]

08/29 Math Review

HW1 release
09/01 Labor Day; No classes

09/03 Lecture Cancelled

09/05 Review of SGD convergence analysis

09/08 SGD Convergence Analysis I [Slides] [Annotated]

09/10 SGD Convergence Analysis II [Slides] [Annotated]

09/12 Review of Order Statistics

09/15 Distributed Synchronous SGD [Slides] [Annotated]

09/17 Asynchronous SGD [Slides]

09/19 Concept Review and Practice

HW1 due; HW2 release
09/22 Quiz 1

9/24 Local-update SGD

9/26 Review of Local Update SGD

9/29 Adacomm, Elastic Averaging, Overlap SGD

10/01 Quantized and Sparsified Distributed SGD

10/03 Review of Concepts and Proofs

10/06 Decentralized SGD

10/08 Federated Learning Intro

10/10 No recitation

HW2 due; HW3 release
10/13 Fall Break

10/15 Fall Break

10/17 Fall Break

10/20 Data Heterogeneity in FL

10/22 Computational Heterogeneity in FL

10/24 Concept Review and Practice

Project proposal due
10/27 Client Selection and Partial Participation

10/29 Quiz 2

10/31 No recitation

HW3 due
11/03 Personalized Federated Learning

11/05

HW4 release; HW3 due

11/07 Concept Review and Project Office Hours

Project Checkpoint I

11/10 Fairness and Participation Incentives

11/12 Differential Privacy and Secure Aggregation

11/14 Concept Review and Project Office Hours

Project Checkpoint II

11/17 Robustness to Adversaries

11/19 Federated Learning in the LLM Era

11/21 Concept Review and Practice

11/24 Quiz 3

11/27 Thanksgiving Break

11/29 Thanksgiving Break

12/01 Project Presentations

12/03 Project Presentations

HW4 due

12/05 Project Presentations

12/10

Project reports due