Beidi Chen

Email · beidic@andrew.cmu.edu

Check InfiniAI Lab @ CMU for latest update.

I am starting as an Assistant Professor of Electrical and Computer Engineering at Carnegie Mellon University in Fall 2023.

My lab is recruiting for 23' (Application Link)! I am looking for students and interns who are excited to tackle efficiency problems in ML together from an algorithm, modeling, or system/hardware perspective. Ph.D., master's, undergraduate, and visiting students are welcome to reach out!

I am currently a visiting researcher at Meta/Facebook AI Research (FAIR). Previously, I was a postdoc researcher at Stanford working with Dr. Chris Ré. I received my Ph.D. in Computer Science from Rice University under the supervision of Dr. Anshumali Shrivastava in 2020. I received my B.S. from University of California, Berkeley in 2015. My mentors were Dr. Sara Alspaugh, Dr. Kaifei Chen and my advisor was Dr. Randy Katz. My research focuses on large-scale machine learning. Specifically, I design and optimize randomized algorithms (algorithm-hardware co-design) to accelerate large machine learning systems for real-world problems.

NEWS:

We will present three papers: H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models, Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer, and Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions in NeurIPS 2023.
We will present Fast Algorithms for a New Relaxation of Optimal Transport in COLT 2023.
We will present three papers: Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time (Oral), FlexGen: High-throughput Generative Inference of Large Language Models with a Single GPU (Oral), and CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks in ICML 2023.
We will present two papers: Decentralized Training of Foundation Models in Heterogeneous Environments (Oral) and Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees in NeurIPS 2022.
Our paper: Monarch: Expressive Structured Matrices for Efficient and Accurate Training got Outstanding Paper Runner Up in ICML 2022.
Our recent work Monarch got Outstanding Poster Award at the EfficientML Bay Area meetup!
Pixelated Butterfly is accepted by ICLR 2022 (Spotlight)!

Research

Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang Wang, Beidi Chen. " H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models". NeurIPS 2023.

Yuandong Tian, Yiping Wang, Beidi Chen, Simon Shaolei Du. " Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer". NeurIPS 2023.

Stefano Massaroli, Michael Poli, Daniel Y Fu, Hermann Kumbong, David W. Romero, Rom Nishijima Parnichkun, Aman Timalsina, Quinn McIntyre, Beidi Chen, Atri Rudra, Ce Zhang, Christopher Ré, Stefano Ermon, Yoshua Bengio. " Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions". NeurIPS 2023.

Moses Charikar, Beidi Chen, Christopher Ré, Erik Waingarten. " Fast Algorithms for a New Relaxation of Optimal Transport ". Conference on Learning Theory (COLT) 2023.

Zichang Liu, Jue Wang, Tri Dao, Tianyi Zhou, Binhang Yuan, Zhao Song, Anshumali Shrivastava, Ce Zhang, Yuandong Tian, Christopher Ré, Beidi Chen. " Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time". ICML 2023. (Oral).

Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, Daniel Y. Fu, Zhiqiang Xie, Beidi Chen, Clark Barrett, Joseph E. Gonzalez, Percy Liang, Christopher Ré, Ion Stoica, Ce Zhang. " High-throughput Generative Inference of Large Language Models with a Single GPU". ICML 2023. (Oral). [ code ]

Jue Wang, Yucheng Lu, Binhang Yuan, Beidi Chen, Percy Liang, Christopher De Sa, Christopher Ré, Ce Zhang. " CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks ". ICML 2023.

Binhang Yuan, Yongjun He, Jared Quincy Davis, Tianyi Zhang, Tri Dao, Beidi Chen, Percy Liang, Christopher Ré, Ce Zhang. " Decentralized Training of Foundation Models in Heterogeneous Environments". NeurIPS 2022. (Oral).

Jue Wang, Binhang Yuan, Luka Rimanic, Yongjun He, Tri Dao, Beidi Chen, Christopher Ré, Ce Zhang. " Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees". NeurIPS 2022.

Tri Dao, Beidi Chen, Nimit Sohoni, Arjun Desai, Michael Poli, Jessica Grogan, Alexander Liu, Aniruddh Rao, Atri Rudra, Christopher Ré. " Monarch: Expressive Structured Matrices for Efficient and Accurate Training". ICML 2022. (Long Talk, Outstanding Paper Runner Up). [ code ][ poster ]

Beidi Chen*, Tri Dao*, Kaizhao Liang, Jiaming Yang, Zhao Song, Atri Rudra, Christopher Ré. " Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models". In Proceedings of International Conference on Learning Representations, ICLR 2022 (Spotlight). [video][ code ]

Zichang Liu, Zhaozhuo Xu, Alan Ji, Junyan Zhang, Jonathan Li, Beidi Chen, Anshumali Shrivastava. "HALOS: Hashing Large Output Space for Cheap Inference". In Proceedings of the 5rd Conference on Machine Learning and Systems, MLSys 2022.

Beidi Chen*, Tri Dao*, Eric Winsor, Zhao Song, Atri Rudra, Christopher Ré. " Scatterbrain: Unifying Sparse and Low-rank Attention Approximation ". In Neural Information Processing Systems, NeurIPS 2021. [ code ]

Beidi Chen, Zichang Liu, Binghui Peng, Zhaozhuo Xu, Jonathan Lingjie Li, Tri Dao, Zhao Song, Anshumali Shrivastava, Christopher Re. "MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training". In Proceedings of International Conference on Learning Representations, ICLR 2021. (Oral) [video][ code]

Zhaozhuo Xu, Beidi Chen, Chaojian Li, Weiyang Liu, Le Song, Yingyan Lin, and Anshumali Shrivastava. " Locality Sensitive Teaching". In Neural Information Processing Systems, NeurIPS 2021.

Shabnam Daghaghi, Nicholas Meisburger, Mengnan Zhao, Beidi Chen, Tharun Medini, and Anshumali Shrivastava. "A Tale of Two Efficient and Informative Negative Sampling Distributions". In Proceedings of International Conference on Machine Learning, ICML 2021. (Long Talk)

Tharun Medini, Beidi Chen, Anshumali Shrivastava. "SOLAR: Sparse Orthogonal Learned and Random Embeddings". In Proceedings of International Conference on Learning Representations, ICLR 2021.

Beidi Chen, Weiyang Liu, Animesh Garg, Zhiding Yu, Anshumali Shrivastava, Jan Kautz, Anima Anandkumar. "Angular Visual Hardness". In Proceedings of the 37th International Conference on Machine Learning, ICML 2020. [Video]

Beidi Chen, Tharun Medini, James Farwell, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava. "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems". In Proceedings of the 3rd Conference on Machine Learning and Systems, MLSys 2020.

Beidi Chen, Yingchen Xu, and Anshumali Shrivastava. "LGD: Fast and Accurate Stochastic Gradient Estimation". In Neural Information Processing Systems, NeurIPS 2019. (LSH-Sampling Breaks the Computational Chicken-and-Egg Loop in Adaptive Stochastic Gradient Estimation in ICLR 2018 Workshop)

Beidi Chen, M. Sadegh Riazi, Anshumali Shrivastava, DanWallach, Farinaz Koushanfar. "Sub-linear Privacy-preserving Search with Untrusted Server and Semi-honest Parties". Manuscript.

Beidi Chen, Anshumali Shrivastava. " Revisiting Winner Take All (WTA) Hashing for Sparse Datasets". In Proceedings of the 34th Conference in Uncertainty in Artificial Intelligence, UAI 2018.

Beidi Chen, Anshumali Shrivastava, Rebecca C. Steorts. "Unique Entity Estimation with Application to the Syrian Conflict". The Annals of Applied Statistics 12.2 (2018). (Also Won IISA 2018 Best Student Paper in Applied Statistics with this paper.)

Kaifei Chen, Siyuan He, Beidi Chen, John Kolb, Randy H. Katz, David E. Culler. "BearLoc: A Composable Distributed Framework for Indoor Localization Systems". In Proceedings of the 2015 Workshop on IoT challenges in Mobile and Industrial Systems, IoT-Sys@MobiSys 2015, pages 7-12, May. 2015. Florence, Italy.

S. Alspaugh, Beidi Chen, Jessica Lin, Archana Ganapathi, Marti Hearst, and Randy Katz. "Analyzing Log Analysis: An Empirical Study of User Log Mining". In Proceedings of the 28th Large Installation System Administration Conference, LISA 2014. (Best Student Paper)

Experience

Research Intern

Microsoft Research, Redmond, WA

May 2019 - Present

Machine Learning Scientist Intern

Nvidia, Santa Clara, CA

Feb 2019 - May 2019

Applied Scientist Intern

A9.com, Palo Alto, CA

May 2018 - Dec 2018

Applied Scientist Intern

Amazon Web Services, Inc., Palo Alto, CA

March 2017 - June 2017

Software Engineering Intern

Apple Inc., Cupertino, CA

Feb 2015 - Aug 2015

Research and Development Intern

VMware Inc., Palo Alto, CA

May 2014 - Aug 2014

Software Engineering Intern

Broadcom Corporation, Sunnyvale, CA

May 2013 - Aug 2013

Education

Stanford University

Postdoc

Computer Science

August 2020 - Present

Rice University

Doctor of Philosophy

Computer Science

August 2015 - May 2020

University of California, Berkeley

Bachelor of Science

Electrical Engineering and Computer Science

August 2011 - May 2015

Wuhan Foreign Languages School

High School

Sep 2008 - June 2011

Honors and Awards

ICML 2022 Outstanding Paper Runner Up
MIT EECS Rising Stars 2021
UIUC EECS Rising Stars 2019
IISA 2018 Best Student Paper Award
Ken Kennedy Institute for Information Technology 2017 Fellowship
USENIX LISA 2014 Best Student Paper Award
ANITA BORG INSTITUTE GHC Scholarship 2014
Qualcomm Undergraduate Experiences in Science & Technology Scholar 2013

Invited Talks

LightOn AI Meetup [webpage]

VMware ML

TWIML AI Podcast [video]

Microsoft Machine Translation Group

Microsoft Rearch Talks [video]

Record Linkage workshop at CIMAT [webpage]

IISA2018 Conference

JSM 2018 Topic-Contributed Session

Activity and Service

Teaching

TA in Comp530 (Database System Implementation)

Rice University, Spring 2018

TA in Comp382 (Reasoning about Algorithms)

Rice University, Fall 2017, 2019

TA in Comp330 (Tools and Models For Data Science)

Rice University, Fall 2016, 2018

TA in EE122 (Introduction to Communication Networks)

UC Berkeley, Spring 2013, 2014

TA in EE40 (Introduction to Microelectronic Circuits)

UC Berkeley, Fall 2014

Service

Reviewer for Science Advances, NeurIPS, ICML, ICLR, AISTATS, AAAI, UAI, AISTATS

Activity

Stanford CS Undergraduate Mentoring Program

Women in Science and Engineering at Stanford

Member of CSters at Rice University

Member of Association of Women in EECS (AWE) at UC Berkeley