Homepage -

Photo credit to my lovely girlfriend, Lin ❤️.

Yiwei Chen

I am a graduate student in the Department of Electrical and Computer Engineering at Carnegie Mellon University, fortunately working with Prof. Greg Ganger in Parallel Data Lab and Prof. Zhihao Jia in Catalyst Group.

My research focuses on exploring ways to enhance scalability, performance, and efficiency of large scale data-centric applications from both systems and algorithms perspective.

yiweiche(at)andrew.cmu.edu LinkedIn

Education

Carnegie Mellon University

Department of Electrical and Computer Engineering
Master of Science in Electrical and Computer Engineering

Jan. 2025 - present
University of Wisconsin-Madison

Bachelor of Science in Computer Science

Sep. 2022 - Jul. 2024
University of Waterloo

Bachelor of Mathematics in Combinatorics & Optimization

Sep. 2020 - Jul. 2022

Selected Experiences

Carnegie Mellon University

Research Staff -> Graduate Student Research Assistant

Sept. 2024 - present
Microsoft Research

Research Intern

May 2024 - Aug. 2024
University of Wisconsin-Madison

Undergraduate Student Research Assistant

Jan. 2023 - May 2024
Google

Software Engineer Intern

Jan. 2022 - April. 2022
Ford Motor

Media & USB Infotainment Developer Intern

Sep. 2021 - Dec. 2021

Experience Details

My journey began with operating systems internals. I started by working at Ford, where I customized Android system components relevant to multi-media and USB. Later at Google, I contributed to designing a unified customization interface for OEMs to make such modifications more smoothly and securely. After transferring to UW-Madison, I started to focus on systems research.

My systems researches are primarily about bandwidth issues in modern data-intensive applications. At UW-Madison, I worked on ways to make consensus protocols bandwidth-adaptive, advised by Prof. Remzi Arpaci-Dusseau and Prof. Andrea Arpaci-Dusseau. Currently, I’m working with Prof. Greg Ganger at Carnegie Mellon University’s Parallel Data Lab , where I contribute to re-designing IO interfaces in data center file systems to better utilize bandwidth.

Building on this systems foundation, I have also explored problems at the intersection between systems and machine learning.

On the machine learning side, I was working on improving LLMs’ planning and tool use abilities in software engineering and infra ops domains at Microsoft Research, fortunately supervised by Dr. Xuan Feng and Dr. Sameh Elnikety.

Currently, I’m also collaborating with Prof. Zhihao Jia at Carnegie Mellon University’s Catalyst Group about improving LLM' inference with sparse attention. We are taking an end-to-end approach -- optimize LLMs’ inference from model structure to kernel implementations.

News

2025

I have been serving as a artifact evaluation committee member at FAST '25

Jan 28

Our paper about cloud architecture has been accepted at FAST '25

Jan 18

Selected Publications (view all )

Cloudscape: A Study of Storage Services in Modern Cloud Architectures

Sambhav Satija, Chenhao Ye, Ranjitha Kosgi, Aditya Jain, Romit Kankaria, Yiwei Chen, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

USENIX Conference on File and Storage Technologies 2025

We present Cloudscape, a dataset of nearly 400 cloud architectures deployed on AWS. We perform an in-depth analysis of the usage of storage services in cloud systems. Our findings include: S3 is the most prevalent storage service (68%), while file system services are rare (4%); heterogeneity is common in the storage layer; storage services primarily interface with Lambda and EC2, while also serving as the foundation for more specialized ML and analytics services. Our findings provide a concrete understanding of how storage services are deployed in real-world cloud architectures, and our analysis of the popularity of different services grounds existing research.

[Paper]

Cloudscape: A Study of Storage Services in Modern Cloud Architectures

Sambhav Satija, Chenhao Ye, Ranjitha Kosgi, Aditya Jain, Romit Kankaria, Yiwei Chen, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

USENIX Conference on File and Storage Technologies 2025

[Paper]

Warning

Action required

Education

Selected Experiences

Experience Details

News

Selected Publications (view all )

Cloudscape: A Study of Storage Services in Modern Cloud Architectures

Cloudscape: A Study of Storage Services in Modern Cloud Architectures

All publications