Research Map

View publications
HPC
Systems
GPU
Kernels
Communication
Optimization
AI
Systems
Networking
for AI
Distributed
Inference
To be
continued...

Performance x Scale

Research Areas

High-Performance Computing Networking for AI Distributed Inference GPU Kernels Communication Optimization Systems for ML

News

View all
  1. May 28 I will be presenting my work “NIMBLE” at IPDPS 2026, New Orleans, LA.
  2. May 26 I will be joining Anyscale as research intern at San Francisco, working on Ray inference. Happy to grab a coffee!
  3. May 18 I will be presenting my work “MAC-Attention” at MLSys 2026, Bellevue, WA.
  4. Aug 14 I have concluded my internship at AI Frameworks team, Microsoft. Big thanks to Masahiro, Sam, and Walid for an extrordinary research experience!

Open Source

GitHub

Research Artifacts

Systems code, experiments, and reproducible project materials.

Interested in AI systems, GPU communication, or distributed inference?

yao.877@osu.edu