Research Map
HPC
Systems
Systems
GPU
Kernels
Kernels
Communication
Optimization
Optimization
AI
Systems
Systems
Networking
for AI
for AI
Distributed
Inference
Inference
To be
continued...
continued...
Performance x Scale
Performance x Scale
MLSys 2026 a Match-Amend-Complete Scheme for Fast and Accurate Attention Computation
IPDPS 2026 From Skew to Symmetry: Node-Interconnect Multi-Path Balancing with Execution-time Planning for Modern GPU Clusters
Training ultra long context language model with fully pipelined distributed transformer
IPDPS 2024 Inter-layer expert affinity for accelerating Mixture-of-Experts inference.
Systems code, experiments, and reproducible project materials.
Interested in AI systems, GPU communication, or distributed inference?
yao.877@osu.edu