My paper “Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference” has been accepted by IPDPS’24.