[2305.06942] Optimizing Distributed ML Communication with Fused Computation-Collective Operations