[2303.11525] Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency