[2410.11778] On the Training Convergence of Transformers for In-Context Classification