[2407.00611] WallFacer: Harnessing Multi-dimensional Ring Parallelism for Efficient Long Sequence Model Training