[2305.05948] Multi-Path Transformer is Better: A Case Study on Neural Machine Translation