[2410.13786] Emphasizing Semantic Consistency of Salient Posture for Speech-Driven Gesture Generation