[2305.07910] Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval