[2407.19546] XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training