[2310.20607] What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning