[2407.07053] Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model