[2401.10005] Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation