[2306.05424] Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models