[2311.01767] PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion