[2407.03010] Context-Aware Video Instance Segmentation