[2212.00280] GRiT: A Generative Region-to-text Transformer for Object Understanding