[2307.02628] SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference