[2405.11299] The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving