[2306.07542] A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management