dblp: Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management.

"Learning in Structured MDPs with Convex Cost Functions: Improved Regret ..."

Shipra Agrawal, Randy Jia (2019)

Details and statistics

DOI: 10.1145/3328526.3329565

access: closed

type: Conference or Workshop Paper

metadata version: 2019-06-26

Open Alex

Please note: Providing information about references and citations is only possible thanks to to the open metadata APIs provided by crossref.org and opencitations.net. If citation data of your publications is not openly available yet, then please consider asking your publisher to release your citation data to the public. For more information please see the Initiative for Open Citations (I4OC). Please also note that there is no way of submitting missing references or citation data directly to dblp.

Please also note that this feature is work in progress and that it is still far from being perfect. That is, in particular,

JavaScript is requires in order to retrieve and display any references and citations for this record.