dblp: Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch.

"Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor ..."

Shangtong Zhang, Remi Tachet des Combes, Romain Laroche (2021)

SPARQL queries 

Details and statistics

DOI:

access: open

type: Informal or Other Publication

metadata version: 2021-11-05

Open Alex

Please note: Providing information about references and citations is only possible thanks to to the open metadata APIs provided by crossref.org and opencitations.net. If citation data of your publications is not openly available yet, then please consider asking your publisher to release your citation data to the public. For more information please see the Initiative for Open Citations (I4OC). Please also note that there is no way of submitting missing references or citation data directly to dblp.

Please also note that this feature is work in progress and that it is still far from being perfect. That is, in particular,

JavaScript is requires in order to retrieve and display any references and citations for this record.