default search action
Gregory Farquhar
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c19]Matthew Thomas Jackson, Minqi Jiang, Jack Parker-Holder, Risto Vuorio, Chris Lu, Gregory Farquhar, Shimon Whiteson, Jakob N. Foerster:
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design. NeurIPS 2023 - [i19]Matthew Thomas Jackson, Minqi Jiang, Jack Parker-Holder, Risto Vuorio, Chris Lu, Gregory Farquhar, Shimon Whiteson, Jakob Nicolaus Foerster:
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design. CoRR abs/2310.02782 (2023) - 2022
- [c18]Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram L. Friesen, Feryal M. P. Behbahani, Tom Schaul, André Barreto, Simon Osindero:
Model-Value Inconsistency as a Signal for Epistemic Uncertainty. ICML 2022: 6474-6498 - [i18]Risto Vuorio, Jacob Beck, Shimon Whiteson, Jakob N. Foerster, Gregory Farquhar:
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients. CoRR abs/2209.11303 (2022) - 2021
- [c17]Maximilian Igl, Gregory Farquhar, Jelena Luketina, Wendelin Boehmer, Shimon Whiteson:
Transient Non-stationarity and Generalisation in Deep Reinforcement Learning. ICLR 2021 - [c16]Angelos Filos, Clare Lyle, Yarin Gal, Sergey Levine, Natasha Jaques, Gregory Farquhar:
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning. ICML 2021: 3305-3317 - [c15]Gregory Farquhar, Kate Baumli, Zita Marinho, Angelos Filos, Matteo Hessel, Hado Philip van Hasselt, David Silver:
Self-Consistent Models and Values. NeurIPS 2021: 1111-1125 - [c14]Christopher Grimm, André Barreto, Gregory Farquhar, David Silver, Satinder Singh:
Proper Value Equivalence. NeurIPS 2021: 7773-7786 - [i17]Angelos Filos, Clare Lyle, Yarin Gal, Sergey Levine, Natasha Jaques, Gregory Farquhar:
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning. CoRR abs/2102.12560 (2021) - [i16]Christopher Grimm, André Barreto, Gregory Farquhar, David Silver, Satinder Singh:
Proper Value Equivalence. CoRR abs/2106.10316 (2021) - [i15]Gregory Farquhar, Kate Baumli, Zita Marinho, Angelos Filos, Matteo Hessel, Hado van Hasselt, David Silver:
Self-Consistent Models and Values. CoRR abs/2110.12840 (2021) - [i14]Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram L. Friesen, Feryal M. P. Behbahani, Tom Schaul, André Barreto, Simon Osindero:
Model-Value Inconsistency as a Signal for Epistemic Uncertainty. CoRR abs/2112.04153 (2021) - 2020
- [j1]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. J. Mach. Learn. Res. 21: 178:1-178:51 (2020) - [c13]Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve:
Growing Action Spaces. ICML 2020: 3040-3051 - [c12]Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson:
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. NeurIPS 2020 - [i13]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. CoRR abs/2003.08839 (2020) - [i12]Maximilian Igl, Gregory Farquhar, Jelena Luketina, Wendelin Boehmer, Shimon Whiteson:
The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning. CoRR abs/2006.05826 (2020) - [i11]Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson:
Weighted QMIX: Expanding Monotonic Value Function Factorisation. CoRR abs/2006.10800 (2020)
2010 – 2019
- 2019
- [c11]Mikayel Samvelyan, Tabish Rashid, Christian Schröder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob N. Foerster, Shimon Whiteson:
The StarCraft Multi-Agent Challenge. AAMAS 2019: 2186-2188 - [c10]Jingkai Mao, Jakob N. Foerster, Tim Rocktäschel, Maruan Al-Shedivat, Gregory Farquhar, Shimon Whiteson:
A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs. ICML 2019: 4343-4351 - [c9]Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob N. Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktäschel:
A Survey of Reinforcement Learning Informed by Natural Language. IJCAI 2019: 6309-6317 - [c8]Gregory Farquhar, Shimon Whiteson, Jakob N. Foerster:
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning. NeurIPS 2019: 8149-8160 - [c7]Christian Schröder de Witt, Jakob N. Foerster, Gregory Farquhar, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson:
Multi-Agent Common Knowledge Reinforcement Learning. NeurIPS 2019: 9924-9935 - [i10]Mikayel Samvelyan, Tabish Rashid, Christian Schröder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob N. Foerster, Shimon Whiteson:
The StarCraft Multi-Agent Challenge. CoRR abs/1902.04043 (2019) - [i9]Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob N. Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktäschel:
A Survey of Reinforcement Learning Informed by Natural Language. CoRR abs/1906.03926 (2019) - [i8]Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve:
Growing Action Spaces. CoRR abs/1906.12266 (2019) - [i7]Gregory Farquhar, Shimon Whiteson, Jakob N. Foerster:
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning. CoRR abs/1909.10549 (2019) - 2018
- [c6]Jakob N. Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson:
Counterfactual Multi-Agent Policy Gradients. AAAI 2018: 2974-2982 - [c5]Gregory Farquhar, Tim Rocktäschel, Maximilian Igl, Shimon Whiteson:
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning. ICLR (Poster) 2018 - [c4]Jakob N. Foerster, Gregory Farquhar, Maruan Al-Shedivat, Tim Rocktäschel, Eric P. Xing, Shimon Whiteson:
DiCE: The Infinitely Differentiable Monte-Carlo Estimator. ICLR (Workshop) 2018 - [c3]Jakob N. Foerster, Gregory Farquhar, Maruan Al-Shedivat, Tim Rocktäschel, Eric P. Xing, Shimon Whiteson:
DiCE: The Infinitely Differentiable Monte Carlo Estimator. ICML 2018: 1524-1533 - [c2]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. ICML 2018: 4292-4301 - [i6]Jakob N. Foerster, Gregory Farquhar, Maruan Al-Shedivat, Tim Rocktäschel, Eric P. Xing, Shimon Whiteson:
DiCE: The Infinitely Differentiable Monte-Carlo Estimator. CoRR abs/1802.05098 (2018) - [i5]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. CoRR abs/1803.11485 (2018) - [i4]Jakob N. Foerster, Christian A. Schröder de Witt, Gregory Farquhar, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson:
Multi-Agent Common Knowledge Reinforcement Learning. CoRR abs/1810.11702 (2018) - 2017
- [c1]Jakob N. Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson:
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning. ICML 2017: 1146-1155 - [i3]Jakob N. Foerster, Nantas Nardelli, Gregory Farquhar, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson:
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning. CoRR abs/1702.08887 (2017) - [i2]Jakob N. Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson:
Counterfactual Multi-Agent Policy Gradients. CoRR abs/1705.08926 (2017) - [i1]Gregory Farquhar, Tim Rocktäschel, Maximilian Igl, Shimon Whiteson:
TreeQN and ATreeC: Differentiable Tree Planning for Deep Reinforcement Learning. CoRR abs/1710.11417 (2017)
Coauthor Index
aka: Jakob Nicolaus Foerster
aka: Christian A. Schröder de Witt
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-13 00:43 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint