Related Work

For reference, we have compiled a list of important publications on structure and priors in reinforcement learning (RL). If there is relevant work that should be added to the list, please open a pull request at the spirl-readings repository or email us at organizers@spirl.info!

Thanks to Michael Janner for contributing!

Cognitive Science

  • [@trommershauser2008decision]
  • [@diuk2013divide]
  • [@solway2014optimal]
  • [@boureau2015deciding]
  • [@gershman2015novelty]
  • [@lieder2017strategy]
  • [@momennejad2017successor]
  • [@dubey2018investigating]
  • [@konidaris2019necessity]

Neuroscience

  • [@schultz1997neural]
  • [@botvinick2009hierarchically]
  • [@ribas2011neural]
  • [@gershman2018successor]

Hierarchical RL

  • [@dayan1992feudal]
  • [@sutton1999between]
  • [@parr1997reinforcement]
  • [@dietterich2000hierarchical]
  • [@levy2011unified]
  • [@bacon2017option]
  • [@vezhnevets2017feudal]

Meta-RL

  • [@duan2016rl]
  • [@wang2016learning]
  • [@duan2017meta]
  • [@finn2017model]
  • [@frans2018meta]
  • [@gupta2018meta]
  • [@saemundsson2018meta]

Modularity in RL

  • [@singh1992transfer]
  • [@heess2016learning]
  • [@andreas2017modular]
  • [@devin2017learning]
  • [@hausman2017multi]
  • [@ghosh2018divide]
  • [@hausman2018learning]
  • [@johannink2018residual]
  • [@chang2019automatically]

Priors and Bayesian RL

  • [@wingate2011bayesian]
  • [@ghavamzadeh2015bayesian]
  • [@osband2018randomized]

Structure in RL

  • [@thrun1994finding]
  • [@sutton1995td]
  • [@littman2001predictive]
  • [@ponsen2009abstraction]
  • [@sutton2011horde]
  • [@schaul2015universal]
  • [@tamar2016value]
  • [@silver2017predictron]
  • [@ok2018exploration]
  • [@ganin2018synthesizing]
  • [@sanchez2018graph]

Transfer, Multi-Task and Lifelong RL

  • [@taylor2009transfer]
  • [@abel2018policy]