For reference, we have compiled a list of some important publications on the topic of structure and priors in reinforcement learning (RL). Please make a pull request at the spirl-readings repository or email us at organizers@spirl.info if there's relevant work that could be added to the list!
Thanks to Michael Janner for contributing!
- [@trommershauser2008decision]
- [@diuk2013divide]
- [@solway2014optimal]
- [@boureau2015deciding]
- [@gershman2015novelty]
- [@lieder2017strategy]
- [@momennejad2017successor]
- [@dubey2018investigating]
- [@konidaris2019necessity]
- [@schultz1997neural]
- [@botvinick2009hierarchically]
- [@ribas2011neural]
- [@gershman2018successor]
- [@dayan1992feudal]
- [@sutton1999between]
- [@parr1997reinforcement]
- [@dietterich2000hierarchical]
- [@levy2011unified]
- [@bacon2017option]
- [@vezhnevets2017feudal]
- [@duan2016rl]
- [@wang2016learning]
- [@duan2017meta]
- [@finn2017model]
- [@frans2018meta]
- [@gupta2018meta]
- [@saemundsson2018meta]
- [@singh1992transfer]
- [@heess2016learning]
- [@andreas2017modular]
- [@devin2017learning]
- [@hausman2017multi]
- [@ghosh2018divide]
- [@hausman2018learning]
- [@johannink2018residual]
- [@chang2019automatically]
- [@wingate2011bayesian]
- [@ghavamzadeh2015bayesian]
- [@osband2018randomized]
- [@thrun1994finding]
- [@sutton1995td]
- [@littman2001predictive]
- [@ponsen2009abstraction]
- [@sutton2011horde]
- [@schaul2015universal]
- [@tamar2016value]
- [@silver2017predictron]
- [@ok2018exploration]
- [@ganin2018synthesizing]
- [@sanchez2018graph]
- [@taylor2009transfer]
- [@abel2018policy]