awesome-deeplearning-resources

deep reinforcement learning

return to home

papers

Asynchronous Methods for Deep Reinforcement Learning. [arxiv] :star:
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto, et al., ICLR. [arxiv]
A New Softmax Operator for Reinforcement Learning.[url]
Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al., ICML. [arxiv]
Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al., ICLR. [arxiv]
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., ICLR. [arxiv]
Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al., arXiv. [url]
Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al., ICML. [arxiv]
Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al., ICML. [arxiv]
Continuous control with deep reinforcement learning. [arxiv] :star:
Deep Successor Reinforcement Learning. [arxiv]
Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al., IJCAI Deep RL Workshop. [arxiv]
Deep Exploration via Bootstrapped DQN. [arxiv] :star:
Deep Reinforcement Learning for Dialogue Generation. [arxiv] tensorflow
Deep Reinforcement Learning in Parameterized Action Space. [arxiv] :star:
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments.[url]
Designing Neural Network Architectures using Reinforcement Learning. arxiv code
Dialogue manager domain adaptation using Gaussian process reinforcement learning. [arxiv]
End-to-End Reinforcement Learning of Dialogue Agents for Information Access. [arxiv]
Generating Text with Deep Reinforcement Learning. [arxiv]
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al., arXiv. [arxiv]
Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al., arXiv. [arxiv]
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv. [arxiv]
Hierarchical Object Detection with Deep Reinforcement Learning. [arxiv]
High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al., ICLR. [arxiv]
Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al., AAAI. [arxiv]
Interactive Spoken Content Retrieval by Deep Reinforcement Learning. [arxiv]
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al., arXiv. [url]
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al., arXiv. [url]
Learning to compose words into sentences with reinforcement learning. [url]
Loss is its own Reward: Self-Supervision for Reinforcement Learning.[arxiv]
Model-Free Episodic Control. [arxiv]
Mastering the game of Go with deep neural networks and tree search. [nature] :star:
MazeBase: A Sandbox for Learning from Games .[arxiv]
Neural Architecture Search with Reinforcement Learning. [pdf]
Neural Combinatorial Optimization with Reinforcement Learning. [arxiv]
Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning. [url]
Online Sequence-to-Sequence Active Learning for Open-Domain Dialogue Generation. arXiv. [arxiv]
Policy Distillation, A. A. Rusu et at., ICLR. [arxiv]
Prioritized Experience Replay. [arxiv] :star:
Reinforcement Learning Using Quantum Boltzmann Machines. [arxiv]
Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al.[arxiv]
Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving. [arxiv]
Sample-efficient Deep Reinforcement Learning for Dialog Control. [url]
Self-Correcting Models for Model-Based Reinforcement Learning.[url]
Unifying Count-Based Exploration and Intrinsic Motivation. [arxiv]
Value Iteration Networks. [arxiv]

This site is open source. Improve this page.