Abbeel, Pieter, and Andrew Y. Ng. "Apprenticeship Learning via Inverse Reinforcement Learning." ICML 2004.
Abbeel, Pieter, and Andrew Y. Ng. "Exploration and Apprenticeship Learning in Reinforcement Learning." ICML 2005.
- http://machinelearning.wustl.edu/mlpapers/paper_files/icml2005_AbbeelN05.pdf "KJ"
Ramachandran, Deepak, and Eyal Amir. "Bayesian Inverse Reinforcement Learning." AAAI 2007.
- http://www.aaai.org/Papers/IJCAI/2007/IJCAI07-416.pdf
Peters, Jan, and Stefan Schaal. "Reinforcement learning of motor skills with policy gradients." Neural networks 21.4 (2008)
- http://www.keck.ucsf.edu/~houde/sensorimotor_jc/possible_papers/JPeters08a.pdf
Brian D. Ziebart, Andrew Maas, J.Andrew Bagnell, and Anind K. Dey, "Maximum Entropy Inverse Reinforcement Learning." AAAI. 2008.
- http://www.aaai.org/Papers/AAAI/2008/AAAI08-227.pdf
Daumé Iii, Hal, John Langford, and Daniel Marcu. "Search-based structured prediction." Machine learning 75.3 (2009): 297-325.
- http://arxiv.org/pdf/0907.0786.pdf "KJ"
Ross, Stéphane, Geoffrey J. Gordon, and J. Andrew Bagnell. "A reduction of imitation learning and structured prediction to no-regret online learning." arXiv preprint arXiv:1011.0686 (2010).
- http://arxiv.org/pdf/1011.0686.pdf "KJ"
Scherrer, Bruno, et al. "Approximate modified policy iteration." arXiv preprint arXiv:1205.3054 (2012).
- http://arxiv.org/pdf/1205.3054.pdf
Dimitrakakis, Christos, and Constantin A. Rothkopf. "Bayesian Multitask Inverse Reinforcement Learning." Recent Advances in Reinforcement Learning. 2012
- http://arxiv.org/pdf/1106.3655.pdf
Choi, Jaedeug, and Kee-Eung Kim. "Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions." NIPS 2012.
- http://papers.nips.cc/paper/4737-nonparametric-bayesian-inverse-reinforcement-learning-for-multiple-reward-functions.pdf "KJ"
Levine, Sergey, and Vladlen Koltun. "Guided Policy Search." ICML 2013.
- http://jmlr.org/proceedings/papers/v28/levine13.pdf
Ross, Stéphane. Interactive Learning for Sequential Decisions and Predictions. Diss. CARNEGIE MELLON UNIVERSITY, 2013.
- http://www.cs.cmu.edu/~sross1/phd_thesis.pdf
Ross, Stéphane, and J. Andrew Bagnell. "Reinforcement and imitation learning via interactive no-regret learning." arXiv preprint arXiv:1406.5979 (2014).
- http://arxiv.org/pdf/1406.5979.pdf "KJ"
Guo, Xiaoxiao, Satinder Singh, Honglak Lee, Richard Lewis, Xiaoshi Wang. "Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning." Advances in Neural Information Processing Systems. 2014.
- http://papers.nips.cc/paper/5421-deep-learning-for-real-time-atari-game-play-using-offline-monte-carlo-tree-search-planning.pdf
Levine, Sergey, and Vladlen Koltun. "Learning complex neural network policies with trajectory optimization." ICML 2014.
- http://graphics.stanford.edu/projects/cgpspaper/cgps.pdf
- http://graphics.stanford.edu/projects/cgpspaper/index.htm
Levine, Sergey, and Pieter Abbeel. "Learning neural network policies with guided policy search under unknown dynamics." NIPS 2014.
- http://www.eecs.berkeley.edu/~svlevine/papers/mfcgps.pdf "KJ"
Mordatch, Igor, and Emanuel Todorov. "Combining the benefits of function approximation and trajectory optimization." Robotics: Science and Systems (RSS). 2014.
- http://www.roboticsproceedings.org/rss10/p52.pdf "KJ"
Levine, Sergey, Nolan Wagener, and Pieter Abbeel. "Learning Contact-Rich Manipulation Skills with Guided Policy Search." ICRA 2015.
- http://arxiv.org/pdf/1501.05611v2.pdf
Dylan Hadfield-Menell, Edward Groshev, Rohan Chitnis, and Pieter Abbeel. "Modular Task and Motion Planning in Belief Space." IROS 2015
- http://www.cs.berkeley.edu/~pabbeel/papers/2015-IROS-TMP-belief-space.pdf "KJ"
Han, Weiqiao, Sergey Levine, and Pieter Abbeel. "Learning Compound Multi-Step Controllers under Unknown Dynamics.", IROS 2015
- http://rll.berkeley.edu/reset_controller/reset_controller.pdf
- http://rll.berkeley.edu/reset_controller/
Alex X. Lee Abhishek Gupta Henry Lu Sergey Levine Pieter Abbeel. "Learning from Multiple Demonstrations using Trajectory-Aware Non-Rigid Registration with Applications to Deformable Object Manipulation." IROS 2015
- http://www.cs.berkeley.edu/~pabbeel/papers/2015-IROS-trajectory-aware-registration.pdf "KJ"
Sergey Levine, Chelsea Finn, Trevor Darrell, Pieter Abbeel "End-to-End Training of Deep Visuomotor Policies." arXiv preprint arXiv:1504.00702 (2015).
- http://arxiv.org/pdf/1504.00702v1.pdf
John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel. "Trust Region Policy Optimization." arXiv preprint arXiv:1502.05477 (2015).
- http://arxiv.org/pdf/1502.05477v3.pdf
Fu, Justin, Sergey Levine, and Pieter Abbeel. "One-Shot Learning of Manipulation Skills with Online Dynamics Adaptation and Neural Network Priors." arXiv preprint arXiv:1509.06841 (2015).
- http://rll.berkeley.edu/icra2016onlinecontrol/online_control.pdf
Chelsea Finn, Xin Yu Tan, Yan Duan, Trevor Darrell, Sergey Levine, Pieter Abbeel "Learning Visual Feature Spaces for Robotic Manipulation with Deep Spatial Autoencoders." arXiv preprint arXiv:1509.06113 (2015).
- http://arxiv.org/pdf/1509.06113v1.pdf
Marvin Zhang, Zoe McCarthy, Chelsea Finn, Sergey Levine, Pieter Abbeel. "Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search." arXiv preprint arXiv:1509.06791 (2015).
- http://rll.berkeley.edu/icra2016mpcgps/ICRA16_MPCGPS "KJ"
Christopher Xie, Sachin Patil, Teodor Moldovan, Sergey Levine, Pieter Abbeel "Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration." arXiv preprint arXiv:1509.06824 (2015).
- http://arxiv.org/pdf/1509.06824v1.pdf "KJ"
'Enginius > Robotics' 카테고리의 다른 글
Learning from Interactions (LfI) (0) | 2015.10.30 |
---|---|
Theoretical Analysis of Behavior Cloning (0) | 2015.10.10 |
눈이 올 때, 교통 사고를 줄여보자! (0) | 2015.08.21 |
ICRA Learning-related paper survery (0) | 2015.07.31 |
Robotics in Germany (1) | 2015.06.16 |