Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Meta Adaptation using Importance Weighted Demonstrations (1911.10322v2)

Published 23 Nov 2019 in cs.LG, cs.AI, and stat.ML

Abstract: Imitation learning has gained immense popularity because of its high sample-efficiency. However, in real-world scenarios, where the trajectory distribution of most of the tasks dynamically shifts, model fitting on continuously aggregated data alone would be futile. In some cases, the distribution shifts, so much, that it is difficult for an agent to infer the new task. We propose a novel algorithm to generalize on any related task by leveraging prior knowledge on a set of specific tasks, which involves assigning importance weights to each past demonstration. We show experiments where the robot is trained from a diversity of environmental tasks and is also able to adapt to an unseen environment, using few-shot learning. We also developed a prototype robot system to test our approach on the task of visual navigation, and experimental results obtained were able to confirm these suppositions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (37)
  1. “Global overview of Imitation Learning” In CoRR abs/1801.06503, 2018 arXiv: http://arxiv.org/abs/1801.06503
  2. “End to End Learning for Self-Driving Cars” In CoRR abs/1604.07316, 2016 arXiv: http://arxiv.org/abs/1604.07316
  3. Sonia Chernova and Manuela M. Veloso “Confidence-based policy learning from demonstration using Gaussian mixture models” In 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), Honolulu, Hawaii, USA, May 14-18, 2007, 2007, pp. 233 DOI: 10.1145/1329125.1329407
  4. “End-to-End Driving Via Conditional Imitation Learning” In 2018 IEEE International Conference on Robotics and Automation, ICRA 2018, Brisbane, Australia, May 21-25, 2018, 2018, pp. 1–9 DOI: 10.1109/ICRA.2018.8460487
  5. “One-Shot Imitation Learning” In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA, 2017, pp. 1087–1098 URL: http://papers.nips.cc/paper/6709-one-shot-imitation-learning
  6. “Model-based imitation learning by probabilistic trajectory matching” In 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, May 6-10, 2013, 2013, pp. 1922–1927 DOI: 10.1109/ICRA.2013.6630832
  7. Chelsea Finn, Pieter Abbeel and Sergey Levine “Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks” In Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017, 2017, pp. 1126–1135 URL: http://proceedings.mlr.press/v70/finn17a.html
  8. “One-Shot Visual Imitation Learning via Meta-Learning” In 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, California, USA, November 13-15, 2017, Proceedings, 2017, pp. 357–368 URL: http://proceedings.mlr.press/v78/finn17a.html
  9. Roy Fox, Ari Pakman and Naftali Tishby “Taming the Noise in Reinforcement Learning via Soft Updates” In Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, UAI 2016, June 25-29, 2016, New York City, NY, USA, 2016 URL: http://auai.org/uai2016/proceedings/papers/219.pdf
  10. “Reinforcement Learning from Imperfect Demonstrations” In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Workshop Track Proceedings, 2018 URL: https://openreview.net/forum?id=HytbCQG8z
  11. “Lightweight Learner for Shared Knowledge Lifelong Learning” In CoRR abs/2305.15591, 2023 DOI: 10.48550/arXiv.2305.15591
  12. “Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning” In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings, 2017 URL: https://openreview.net/forum?id=Hyq4yhile
  13. “Recurrent World Models Facilitate Policy Evolution” In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3-8 December 2018, Montréal, Canada., 2018, pp. 2455–2467 URL: http://papers.nips.cc/paper/7512-recurrent-world-models-facilitate-policy-evolution
  14. He He, Hal Daumé III and Jason Eisner “Imitation Learning by Coaching” In Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012, Lake Tahoe, Nevada, United States., 2012, pp. 3158–3166 URL: http://papers.nips.cc/paper/4545-imitation-learning-by-coaching
  15. “Deep Q-learning From Demonstrations” In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), New Orleans, Louisiana, USA, February 2-7, 2018, 2018, pp. 3223–3230 URL: https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16976
  16. “Learning from Demonstrations for Real World Reinforcement Learning” In CoRR abs/1704.03732, 2017 arXiv: http://arxiv.org/abs/1704.03732
  17. “Evolved Policy Gradients” In CoRR abs/1802.04821, 2018 arXiv: http://arxiv.org/abs/1802.04821
  18. Bingyi Kang, Zequn Jie and Jiashi Feng “Policy Optimization with Demonstrations” In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, 2018, pp. 2474–2483 URL: http://proceedings.mlr.press/v80/kang18a.html
  19. “DART: Noise Injection for Robust Imitation Learning” In 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, California, USA, November 13-15, 2017, Proceedings, 2017, pp. 143–156 URL: http://proceedings.mlr.press/v78/laskey17a.html
  20. “Hierarchical Imitation and Reinforcement Learning” In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, 2018, pp. 2923–2932 URL: http://proceedings.mlr.press/v80/le18a.html
  21. Kiran Kumar Lekkala and Vinay Kumar Mittal “Accurate and augmented navigation for quadcopter based on multi-sensor fusion” In 2016 IEEE Annual India Conference (INDICON), 2016, pp. 1–6 IEEE
  22. Kiran Kumar Lekkala and Vinay Kumar Mittal “Artificial intelligence for precision movement robot” In 2015 2nd International Conference on Signal Processing and Integrated Networks (SPIN), 2015, pp. 378–383 IEEE
  23. Kiran Kumar Lekkala and Vinay Kumar Mittal “PID controlled 2D precision robot” In 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT), 2014, pp. 1141–1145 IEEE
  24. “An Algorithmic Perspective on Imitation Learning” In Foundations and Trends in Robotics 7.1-2, 2018, pp. 1–179 DOI: 10.1561/2300000053
  25. “Agile Autonomous Driving using End-to-End Deep Imitation Learning” In Robotics: Science and Systems XIV, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, June 26-30, 2018, 2018 DOI: 10.15607/RSS.2018.XIV.056
  26. “Curiosity-driven Exploration by Self-supervised Prediction” In International Conference on Machine Learning (ICML), 2017
  27. “Dataset Shift in Machine Learning” The MIT Press, 2009
  28. “Learning to Reweight Examples for Robust Deep Learning” In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, 2018, pp. 4331–4340 URL: http://proceedings.mlr.press/v80/ren18a.html
  29. Stéphane Ross, Geoffrey J. Gordon and Drew Bagnell “A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning” In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011, Fort Lauderdale, USA, April 11-13, 2011, 2011, pp. 627–635 URL: http://proceedings.mlr.press/v15/ross11a/ross11a.pdf
  30. “Meta learning Framework for Automated Driving” In CoRR abs/1706.04038, 2017 arXiv: http://arxiv.org/abs/1706.04038
  31. Jürgen Schmidhuber “Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990-2010)” In IEEE Trans. Autonomous Mental Development 2.3, 2010, pp. 230–247 DOI: 10.1109/TAMD.2010.2056368
  32. Jürgen Schmidhuber “Reinforcement Learning with Interacting Continually Running Fully Recurrent Networks” In International Neural Network Conference: July 9–13, 1990 Palais Des Congres — Paris — France Dordrecht: Springer Netherlands, 1990, pp. 817–820 DOI: 10.1007/978-94-009-0643-3˙97
  33. “Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction” In Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017, 2017, pp. 3309–3318 URL: http://proceedings.mlr.press/v70/sun17d.html
  34. “Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning” In CoRR abs/1812.00971, 2018 arXiv: http://arxiv.org/abs/1812.00971
  35. “Shared Multi-Task Imitation Learning for Indoor Self-Navigation” In IEEE Global Communications Conference, GLOBECOM 2018, Abu Dhabi, United Arab Emirates, December 9-13, 2018, 2018, pp. 1–7 DOI: 10.1109/GLOCOM.2018.8647614
  36. “Ferroelectric fet based context-switching fpga enabling dynamic reconfiguration for adaptive deep learning machines” In arXiv preprint arXiv:2212.00089, 2022
  37. “One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks” In CoRR abs/1810.11043, 2018 arXiv: http://arxiv.org/abs/1810.11043
Citations (2)

Summary

We haven't generated a summary for this paper yet.