Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Mixing Human Demonstrations with Self-Exploration in Experience Replay for Deep Reinforcement Learning (2107.06840v1)

Published 14 Jul 2021 in cs.AI

Abstract: We investigate the effect of using human demonstration data in the replay buffer for Deep Reinforcement Learning. We use a policy gradient method with a modified experience replay buffer where a human demonstration experience is sampled with a given probability. We analyze different ratios of using demonstration data in a task where an agent attempts to reach a goal while avoiding obstacles. Our results suggest that while the agents trained by pure self-exploration and pure demonstration had similar success rates, the pure demonstration model converged faster to solutions with less number of steps.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Dylan Klein (1 paper)
  2. Akansel Cosgun (59 papers)

Summary

We haven't generated a summary for this paper yet.