Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Common Challenges of Deep Reinforcement Learning Applications Development: An Empirical Study (2310.09575v3)

Published 14 Oct 2023 in cs.SE

Abstract: Machine Learning (ML) is increasingly being adopted in different industries. Deep Reinforcement Learning (DRL) is a subdomain of ML used to produce intelligent agents. Despite recent developments in DRL technology, the main challenges that developers face in the development of DRL applications are still unknown. To fill this gap, in this paper, we conduct a large-scale empirical study of 927 DRL-related posts extracted from Stack Overflow, the most popular Q&A platform in the software community. Through the process of labeling and categorizing extracted posts, we created a taxonomy of common challenges encountered in the development of DRL applications, along with their corresponding popularity levels. This taxonomy has been validated through a survey involving 65 DRL developers. Results show that at least 45% of developers experienced 18 of the 21 challenges identified in the taxonomy. The most frequent source of difficulty during the development of DRL applications are Comprehension, API usage, and Design problems, while Parallel processing, and DRL libraries/frameworks are classified as the most difficult challenges to address, with respect to the time required to receive an accepted answer. We hope that the research community will leverage this taxonomy to develop efficient strategies to address the identified challenges and improve the quality of DRL applications.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (75)
  1. In: 2019 acm/ieee international symposium on empirical software engineering and measurement (esem), pp. 1–11. IEEE (2019)
  2. Aradi, S.: Survey of deep reinforcement learning for motion planning of autonomous vehicles. IEEE Transactions on Intelligent Transportation Systems 23(2), 740–759 (2020)
  3. In: 2018 44th euromicro conference on software engineering and advanced applications (SEAA), pp. 50–59. IEEE (2018)
  4. IEEE Signal Processing Magazine 34(6), 26–38 (2017)
  5. arXiv preprint arXiv:1905.08833 (2019)
  6. IEEE Transactions on Software Engineering 48(8), 2836–2856 (2021)
  7. In: 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR), pp. 260–264. IEEE (2019)
  8. Empirical Software Engineering 25, 2258–2301 (2020)
  9. arXiv preprint arXiv:1606.01540 (2016)
  10. IEEE Journal of Selected Topics in Signal Processing 9(4), 612–624 (2015)
  11. IEEE Transactions on Software Engineering (2022)
  12. arXiv preprint arXiv:1812.06110 (2018)
  13. In: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2020, p. 750–762. Association for Computing Machinery, New York, NY, USA (2020). DOI 10.1145/3368089.3409759. URL https://doi.org/10.1145/3368089.3409759
  14. Empirical Software Engineering 24, 381–416 (2019)
  15. Artificial Intelligence Review 54, 3215–3238 (2021)
  16. Machine Learning 110(9), 2419–2468 (2021)
  17. Frost, J.: Introduction to Statistics: An Intuitive Guide for Analyzing Data and (2019)
  18. In: Advanced Informatics for Computing Research: Second International Conference, ICAICR 2018, Shimla, India, July 14–15, 2018, Revised Selected Papers, Part I 2, pp. 160–170. Springer (2019)
  19. Gibbons, J.D.: Nonparametric measures of association. 91. Sage (1993)
  20. Google: How to use google sheets. https://support.google.com/docs/answer/6000292?hl=en&co=GENIE.Platform%3DDesktop (2020). Accessed: 2023-02-01
  21. In: 2021 IEEE 21st International Working Conference on Source Code Analysis and Manipulation (SCAM), pp. 58–69. IEEE (2021)
  22. In: 2020 IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 868–878. IEEE (2020)
  23. In: Proceedings of the 14th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM), pp. 1–11 (2020)
  24. arXiv preprint arXiv:2303.06853 (2023)
  25. In: Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, pp. 380–394 (2018)
  26. arXiv preprint arXiv:2207.10347 (2022)
  27. In: Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, pp. 1110–1121 (2020)
  28. In: Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 510–520 (2019)
  29. In: Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, ICSE ’20, p. 1135–1146. Association for Computing Machinery, New York, NY, USA (2020). DOI 10.1145/3377811.3380378. URL https://doi.org/10.1145/3377811.3380378
  30. Multimedia Tools and Applications 78, 15169–15211 (2019)
  31. IEEE Transactions on Intelligent Transportation Systems 23(6), 4909–4926 (2021)
  32. Web page (2017). URL https://github.com/tensorforce/tensorforce
  33. Li, Y.: Deep reinforcement learning: An overview. arXiv preprint arXiv:1701.07274 (2017)
  34. In: International Conference on Machine Learning, pp. 3053–3062. PMLR (2018)
  35. Lorenz, U.: Reinforcement Learning From Scratch: Understanding Current Approaches - with Examples in Java and Greenfoot. Springer International Publishing (2022). URL https://books.google.ca/books?id=bV2YEAAAQBAJ
  36. Pearson (2017)
  37. In: Agile Processes in Software Engineering and Extreme Programming: 20th International Conference, XP 2019, Montréal, QC, Canada, May 21–25, 2019, Proceedings 20, pp. 227–243. Springer International Publishing (2019)
  38. Proceedings of the ACM on human-computer interaction 3(CSCW), 1–23 (2019)
  39. nature 518(7540), 529–533 (2015)
  40. arXiv preprint arXiv:2301.10516 (2023)
  41. In: 13th USENIX symposium on operating systems design and implementation (OSDI 18), pp. 561–577 (2018)
  42. Empirical Software Engineering 28(3), 62 (2023)
  43. Empirical Software Engineering 29(1), 14 (2024)
  44. https://github.com/drlchallenges/drl_challenges (2023). Accessed: 2023-02-01
  45. ACM Sigplan Notices 45(10), 302–321 (2010)
  46. IEEE transactions on cybernetics 50(9), 3826–3839 (2020)
  47. ACM Transactions on Software Engineering and Methodology (TOSEM) 31(1), 1–27 (2021)
  48. Automated Software Engineering 29(1), 1–32 (2022)
  49. In: Proceedings of the 26th International Conference on Evaluation and Assessment in Software Engineering, pp. 190–200 (2022)
  50. In: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 7512–7519. IEEE (2021)
  51. International Journal of Production Research 60(13), 4316–4341 (2022)
  52. Empirical Software Engineering 27(1), 11 (2022)
  53. Plappert, M.: keras-rl. https://github.com/keras-rl/keras-rl (2016)
  54. O’Reilly Media (2023). URL https://books.google.ca/books?id=vKj0zgEACAAJ
  55. Qualtrics: Qualtrics official website. https://www.qualtrics.com/ (2023). Accessed: 2023-04-01
  56. In: Proceedings of the 1st international workshop on software engineering for AI in autonomous systems, pp. 35–38 (2018)
  57. In: Proceedings of the 13th ACM SIGGRAPH Conference on Motion, Interaction and Games, pp. 1–10 (2020)
  58. CoRR abs/1808.07903 (2018). URL http://arxiv.org/abs/1808.07903
  59. In: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, pp. 1–16 (2021)
  60. arXiv preprint arXiv:2401.03069 (2024)
  61. In: 2019 acm/ieee international symposium on empirical software engineering and measurement (esem), pp. 1–6. IEEE (2019)
  62. StackExchange: Stack exchange data dump. https://archive.org/details/stackexchange (2022). Accessed: 2023-02-01
  63. In: 2023 IEEE Conference on Software Testing, Verification and Validation (ICST), pp. 188–198. IEEE (2023)
  64. Star East 2003, 1–40 (2003)
  65. In: Proceedings of the 33rd ACM/IEEE international conference on automated software engineering, pp. 397–407 (2018)
  66. IEEE Communications Surveys & Tutorials 23(4), 2064–2097 (2021)
  67. In: Proceedings of the 31st ACM SIGSOFT International Symposium on Software Testing and Analysis, pp. 176–188 (2022)
  68. arXiv preprint arXiv:2308.12438 (2023)
  69. ACM Computing Surveys (CSUR) 55(1), 1–36 (2021)
  70. In: Proceedings of the evaluation and assessment in software engineering, pp. 41–50 (2020)
  71. Springer (2020)
  72. In: 2019 IEEE 30th International Symposium on Software Reliability Engineering (ISSRE), pp. 104–115. IEEE (2019)
  73. In: 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE), pp. 359–371. IEEE (2021)
  74. Empirical Software Engineering 27(6), 1–25 (2022)
  75. arXiv preprint arXiv:2206.07813 (2022)
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Mohammad Mehdi Morovati (7 papers)
  2. Florian Tambon (13 papers)
  3. Mina Taraghi (2 papers)
  4. Amin Nikanjam (39 papers)
  5. Foutse Khomh (140 papers)

Summary

We haven't generated a summary for this paper yet.