Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code
Abstract: In this work we systematically review the recent advancements in software engineering with LLMs, covering 70+ models, 40+ evaluation tasks, 180+ datasets, and 900 related works. Unlike previous works, we integrate software engineering (SE) with NLP by discussing the perspectives of both sides: SE applies LLMs for development automation, while NLP adopts SE tasks for LLM evaluation. We break down code processing models into general LLMs represented by the GPT family and specialized models that are specifically pretrained on code, often with tailored objectives. We discuss the relations and differences between these models, and highlight the historical transition of code modeling from statistical models and RNNs to pretrained Transformers and LLMs, which is exactly the same course that had been taken by NLP. We also go beyond programming and review LLMs' application in other software engineering activities including requirement engineering, testing, deployment, and operations in an endeavor to provide a global view of NLP in SE, and identify key challenges and potential future directions in this domain. We keep the survey open and updated on GitHub at https://github.com/codefuse-ai/Awesome-Code-LLM.
- Large-scale and language-oblivious code authorship identification. In David Lie, Mohammad Mannan, Michael Backes, and XiaoFeng Wang (eds.), Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, CCS 2018, Toronto, ON, Canada, October 15-19, 2018, pp. 101–114. ACM, 2018. doi: 10.1145/3243734.3243738. URL https://doi.org/10.1145/3243734.3243738.
- Juice: A large scale distantly supervised dataset for open domain context-based code generation. In Kentaro Inui, Jing Jiang, Vincent Ng, and Xiaojun Wan (eds.), Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, November 3-7, 2019, pp. 5435–5445. Association for Computational Linguistics, 2019. doi: 10.18653/V1/D19-1546. URL https://doi.org/10.18653/v1/D19-1546.
- Guiding language models of code with global context using monitors. CoRR, abs/2306.10763, 2023. doi: 10.48550/ARXIV.2306.10763. URL https://doi.org/10.48550/arXiv.2306.10763.
- A transformer-based approach for source code summarization. In Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel R. Tetreault (eds.), Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020, pp. 4998–5007. Association for Computational Linguistics, 2020. doi: 10.18653/v1/2020.acl-main.449. URL https://doi.org/10.18653/v1/2020.acl-main.449.
- Unified pre-training for program understanding and generation. In Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tür, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, and Yichao Zhou (eds.), Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2021, Online, June 6-11, 2021, pp. 2655–2668. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.naacl-main.211. URL https://doi.org/10.18653/v1/2021.naacl-main.211.
- AVATAR: A parallel corpus for java-python program translation. In Anna Rogers, Jordan L. Boyd-Graber, and Naoaki Okazaki (eds.), Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023, pp. 2268–2281. Association for Computational Linguistics, 2023. doi: 10.18653/V1/2023.FINDINGS-ACL.143. URL https://doi.org/10.18653/v1/2023.findings-acl.143.
- GQA: training generalized multi-query transformer models from multi-head checkpoints. CoRR, abs/2305.13245, 2023. doi: 10.48550/ARXIV.2305.13245. URL https://doi.org/10.48550/arXiv.2305.13245.
- A3test: Assertion-augmented automated test case generation. CoRR, abs/2302.10352, 2023. doi: 10.48550/ARXIV.2302.10352. URL https://doi.org/10.48550/arXiv.2302.10352.
- Santacoder: don’t reach for the stars! CoRR, abs/2301.03988, 2023. doi: 10.48550/arXiv.2301.03988. URL https://doi.org/10.48550/arXiv.2301.03988.
- Mining source code repositories at massive scale using language modeling. In Thomas Zimmermann, Massimiliano Di Penta, and Sunghun Kim (eds.), Proceedings of the 10th Working Conference on Mining Software Repositories, MSR ’13, San Francisco, CA, USA, May 18-19, 2013, pp. 207–216. IEEE Computer Society, 2013. doi: 10.1109/MSR.2013.6624029. URL https://doi.org/10.1109/MSR.2013.6624029.
- Mining idioms from source code. In Shing-Chi Cheung, Alessandro Orso, and Margaret-Anne D. Storey (eds.), Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, (FSE-22), Hong Kong, China, November 16 - 22, 2014, pp. 472–483. ACM, 2014. doi: 10.1145/2635868.2635901. URL https://doi.org/10.1145/2635868.2635901.
- Learning natural coding conventions. In Shing-Chi Cheung, Alessandro Orso, and Margaret-Anne D. Storey (eds.), Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, (FSE-22), Hong Kong, China, November 16 - 22, 2014, pp. 281–293. ACM, 2014. doi: 10.1145/2635868.2635883. URL https://doi.org/10.1145/2635868.2635883.
- Suggesting accurate method and class names. In Elisabetta Di Nitto, Mark Harman, and Patrick Heymans (eds.), Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ESEC/FSE 2015, Bergamo, Italy, August 30 - September 4, 2015, pp. 38–49. ACM, 2015. doi: 10.1145/2786805.2786849. URL https://doi.org/10.1145/2786805.2786849.
- Tailored mutants fit bugs better. CoRR, abs/1611.02516, 2016a. URL http://arxiv.org/abs/1611.02516.
- A convolutional attention network for extreme summarization of source code. In Maria-Florina Balcan and Kilian Q. Weinberger (eds.), Proceedings of the 33nd International Conference on Machine Learning, ICML 2016, New York City, NY, USA, June 19-24, 2016, volume 48 of JMLR Workshop and Conference Proceedings, pp. 2091–2100. JMLR.org, 2016b. URL http://proceedings.mlr.press/v48/allamanis16.html.
- Learning to represent programs with graphs. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018. URL https://openreview.net/forum?id=BJOFETxR-.
- Typilus: neural type hints. In Alastair F. Donaldson and Emina Torlak (eds.), Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, PLDI 2020, London, UK, June 15-20, 2020, pp. 91–105. ACM, 2020. doi: 10.1145/3385412.3385997. URL https://doi.org/10.1145/3385412.3385997.
- Self-supervised bug detection and repair. In Marc’Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, and Jennifer Wortman Vaughan (eds.), Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, pp. 27865–27876, 2021. URL https://proceedings.neurips.cc/paper/2021/hash/ea96efc03b9a050d895110db8c4af057-Abstract.html.
- A general path-based representation for predicting program properties. In Jeffrey S. Foster and Dan Grossman (eds.), Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2018, Philadelphia, PA, USA, June 18-22, 2018, pp. 404–419. ACM, 2018. doi: 10.1145/3192366.3192412. URL https://doi.org/10.1145/3192366.3192412.
- code2seq: Generating sequences from structured representations of code. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019a. URL https://openreview.net/forum?id=H1gKYo09tX.
- code2vec: learning distributed representations of code. Proc. ACM Program. Lang., 3(POPL):40:1–40:29, 2019b. doi: 10.1145/3290353. URL https://doi.org/10.1145/3290353.
- Structural language models of code. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event, volume 119 of Proceedings of Machine Learning Research, pp. 245–256. PMLR, 2020. URL http://proceedings.mlr.press/v119/alon20a.html.
- Learning to recommend third-party library migration opportunities at the API level. Appl. Soft Comput., 90:106140, 2020. doi: 10.1016/J.ASOC.2020.106140. URL https://doi.org/10.1016/j.asoc.2020.106140.
- Mathqa: Towards interpretable math word problem solving with operation-based formalisms. In Jill Burstein, Christy Doran, and Thamar Solorio (eds.), Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short Papers), pp. 2357–2367. Association for Computational Linguistics, 2019. doi: 10.18653/v1/n19-1245. URL https://doi.org/10.18653/v1/n19-1245.
- Palm 2 technical report. CoRR, abs/2305.10403, 2023. doi: 10.48550/arXiv.2305.10403. URL https://doi.org/10.48550/arXiv.2305.10403.
- T5QL: taming language models for SQL generation. CoRR, abs/2209.10254, 2022. doi: 10.48550/ARXIV.2209.10254. URL https://doi.org/10.48550/arXiv.2209.10254.
- Slade: A portable small language model decompiler for optimized assembler. CoRR, abs/2305.12520, 2023. doi: 10.48550/ARXIV.2305.12520. URL https://doi.org/10.48550/arXiv.2305.12520.
- Nessie: Automatically testing javascript apis with asynchronous callbacks. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 1494–1505. ACM, 2022. doi: 10.1145/3510003.3510106. URL https://doi.org/10.1145/3510003.3510106.
- In nomine function: Naming functions in stripped binaries with neural networks, 2021.
- Multi-lingual evaluation of code generation models. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. URL https://openreview.net/pdf?id=Bo7eeXm6An8.
- Program synthesis with large language models. CoRR, abs/2108.07732, 2021. URL https://arxiv.org/abs/2108.07732.
- Layer normalization. CoRR, abs/1607.06450, 2016. URL http://arxiv.org/abs/1607.06450.
- Studenteval: A benchmark of student-written prompts for large language models of code. CoRR, abs/2306.04556, 2023. doi: 10.48550/ARXIV.2306.04556. URL https://doi.org/10.48550/arXiv.2306.04556.
- Neural machine translation by jointly learning to align and translate. In Yoshua Bengio and Yann LeCun (eds.), 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015. URL http://arxiv.org/abs/1409.0473.
- An actor-critic algorithm for sequence prediction. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings. OpenReview.net, 2017. URL https://openreview.net/forum?id=SJDaqqveg.
- Qwen technical report. CoRR, abs/2309.16609, 2023. doi: 10.48550/arXiv.2309.16609. URL https://doi.org/10.48550/arXiv.2309.16609.
- Training a helpful and harmless assistant with reinforcement learning from human feedback. CoRR, abs/2204.05862, 2022. doi: 10.48550/ARXIV.2204.05862. URL https://doi.org/10.48550/arXiv.2204.05862.
- Codeplan: Repository-level coding using llms and planning. CoRR, abs/2309.12499, 2023. doi: 10.48550/ARXIV.2309.12499. URL https://doi.org/10.48550/arXiv.2309.12499.
- Vipin Balachandran. Reducing human effort and improving quality in peer code reviews using automatic static analysis and reviewer recommendation. In David Notkin, Betty H. C. Cheng, and Klaus Pohl (eds.), 35th International Conference on Software Engineering, ICSE ’13, San Francisco, CA, USA, May 18-26, 2013, pp. 931–940. IEEE Computer Society, 2013. doi: 10.1109/ICSE.2013.6606642. URL https://doi.org/10.1109/ICSE.2013.6606642.
- Deepcoder: Learning to write programs. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings. OpenReview.net, 2017. URL https://openreview.net/forum?id=ByldLrqlx.
- Deep neural networks for source code author identification. In Minho Lee, Akira Hirose, Zeng-Guang Hou, and Rhee Man Kil (eds.), Neural Information Processing - 20th International Conference, ICONIP 2013, Daegu, Korea, November 3-7, 2013. Proceedings, Part II, volume 8227 of Lecture Notes in Computer Science, pp. 368–375. Springer, 2013. doi: 10.1007/978-3-642-42042-9_46. URL https://doi.org/10.1007/978-3-642-42042-9_46.
- Variable name recovery in decompiled binary code using constrained masked language modeling. CoRR, abs/2103.12801, 2021. URL https://arxiv.org/abs/2103.12801.
- Project-level encoding for neural source code summarization of subroutines. In 29th IEEE/ACM International Conference on Program Comprehension, ICPC 2021, Madrid, Spain, May 20-21, 2021, pp. 253–264. IEEE, 2021. doi: 10.1109/ICPC52881.2021.00032. URL https://doi.org/10.1109/ICPC52881.2021.00032.
- Code generation tools (almost) for free? A study of few-shot, pre-trained language models on code. CoRR, abs/2206.01335, 2022. doi: 10.48550/arXiv.2206.01335. URL https://doi.org/10.48550/arXiv.2206.01335.
- Helping developers help themselves: Automatic decomposition of code review changesets. In Antonia Bertolino, Gerardo Canfora, and Sebastian G. Elbaum (eds.), 37th IEEE/ACM International Conference on Software Engineering, ICSE 2015, Florence, Italy, May 16-24, 2015, Volume 1, pp. 134–144. IEEE Computer Society, 2015. doi: 10.1109/ICSE.2015.35. URL https://doi.org/10.1109/ICSE.2015.35.
- Antonio Valerio Miceli Barone and Rico Sennrich. A parallel corpus of python functions and documentation strings for automated code documentation and code generation. In Greg Kondrak and Taro Watanabe (eds.), Proceedings of the Eighth International Joint Conference on Natural Language Processing, IJCNLP 2017, Taipei, Taiwan, November 27 - December 1, 2017, Volume 2: Short Papers, pp. 314–319. Asian Federation of Natural Language Processing, 2017. URL https://aclanthology.org/I17-2053/.
- Property-based mutation testing. In IEEE Conference on Software Testing, Verification and Validation, ICST 2023, Dublin, Ireland, April 16-20, 2023, pp. 222–233. IEEE, 2023. doi: 10.1109/ICST57152.2023.00029. URL https://doi.org/10.1109/ICST57152.2023.00029.
- Efficient training of language models to fill in the middle. CoRR, abs/2207.14255, 2022. doi: 10.48550/arXiv.2207.14255. URL https://doi.org/10.48550/arXiv.2207.14255.
- Autopandas: neural-backed generators for program synthesis. Proc. ACM Program. Lang., 3(OOPSLA):168:1–168:27, 2019. doi: 10.1145/3360594. URL https://doi.org/10.1145/3360594.
- Neural code comprehension: A learnable representation of code semantics. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, December 3-8, 2018, Montréal, Canada, pp. 3589–3601, 2018. URL https://proceedings.neurips.cc/paper/2018/hash/17c3433fecc21b57000debdf7ad5c930-Abstract.html.
- Tfix: Learning to fix coding errors with a text-to-text transformer. In Marina Meila and Tong Zhang (eds.), Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, volume 139 of Proceedings of Machine Learning Research, pp. 780–791. PMLR, 2021. URL http://proceedings.mlr.press/v139/berabi21a.html.
- Cvefixes: automated collection of vulnerabilities and their fixes from open-source software. In Shane McIntosh, Xin Xia, and Sousuke Amasaki (eds.), PROMISE ’21: 17th International Conference on Predictive Models and Data Analytics in Software Engineering, Athens Greece, August 19-20, 2021, pp. 30–39. ACM, 2021. doi: 10.1145/3475960.3475985. URL https://doi.org/10.1145/3475960.3475985.
- Neuro-symbolic program corrector for introductory programming assignments. In Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (eds.), Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, pp. 60–70. ACM, 2018. doi: 10.1145/3180155.3180219. URL https://doi.org/10.1145/3180155.3180219.
- Benchmarking software vulnerability detection techniques: A survey. CoRR, abs/2303.16362, 2023. doi: 10.48550/ARXIV.2303.16362. URL https://doi.org/10.48550/arXiv.2303.16362.
- Nar-miner: discovering negative association rules from code for bug detection. In Gary T. Leavens, Alessandro Garcia, and Corina S. Pasareanu (eds.), Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2018, Lake Buena Vista, FL, USA, November 04-09, 2018, pp. 411–422. ACM, 2018. doi: 10.1145/3236024.3236032. URL https://doi.org/10.1145/3236024.3236032.
- Statistical deobfuscation of android applications. In Edgar R. Weippl, Stefan Katzenbeisser, Christopher Kruegel, Andrew C. Myers, and Shai Halevi (eds.), Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, Vienna, Austria, October 24-28, 2016, pp. 343–355. ACM, 2016. doi: 10.1145/2976749.2978422. URL https://doi.org/10.1145/2976749.2978422.
- Datasheet for the pile. CoRR, abs/2201.07311, 2022. URL https://arxiv.org/abs/2201.07311.
- PHOG: probabilistic model for code. In Maria-Florina Balcan and Kilian Q. Weinberger (eds.), Proceedings of the 33nd International Conference on Machine Learning, ICML 2016, New York City, NY, USA, June 19-24, 2016, volume 48 of JMLR Workshop and Conference Proceedings, pp. 2933–2942. JMLR.org, 2016. URL http://proceedings.mlr.press/v48/bielik16.html.
- Gpt-neox-20b: An open-source autoregressive language model. CoRR, abs/2204.06745, 2022. doi: 10.48550/arXiv.2204.06745. URL https://doi.org/10.48550/arXiv.2204.06745.
- Memo: Automatically identifying metamorphic relations in javadoc comments for test automation. J. Syst. Softw., 181:111041, 2021. doi: 10.1016/j.jss.2021.111041. URL https://doi.org/10.1016/j.jss.2021.111041.
- Representing schema structure with graph neural networks for text-to-sql parsing. In Anna Korhonen, David R. Traum, and Lluís Màrquez (eds.), Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28- August 2, 2019, Volume 1: Long Papers, pp. 4560–4565. Association for Computational Linguistics, 2019. doi: 10.18653/v1/p19-1448. URL https://doi.org/10.18653/v1/p19-1448.
- Directed greybox fuzzing. In Bhavani Thuraisingham, David Evans, Tal Malkin, and Dongyan Xu (eds.), Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, CCS 2017, Dallas, TX, USA, October 30 - November 03, 2017, pp. 2329–2344. ACM, 2017. doi: 10.1145/3133956.3134020. URL https://doi.org/10.1145/3133956.3134020.
- Coverage-based greybox fuzzing as markov chain. IEEE Trans. Software Eng., 45(5):489–506, 2019. doi: 10.1109/TSE.2017.2785841. URL https://doi.org/10.1109/TSE.2017.2785841.
- Dltpy: Deep learning type inference of python function signatures using natural language context. CoRR, abs/1912.00680, 2019. URL http://arxiv.org/abs/1912.00680.
- The care and feeding of wild-caught mutants. In Eric Bodden, Wilhelm Schäfer, Arie van Deursen, and Andrea Zisman (eds.), Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, ESEC/FSE 2017, Paderborn, Germany, September 4-8, 2017, pp. 511–522. ACM, 2017. doi: 10.1145/3106237.3106280. URL https://doi.org/10.1145/3106237.3106280.
- Language models are few-shot learners. In Hugo Larochelle, Marc’Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (eds.), Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, 2020. URL https://proceedings.neurips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html.
- Learning from examples to improve code completion systems. In Hans van Vliet and Valérie Issarny (eds.), Proceedings of the 7th joint meeting of the European Software Engineering Conference and the ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2009, Amsterdam, The Netherlands, August 24-28, 2009, pp. 213–222. ACM, 2009. doi: 10.1145/1595696.1595728. URL https://doi.org/10.1145/1595696.1595728.
- Sparks of artificial general intelligence: Early experiments with GPT-4. CoRR, abs/2303.12712, 2023. doi: 10.48550/arXiv.2303.12712. URL https://doi.org/10.48550/arXiv.2303.12712.
- Detect-localize-repair: A unified framework for learning to debug with codet5. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Findings of the Association for Computational Linguistics: EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 812–823. Association for Computational Linguistics, 2022. doi: 10.18653/V1/2022.FINDINGS-EMNLP.57. URL https://doi.org/10.18653/v1/2022.findings-emnlp.57.
- SAR: learning cross-language API mappings with little knowledge. In Marlon Dumas, Dietmar Pfahl, Sven Apel, and Alessandra Russo (eds.), Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019, Tallinn, Estonia, August 26-30, 2019, pp. 796–806. ACM, 2019. doi: 10.1145/3338906.3338924. URL https://doi.org/10.1145/3338906.3338924.
- Infercode: Self-supervised learning of code representations by predicting subtrees. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22-30 May 2021, pp. 1186–1197. IEEE, 2021a. doi: 10.1109/ICSE43902.2021.00109. URL https://doi.org/10.1109/ICSE43902.2021.00109.
- Self-supervised contrastive learning for code retrieval and summarization via semantic-preserving transformations. In Fernando Diaz, Chirag Shah, Torsten Suel, Pablo Castells, Rosie Jones, and Tetsuya Sakai (eds.), SIGIR ’21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual Event, Canada, July 11-15, 2021, pp. 511–521. ACM, 2021b. doi: 10.1145/3404835.3462840. URL https://doi.org/10.1145/3404835.3462840.
- Leveraging grammar and reinforcement learning for neural program synthesis. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018. URL https://openreview.net/forum?id=H1Xw62kRZ.
- When deep learning met code search. In Marlon Dumas, Dietmar Pfahl, Sven Apel, and Alessandra Russo (eds.), Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019, Tallinn, Estonia, August 26-30, 2019, pp. 964–974. ACM, 2019. doi: 10.1145/3338906.3340458. URL https://doi.org/10.1145/3338906.3340458.
- A study on prompt design, advantages and limitations of chatgpt for deep learning program repair. CoRR, abs/2304.08191, 2023. doi: 10.48550/ARXIV.2304.08191. URL https://doi.org/10.48550/arXiv.2304.08191.
- LGESQL: line graph enhanced text-to-sql model with mixed local and non-local relations. In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (eds.), Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021, pp. 2541–2555. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.acl-long.198. URL https://doi.org/10.18653/v1/2021.acl-long.198.
- Multipl-e: a scalable and polyglot approach to benchmarking neural code generation. IEEE Transactions on Software Engineering, 2023a.
- Type prediction with program decomposition and fill-in-the-type training. CoRR, abs/2305.17145, 2023b. doi: 10.48550/ARXIV.2305.17145. URL https://doi.org/10.48550/arXiv.2305.17145.
- Program-adaptive mutational fuzzing. In 2015 IEEE Symposium on Security and Privacy, SP 2015, San Jose, CA, USA, May 17-21, 2015, pp. 725–741. IEEE Computer Society, 2015. doi: 10.1109/SP.2015.50. URL https://doi.org/10.1109/SP.2015.50.
- Ernie-code: Beyond english-centric cross-lingual pretraining for programming languages. In Anna Rogers, Jordan L. Boyd-Graber, and Naoaki Okazaki (eds.), Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023, pp. 10628–10650. Association for Computational Linguistics, 2023. doi: 10.18653/V1/2023.FINDINGS-ACL.676. URL https://doi.org/10.18653/v1/2023.findings-acl.676.
- Cross-domain deep code search with meta learning. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 487–498. ACM, 2022. doi: 10.1145/3510003.3510125. URL https://doi.org/10.1145/3510003.3510125.
- On multi-modal learning of editing source code. In 36th IEEE/ACM International Conference on Automated Software Engineering, ASE 2021, Melbourne, Australia, November 15-19, 2021, pp. 443–455. IEEE, 2021. doi: 10.1109/ASE51524.2021.9678559. URL https://doi.org/10.1109/ASE51524.2021.9678559.
- Natgen: generative pre-training by "naturalizing" source code. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 18–30. ACM, 2022a. doi: 10.1145/3540250.3549162. URL https://doi.org/10.1145/3540250.3549162.
- CODIT: code editing with tree-based neural models. IEEE Trans. Software Eng., 48(4):1385–1399, 2022b. doi: 10.1109/TSE.2020.3020502. URL https://doi.org/10.1109/TSE.2020.3020502.
- Deep learning based vulnerability detection: Are we there yet? IEEE Trans. Software Eng., 48(9):3280–3296, 2022c. doi: 10.1109/TSE.2021.3087402. URL https://doi.org/10.1109/TSE.2021.3087402.
- Transformer-based vulnerability detection in code at edittime: Zero-shot, few-shot, or fine-tuning? CoRR, abs/2306.01754, 2023. doi: 10.48550/ARXIV.2306.01754. URL https://doi.org/10.48550/arXiv.2306.01754.
- Training and evaluating a jupyter notebook data science assistant. CoRR, abs/2201.12901, 2022. URL https://arxiv.org/abs/2201.12901.
- How to prompt llms for text-to-sql: A study in zero-shot, single-domain, and cross-domain settings. CoRR, abs/2305.11853, 2023. doi: 10.48550/ARXIV.2305.11853. URL https://doi.org/10.48550/arXiv.2305.11853.
- Codet: Code generation with generated tests. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023a. URL https://openreview.net/pdf?id=ktrw68Cmu9c.
- Mining likely analogical apis across third-party libraries via large-scale unsupervised API semantics embedding. IEEE Trans. Software Eng., 47(3):432–447, 2021a. doi: 10.1109/TSE.2019.2896123. URL https://doi.org/10.1109/TSE.2019.2896123.
- Evaluating large language models trained on code. CoRR, abs/2107.03374, 2021b. URL https://arxiv.org/abs/2107.03374.
- Angora: Efficient fuzzing by principled search. In 2018 IEEE Symposium on Security and Privacy, SP 2018, Proceedings, 21-23 May 2018, San Francisco, California, USA, pp. 711–725. IEEE Computer Society, 2018. doi: 10.1109/SP.2018.00046. URL https://doi.org/10.1109/SP.2018.00046.
- Extending context window of large language models via positional interpolation. CoRR, abs/2306.15595, 2023b. doi: 10.48550/ARXIV.2306.15595. URL https://doi.org/10.48550/arXiv.2306.15595.
- Program of thoughts prompting: Disentangling computation from reasoning for numerical reasoning tasks. CoRR, abs/2211.12588, 2022. doi: 10.48550/ARXIV.2211.12588. URL https://doi.org/10.48550/arXiv.2211.12588.
- Tree-to-tree neural networks for program translation. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, December 3-8, 2018, Montréal, Canada, pp. 2552–2562, 2018. URL https://proceedings.neurips.cc/paper/2018/hash/d759175de8ea5b1d9a2660e45554894f-Abstract.html.
- Plotcoder: Hierarchical decoding for synthesizing visualization code in programmatic context. In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (eds.), Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021, pp. 2169–2181. Association for Computational Linguistics, 2021c. doi: 10.18653/V1/2021.ACL-LONG.169. URL https://doi.org/10.18653/v1/2021.acl-long.169.
- Teaching large language models to self-debug. CoRR, abs/2304.05128, 2023c. doi: 10.48550/arXiv.2304.05128. URL https://doi.org/10.48550/arXiv.2304.05128.
- Diversevul: A new vulnerable source code dataset for deep learning based vulnerability detection. In Proceedings of the 26th International Symposium on Research in Attacks, Intrusions and Defenses, RAID 2023, Hong Kong, China, October 16-18, 2023, pp. 654–668. ACM, 2023d. doi: 10.1145/3607199.3607242. URL https://doi.org/10.1145/3607199.3607242.
- Experience report: Deep learning-based system log analysis for anomaly detection. CoRR, abs/2107.05908, 2021d. URL https://arxiv.org/abs/2107.05908.
- Sequencer: Sequence-to-sequence learning for end-to-end program repair. IEEE Trans. Software Eng., 47(9):1943–1959, 2021e. doi: 10.1109/TSE.2019.2940179. URL https://doi.org/10.1109/TSE.2019.2940179.
- David Chiang. Hierarchical phrase-based translation. Comput. Linguistics, 33(2):201–228, 2007. doi: 10.1162/coli.2007.33.2.201. URL https://doi.org/10.1162/coli.2007.33.2.201.
- Using a nearest-neighbour, bert-based approach for scalable clone detection. In IEEE International Conference on Software Maintenance and Evolution, ICSME 2022, Limassol, Cyprus, October 3-7, 2022, pp. 582–591. IEEE, 2022. doi: 10.1109/ICSME55016.2022.00080. URL https://doi.org/10.1109/ICSME55016.2022.00080.
- RYANSQL: recursively applying sketch-based slot fillings for complex text-to-sql in cross-domain databases. Comput. Linguistics, 47(2):309–332, 2021. doi: 10.1162/coli_a_00403. URL https://doi.org/10.1162/coli_a_00403.
- Rethinking attention with performers. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. URL https://openreview.net/forum?id=Ua6zuk0WRH.
- Palm: Scaling language modeling with pathways. CoRR, abs/2204.02311, 2022. doi: 10.48550/arXiv.2204.02311. URL https://doi.org/10.48550/arXiv.2204.02311.
- Deep reinforcement learning from human preferences. In Isabelle Guyon, Ulrike von Luxburg, Samy Bengio, Hanna M. Wallach, Rob Fergus, S. V. N. Vishwanathan, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, pp. 4299–4307, 2017. URL https://proceedings.neurips.cc/paper/2017/hash/d5e2c0adad503c91f91df240d0cd4e49-Abstract.html.
- Pangu-coder: Program synthesis with function-level language modeling. CoRR, abs/2207.11280, 2022. doi: 10.48550/arXiv.2207.11280. URL https://doi.org/10.48550/arXiv.2207.11280.
- Scaling instruction-finetuned language models. CoRR, abs/2210.11416, 2022. doi: 10.48550/arXiv.2210.11416. URL https://doi.org/10.48550/arXiv.2210.11416.
- ELECTRA: pre-training text encoders as discriminators rather than generators. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020. URL https://openreview.net/forum?id=r1xMH1BtvB.
- Pymt5: multi-mode translation of natural language and python code with transformers. In Bonnie Webber, Trevor Cohn, Yulan He, and Yang Liu (eds.), Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, Online, November 16-20, 2020, pp. 9052–9065. Association for Computational Linguistics, 2020. doi: 10.18653/v1/2020.emnlp-main.728. URL https://doi.org/10.18653/v1/2020.emnlp-main.728.
- Long-range modeling of source code files with ewash: Extended window access by syntax hierarchy. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (eds.), Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, pp. 4713–4722. Association for Computational Linguistics, 2021. doi: 10.18653/V1/2021.EMNLP-MAIN.387. URL https://doi.org/10.18653/v1/2021.emnlp-main.387.
- Training verifiers to solve math word problems. CoRR, abs/2110.14168, 2021. URL https://arxiv.org/abs/2110.14168.
- PIT: a practical mutation testing tool for java (demo). In Andreas Zeller and Abhik Roychoudhury (eds.), Proceedings of the 25th International Symposium on Software Testing and Analysis, ISSTA 2016, Saarbrücken, Germany, July 18-20, 2016, pp. 449–452. ACM, 2016. doi: 10.1145/2931037.2948707. URL https://doi.org/10.1145/2931037.2948707.
- Christian S. Collberg and Clark Thomborson. Watermarking, tamper-proofing, and obfuscation - tools for software protection. IEEE Transactions on Software Engineering, 28(8):735–746, August 2002. ISSN 0098-5589. doi: 10.1109/TSE.2002.1027797. Funding Information: The authors are grateful for the extensive and insightful comments of two anonymous referees. This material is based upon work supported by the US National Science Foundation under Grant No. 0073483.
- Cross-lingual language model pretraining. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 7057–7067, 2019. URL https://proceedings.neurips.cc/paper/2019/hash/c04c19c2c2474dbf5f7ac4372c5b9af1-Abstract.html.
- On automatically generating commit messages via summarization of source code changes. In 14th IEEE International Working Conference on Source Code Analysis and Manipulation, SCAM 2014, Victoria, BC, Canada, September 28-29, 2014, pp. 275–284. IEEE Computer Society, 2014. doi: 10.1109/SCAM.2014.14. URL https://doi.org/10.1109/SCAM.2014.14.
- Fixjs: A dataset of bug-fixing javascript commits. In 19th IEEE/ACM International Conference on Mining Software Repositories, MSR 2022, Pittsburgh, PA, USA, May 23-24, 2022, pp. 712–716. ACM, 2022. doi: 10.1145/3524842.3528480. URL https://doi.org/10.1145/3524842.3528480.
- Pyinfer: Deep learning semantic type inference for python variables. CoRR, abs/2106.14316, 2021. URL https://arxiv.org/abs/2106.14316.
- Programl: A graph-based program representation for data flow analysis and compiler optimizations. In Marina Meila and Tong Zhang (eds.), Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, pp. 2244–2253. PMLR, 18–24 Jul 2021. URL https://proceedings.mlr.press/v139/cummins21a.html.
- Expanding the scope of the ATIS task: The ATIS-3 corpus. In Human Language Technology, Proceedings of a Workshop held at Plainsboro, New Jerey, USA, March 8-11, 1994. Morgan Kaufmann, 1994. URL https://aclanthology.org/H94-1010/.
- Logram: Efficient log parsing using $n$n-gram dictionaries. IEEE Trans. Software Eng., 48(3):879–892, 2022. doi: 10.1109/TSE.2020.3007554. URL https://doi.org/10.1109/TSE.2020.3007554.
- Flashattention: Fast and memory-efficient exact attention with io-awareness. In NeurIPS, 2022. URL http://papers.nips.cc/paper_files/paper/2022/hash/67d57c32e20fd0a7a302cb81d36e40d5-Abstract-Conference.html.
- Javabert: Training a transformer-based model for the java programming language. In 36th IEEE/ACM International Conference on Automated Software Engineering, ASE 2021 - Workshops, Melbourne, Australia, November 15-19, 2021, pp. 90–95. IEEE, 2021. doi: 10.1109/ASEW52652.2021.00028. URL https://doi.org/10.1109/ASEW52652.2021.00028.
- μ𝜇\mathrm{\mu}italic_μbert: Mutation testing using pre-trained language models. In 15th IEEE International Conference on Software Testing, Verification and Validation Workshops ICST Workshops 2022, Valencia, Spain, April 4-13, 2022, pp. 160–169. IEEE, 2022. doi: 10.1109/ICSTW55395.2022.00039. URL https://doi.org/10.1109/ICSTW55395.2022.00039.
- Recent advances in text-to-sql: A survey of what we have and what we expect. In Nicoletta Calzolari, Chu-Ren Huang, Hansaem Kim, James Pustejovsky, Leo Wanner, Key-Sun Choi, Pum-Mo Ryu, Hsin-Hsi Chen, Lucia Donatelli, Heng Ji, Sadao Kurohashi, Patrizia Paggio, Nianwen Xue, Seokhwan Kim, Younggyun Hahm, Zhong He, Tony Kyungil Lee, Enrico Santus, Francis Bond, and Seung-Hoon Na (eds.), Proceedings of the 29th International Conference on Computational Linguistics, COLING 2022, Gyeongju, Republic of Korea, October 12-17, 2022, pp. 2166–2187. International Committee on Computational Linguistics, 2022a. URL https://aclanthology.org/2022.coling-1.190.
- Structure-grounded pretraining for text-to-sql. In Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tür, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, and Yichao Zhou (eds.), Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2021, Online, June 6-11, 2021, pp. 1337–1350. Association for Computational Linguistics, 2021. doi: 10.18653/V1/2021.NAACL-MAIN.105. URL https://doi.org/10.18653/v1/2021.naacl-main.105.
- Fuzzing deep-learning libraries via automated relational API inference. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 44–56. ACM, 2022b. doi: 10.1145/3540250.3549085. URL https://doi.org/10.1145/3540250.3549085.
- Large language models are zero-shot fuzzers: Fuzzing deep-learning libraries via large language models. In René Just and Gordon Fraser (eds.), Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2023, Seattle, WA, USA, July 17-21, 2023, pp. 423–435. ACM, 2023. doi: 10.1145/3597926.3598067. URL https://doi.org/10.1145/3597926.3598067.
- Robustfill: Neural program learning under noisy I/O. In Doina Precup and Yee Whye Teh (eds.), Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017, volume 70 of Proceedings of Machine Learning Research, pp. 990–998. PMLR, 2017a. URL http://proceedings.mlr.press/v70/devlin17a.html.
- Semantic code repair using neuro-symbolic transformation networks. CoRR, abs/1710.11054, 2017b. URL http://arxiv.org/abs/1710.11054.
- BERT: pre-training of deep bidirectional transformers for language understanding. In Jill Burstein, Christy Doran, and Thamar Solorio (eds.), Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics, 2019. doi: 10.18653/v1/n19-1423. URL https://doi.org/10.18653/v1/n19-1423.
- Codefuse-13b: A pretrained multi-lingual code large language model. CoRR, abs/2310.06266, 2023. doi: 10.48550/ARXIV.2310.06266. URL https://doi.org/10.48550/arXiv.2310.06266.
- TOGA: A neural method for test oracle generation. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 2130–2141. ACM, 2022. doi: 10.1145/3510003.3510141. URL https://doi.org/10.1145/3510003.3510141.
- Towards learning (dis)-similarity of source code from program contrasts. In Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (eds.), Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022, pp. 6300–6312. Association for Computational Linguistics, 2022a. doi: 10.18653/v1/2022.acl-long.436. URL https://doi.org/10.18653/v1/2022.acl-long.436.
- Cocomic: Code completion by jointly modeling in-file and cross-file context. CoRR, abs/2212.10007, 2022b. doi: 10.48550/ARXIV.2212.10007. URL https://doi.org/10.48550/arXiv.2212.10007.
- Crosscodeeval: A diverse and multilingual benchmark for cross-file code completion. CoRR, abs/2310.11248, 2023. doi: 10.48550/ARXIV.2310.11248. URL https://doi.org/10.48550/arXiv.2310.11248.
- LAVA: large-scale automated vulnerability addition. In IEEE Symposium on Security and Privacy, SP 2016, San Jose, CA, USA, May 22-26, 2016, pp. 110–121. IEEE Computer Society, 2016. doi: 10.1109/SP.2016.15. URL https://doi.org/10.1109/SP.2016.15.
- FIRA: fine-grained graph-based code change representation for automated commit message generation. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 970–981. ACM, 2022. doi: 10.1145/3510003.3510069. URL https://doi.org/10.1145/3510003.3510069.
- Li Dong and Mirella Lapata. Coarse-to-fine decoding for neural semantic parsing. In Iryna Gurevych and Yusuke Miyao (eds.), Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15-20, 2018, Volume 1: Long Papers, pp. 731–742. Association for Computational Linguistics, 2018. doi: 10.18653/v1/P18-1068. URL https://aclanthology.org/P18-1068/.
- Unified language model pre-training for natural language understanding and generation. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 13042–13054, 2019. URL https://proceedings.neurips.cc/paper/2019/hash/c20bb2d9a50d5ac1f713f8b34d9aac5a-Abstract.html.
- Self-collaboration code generation via chatgpt. CoRR, abs/2304.07590, 2023. doi: 10.48550/ARXIV.2304.07590. URL https://doi.org/10.48550/arXiv.2304.07590.
- Towards understanding the capability of large language models on code clone detection: A survey. CoRR, abs/2308.01191, 2023. doi: 10.48550/arXiv.2308.01191. URL https://doi.org/10.48550/arXiv.2308.01191.
- Deepdebug: Fixing python bugs using stack traces, backtranslation, and code skeletons. CoRR, abs/2105.09352, 2021. URL https://arxiv.org/abs/2105.09352.
- Program language translation using a grammar-driven tree-to-tree model. CoRR, abs/1807.01784, 2018. URL http://arxiv.org/abs/1807.01784.
- Solving linear algebra by program synthesis. CoRR, abs/2111.08171, 2021. URL https://arxiv.org/abs/2111.08171.
- A neural network solves, explains, and generates university math problems by program synthesis and few-shot learning at human level. Proceedings of the National Academy of Sciences (PNAS), 119(32), 2022.
- Is a single model enough? mucos: A multi-model ensemble learning approach for semantic code search. In Gianluca Demartini, Guido Zuccon, J. Shane Culpepper, Zi Huang, and Hanghang Tong (eds.), CIKM ’21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1 - 5, 2021, pp. 2994–2998. ACM, 2021. doi: 10.1145/3459637.3482127. URL https://doi.org/10.1145/3459637.3482127.
- Spell: Streaming parsing of system event logs. In Francesco Bonchi, Josep Domingo-Ferrer, Ricardo Baeza-Yates, Zhi-Hua Zhou, and Xindong Wu (eds.), IEEE 16th International Conference on Data Mining, ICDM 2016, December 12-15, 2016, Barcelona, Spain, pp. 859–864. IEEE Computer Society, 2016. doi: 10.1109/ICDM.2016.0103. URL https://doi.org/10.1109/ICDM.2016.0103.
- Deeplog: Anomaly detection and diagnosis from system logs through deep learning. In Bhavani Thuraisingham, David Evans, Tal Malkin, and Dongyan Xu (eds.), Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, CCS 2017, Dallas, TX, USA, October 30 - November 03, 2017, pp. 1285–1298. ACM, 2017. doi: 10.1145/3133956.3134015. URL https://doi.org/10.1145/3133956.3134015.
- From commit message generation to history-aware commit message completion. In 38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023, Luxembourg, September 11-15, 2023, pp. 723–735. IEEE, 2023. doi: 10.1109/ASE56229.2023.00078. URL https://doi.org/10.1109/ASE56229.2023.00078.
- A C/C++ code vulnerability dataset with code changes and CVE summaries. In Sunghun Kim, Georgios Gousios, Sarah Nadi, and Joseph Hejderup (eds.), MSR ’20: 17th International Conference on Mining Software Repositories, Seoul, Republic of Korea, 29-30 June, 2020, pp. 508–512. ACM, 2020. doi: 10.1145/3379597.3387501. URL https://doi.org/10.1145/3379597.3387501.
- Automated repair of programs from large language models. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 1469–1481. IEEE, 2023. doi: 10.1109/ICSE48619.2023.00128. URL https://doi.org/10.1109/ICSE48619.2023.00128.
- Functional code clone detection with syntax and semantics fusion learning. In Sarfraz Khurshid and Corina S. Pasareanu (eds.), ISSTA ’20: 29th ACM SIGSOFT International Symposium on Software Testing and Analysis, Virtual Event, USA, July 18-22, 2020, pp. 516–527. ACM, 2020. doi: 10.1145/3395363.3397362. URL https://doi.org/10.1145/3395363.3397362.
- Self-attention networks for code search. Inf. Softw. Technol., 134:106542, 2021. doi: 10.1016/J.INFSOF.2021.106542. URL https://doi.org/10.1016/j.infsof.2021.106542.
- Towards revealing the mystery behind chain of thought: a theoretical perspective. CoRR, abs/2305.15408, 2023. doi: 10.48550/ARXIV.2305.15408. URL https://doi.org/10.48550/arXiv.2305.15408.
- Program synthesis using conflict-driven learning. In Jeffrey S. Foster and Dan Grossman (eds.), Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2018, Philadelphia, PA, USA, June 18-22, 2018, pp. 420–435. ACM, 2018. doi: 10.1145/3192366.3192382. URL https://doi.org/10.1145/3192366.3192382.
- Codebert: A pre-trained model for programming and natural languages. In Trevor Cohn, Yulan He, and Yang Liu (eds.), Findings of the Association for Computational Linguistics: EMNLP 2020, Online Event, 16-20 November 2020, volume EMNLP 2020 of Findings of ACL, pp. 1536–1547. Association for Computational Linguistics, 2020. doi: 10.18653/v1/2020.findings-emnlp.139. URL https://doi.org/10.18653/v1/2020.findings-emnlp.139.
- Structured neural summarization. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019. URL https://openreview.net/forum?id=H1ersoRqtm.
- Improving text-to-sql evaluation methodology. In Iryna Gurevych and Yusuke Miyao (eds.), Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15-20, 2018, Volume 1: Long Papers, pp. 351–360. Association for Computational Linguistics, 2018. doi: 10.18653/v1/P18-1033. URL https://aclanthology.org/P18-1033/.
- The significance of user-defined identifiers in java source code authorship identification. Comput. Syst. Sci. Eng., 26(2), 2011.
- Evosuite: automatic test suite generation for object-oriented software. In Tibor Gyimóthy and Andreas Zeller (eds.), SIGSOFT/FSE’11 19th ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE-19) and ESEC’11: 13th European Software Engineering Conference (ESEC-13), Szeged, Hungary, September 5-9, 2011, pp. 416–419. ACM, 2011. doi: 10.1145/2025113.2025179. URL https://doi.org/10.1145/2025113.2025179.
- A large-scale evaluation of automated unit test generation using evosuite. ACM Trans. Softw. Eng. Methodol., 24(2):8:1–8:42, 2014. doi: 10.1145/2685612. URL https://doi.org/10.1145/2685612.
- Incoder: A generative model for code infilling and synthesis. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. URL https://openreview.net/pdf?id=hQwb-lbM6EL.
- Codeapex: A bilingual programming evaluation benchmark for large language models. CoRR, abs/2309.01940, 2023. doi: 10.48550/arXiv.2309.01940. URL https://doi.org/10.48550/arXiv.2309.01940.
- Linevul: A transformer-based line-level vulnerability prediction. In 19th IEEE/ACM International Conference on Mining Software Repositories, MSR 2022, Pittsburgh, PA, USA, May 23-24, 2022, pp. 608–620. ACM, 2022. doi: 10.1145/3524842.3528452. URL https://doi.org/10.1145/3524842.3528452.
- Vulrepair: a t5-based automated software vulnerability repair. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 935–947. ACM, 2022. doi: 10.1145/3540250.3549098. URL https://doi.org/10.1145/3540250.3549098.
- Scalable inference and training of context-rich syntactic translation models. In Nicoletta Calzolari, Claire Cardie, and Pierre Isabelle (eds.), ACL 2006, 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, Sydney, Australia, 17-21 July 2006. The Association for Computer Linguistics, 2006. doi: 10.3115/1220175.1220296. URL https://aclanthology.org/P06-1121/.
- Towards robustness of text-to-sql models against synonym substitution. In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (eds.), Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021, pp. 2505–2515. Association for Computational Linguistics, 2021a. doi: 10.18653/V1/2021.ACL-LONG.195. URL https://doi.org/10.18653/v1/2021.acl-long.195.
- Exploring underexplored limitations of cross-domain text-to-sql generalization. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (eds.), Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, pp. 8926–8931. Association for Computational Linguistics, 2021b. doi: 10.18653/V1/2021.EMNLP-MAIN.702. URL https://doi.org/10.18653/v1/2021.emnlp-main.702.
- Measuring and improving compositional generalization in text-to-sql via component alignment. In Marine Carpuat, Marie-Catherine de Marneffe, and Iván Vladimir Meza Ruíz (eds.), Findings of the Association for Computational Linguistics: NAACL 2022, Seattle, WA, United States, July 10-15, 2022, pp. 831–843. Association for Computational Linguistics, 2022. doi: 10.18653/V1/2022.FINDINGS-NAACL.62. URL https://doi.org/10.18653/v1/2022.findings-naacl.62.
- Towards generalizable and robust text-to-sql parsing. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Findings of the Association for Computational Linguistics: EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 2113–2125. Association for Computational Linguistics, 2022. doi: 10.18653/V1/2022.FINDINGS-EMNLP.155. URL https://doi.org/10.18653/v1/2022.findings-emnlp.155.
- Text-to-sql empowered by large language models: A benchmark evaluation. CoRR, abs/2308.15363, 2023a. doi: 10.48550/ARXIV.2308.15363. URL https://doi.org/10.48550/arXiv.2308.15363.
- The pile: An 800gb dataset of diverse text for language modeling. CoRR, abs/2101.00027, 2021. URL https://arxiv.org/abs/2101.00027.
- PAL: program-aided language models. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (eds.), International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA, volume 202 of Proceedings of Machine Learning Research, pp. 10764–10799. PMLR, 2023b. URL https://proceedings.mlr.press/v202/gao23f.html.
- Code structure-guided transformer for source code summarization. ACM Trans. Softw. Eng. Methodol., 32(1):23:1–23:32, 2023c. doi: 10.1145/3522674. URL https://doi.org/10.1145/3522674.
- M2TS: multi-scale multi-modal approach based on transformer for source code summarization. In Ayushi Rastogi, Rosalia Tufano, Gabriele Bavota, Venera Arnaoudova, and Sonia Haiduc (eds.), Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, ICPC 2022, Virtual Event, May 16-17, 2022, pp. 24–35. ACM, 2022. doi: 10.1145/3524610.3527907. URL https://doi.org/10.1145/3524610.3527907.
- How far have we gone in vulnerability detection using large language models, 2023d.
- Automatic software repair: a survey. In Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (eds.), Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, pp. 1219. ACM, 2018. doi: 10.1145/3180155.3182526. URL https://doi.org/10.1145/3180155.3182526.
- The manybugs and introclass benchmarks for automated repair of C programs. IEEE Trans. Software Eng., 41(12):1236–1256, 2015. doi: 10.1109/TSE.2015.2454513. URL https://doi.org/10.1109/TSE.2015.2454513.
- The flores-101 evaluation benchmark for low-resource and multilingual machine translation. Trans. Assoc. Comput. Linguistics, 10:522–538, 2022. doi: 10.1162/TACL_A_00474. URL https://doi.org/10.1162/tacl_a_00474.
- Code search: A survey of techniques for finding code. ACM Comput. Surv., 55(11):220:1–220:31, 2023. doi: 10.1145/3565971. URL https://doi.org/10.1145/3565971.
- Multimodal representation for neural code search. In IEEE International Conference on Software Maintenance and Evolution, ICSME 2021, Luxembourg, September 27 - October 1, 2021, pp. 483–494. IEEE, 2021. doi: 10.1109/ICSME52107.2021.00049. URL https://doi.org/10.1109/ICSME52107.2021.00049.
- Muffin: Testing deep learning libraries via neural architecture fuzzing. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 1418–1430. ACM, 2022. doi: 10.1145/3510003.3510092. URL https://doi.org/10.1145/3510003.3510092.
- Deepam: Migrate apis with multi-modal sequence to sequence learning. In Carles Sierra (ed.), Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, Melbourne, Australia, August 19-25, 2017, pp. 3675–3681. ijcai.org, 2017. doi: 10.24963/IJCAI.2017/514. URL https://doi.org/10.24963/ijcai.2017/514.
- Deep code search. In Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (eds.), Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, pp. 933–944. ACM, 2018. doi: 10.1145/3180155.3180167. URL https://doi.org/10.1145/3180155.3180167.
- Textbooks are all you need. CoRR, abs/2306.11644, 2023. doi: 10.48550/arXiv.2306.11644. URL https://doi.org/10.48550/arXiv.2306.11644.
- Retrieval-augmented gpt-3.5-based text-to-sql framework with sample-aware prompting and dynamic revision chain. In Biao Luo, Long Cheng, Zheng-Guang Wu, Hongyi Li, and Chaojie Li (eds.), Neural Information Processing - 30th International Conference, ICONIP 2023, Changsha, China, November 20-23, 2023, Proceedings, Part VI, volume 14452 of Lecture Notes in Computer Science, pp. 341–356. Springer, 2023a. doi: 10.1007/978-981-99-8076-5_25. URL https://doi.org/10.1007/978-981-99-8076-5_25.
- Graphcodebert: Pre-training code representations with data flow. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021a. URL https://openreview.net/forum?id=jLoC4ez43PZ.
- Unixcoder: Unified cross-modal pre-training for code representation. In Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (eds.), Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022, pp. 7212–7225. Association for Computational Linguistics, 2022. doi: 10.18653/v1/2022.acl-long.499. URL https://doi.org/10.18653/v1/2022.acl-long.499.
- Longcoder: A long-range pre-trained language model for code completion. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (eds.), International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA, volume 202 of Proceedings of Machine Learning Research, pp. 12098–12107. PMLR, 2023b. URL https://proceedings.mlr.press/v202/guo23j.html.
- Logbert: Log anomaly detection via BERT. In International Joint Conference on Neural Networks, IJCNN 2021, Shenzhen, China, July 18-22, 2021, pp. 1–8. IEEE, 2021b. doi: 10.1109/IJCNN52387.2021.9534113. URL https://doi.org/10.1109/IJCNN52387.2021.9534113.
- Towards complex text-to-sql in cross-domain database with intermediate representation. In Anna Korhonen, David R. Traum, and Lluís Màrquez (eds.), Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28- August 2, 2019, Volume 1: Long Papers, pp. 4524–4535. Association for Computational Linguistics, 2019. doi: 10.18653/v1/p19-1444. URL https://doi.org/10.18653/v1/p19-1444.
- Audee: Automated testing for deep learning frameworks. In 35th IEEE/ACM International Conference on Automated Software Engineering, ASE 2020, Melbourne, Australia, September 21-25, 2020, pp. 486–498. IEEE, 2020. doi: 10.1145/3324884.3416571. URL https://doi.org/10.1145/3324884.3416571.
- Intelligent code reviews using deep learning, 2018.
- Deepfix: Fixing common C language errors by deep learning. In Satinder Singh and Shaul Markovitch (eds.), Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4-9, 2017, San Francisco, California, USA, pp. 1345–1351. AAAI Press, 2017. doi: 10.1609/aaai.v31i1.10742. URL https://doi.org/10.1609/aaai.v31i1.10742.
- Bugsjs: a benchmark of javascript bugs. In 12th IEEE Conference on Software Testing, Validation and Verification, ICST 2019, Xi’an, China, April 22-27, 2019, pp. 90–101. IEEE, 2019. doi: 10.1109/ICST.2019.00019. URL https://doi.org/10.1109/ICST.2019.00019.
- Discovering bug patterns in javascript. In Thomas Zimmermann, Jane Cleland-Huang, and Zhendong Su (eds.), Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE 2016, Seattle, WA, USA, November 13-18, 2016, pp. 144–156. ACM, 2016. doi: 10.1145/2950290.2950308. URL https://doi.org/10.1145/2950290.2950308.
- Vulberta: Simplified source code pre-training for vulnerability detection. In International Joint Conference on Neural Networks, IJCNN 2022, Padua, Italy, July 18-23, 2022, pp. 1–8. IEEE, 2022. doi: 10.1109/IJCNN55064.2022.9892280. URL https://doi.org/10.1109/IJCNN55064.2022.9892280.
- Aixbench: A code generation benchmark dataset. CoRR, abs/2206.13179, 2022. doi: 10.48550/arXiv.2206.13179. URL https://doi.org/10.48550/arXiv.2206.13179.
- Improved automatic summarization of subroutines via attention to file context. In Sunghun Kim, Georgios Gousios, Sarah Nadi, and Joseph Hejderup (eds.), MSR ’20: 17th International Conference on Mining Software Repositories, Seoul, Republic of Korea, 29-30 June, 2020, pp. 300–310. ACM, 2020. doi: 10.1145/3379597.3387449. URL https://doi.org/10.1145/3379597.3387449.
- A survey on automated software vulnerability detection using machine learning and deep learning. CoRR, abs/2306.11673, 2023. doi: 10.48550/ARXIV.2306.11673. URL https://doi.org/10.48550/arXiv.2306.11673.
- Maxsmt-based type inference for python 3. In Hana Chockler and Georg Weissenbacher (eds.), Computer Aided Verification - 30th International Conference, CAV 2018, Held as Part of the Federated Logic Conference, FloC 2018, Oxford, UK, July 14-17, 2018, Proceedings, Part II, volume 10982 of Lecture Notes in Computer Science, pp. 12–19. Springer, 2018. doi: 10.1007/978-3-319-96142-2_2. URL https://doi.org/10.1007/978-3-319-96142-2_2.
- Retrieval-based neural code generation. In Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun’ichi Tsujii (eds.), Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018, pp. 925–930. Association for Computational Linguistics, 2018. doi: 10.18653/V1/D18-1111. URL https://doi.org/10.18653/v1/d18-1111.
- Text-to-sql in the wild: A naturally-occurring dataset based on stack exchange data. CoRR, abs/2106.05006, 2021. URL https://arxiv.org/abs/2106.05006.
- Debin: Predicting debug information in stripped binaries. In David Lie, Mohammad Mannan, Michael Backes, and XiaoFeng Wang (eds.), Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, CCS 2018, Toronto, ON, Canada, October 15-19, 2018, pp. 1667–1680. ACM, 2018. doi: 10.1145/3243734.3243866. URL https://doi.org/10.1145/3243734.3243866.
- Drain: An online log parsing approach with fixed depth tree. In Ilkay Altintas and Shiping Chen (eds.), 2017 IEEE International Conference on Web Services, ICWS 2017, Honolulu, HI, USA, June 25-30, 2017, pp. 33–40. IEEE, 2017. doi: 10.1109/ICWS.2017.13. URL https://doi.org/10.1109/ICWS.2017.13.
- Loghub: A large collection of system log datasets towards automated log analytics. CoRR, abs/2008.06448, 2020. URL https://arxiv.org/abs/2008.06448.
- COME: commit message generation with modification embedding. In René Just and Gordon Fraser (eds.), Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2023, Seattle, WA, USA, July 17-21, 2023, pp. 792–803. ACM, 2023a. doi: 10.1145/3597926.3598096. URL https://doi.org/10.1145/3597926.3598096.
- Exploring human-like translation strategy with large language models. CoRR, abs/2305.04118, 2023b. doi: 10.48550/ARXIV.2305.04118. URL https://doi.org/10.48550/arXiv.2305.04118.
- Solving math word problems by combining language models with symbolic solvers. CoRR, abs/2304.09102, 2023. doi: 10.48550/ARXIV.2304.09102. URL https://doi.org/10.48550/arXiv.2304.09102.
- Are deep neural networks the best choice for modeling source code? In Eric Bodden, Wilhelm Schäfer, Arie van Deursen, and Andrea Zisman (eds.), Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, ESEC/FSE 2017, Paderborn, Germany, September 4-8, 2017, pp. 763–773. ACM, 2017. doi: 10.1145/3106237.3106290. URL https://doi.org/10.1145/3106237.3106290.
- Deep learning type inference. In Gary T. Leavens, Alessandro Garcia, and Corina S. Pasareanu (eds.), Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2018, Lake Buena Vista, FL, USA, November 04-09, 2018, pp. 152–162. ACM, 2018. doi: 10.1145/3236024.3236051. URL https://doi.org/10.1145/3236024.3236051.
- Global relational models of source code. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020. URL https://openreview.net/forum?id=B1lnbRNtwr.
- Towards automating code review at scale. In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 1479–1482. ACM, 2021. doi: 10.1145/3468264.3473134. URL https://doi.org/10.1145/3468264.3473134.
- The ATIS spoken language systems pilot corpus. In Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, USA, June 24-27, 1990. Morgan Kaufmann, 1990. URL https://aclanthology.org/H90-1021/.
- Measuring coding challenge competence with APPS. In Joaquin Vanschoren and Sai-Kit Yeung (eds.), Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, NeurIPS Datasets and Benchmarks 2021, December 2021, virtual, 2021a. URL https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/hash/c24cd76e1ce41366a4bbe8a49b02a028-Abstract-round2.html.
- Measuring massive multitask language understanding. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021b. URL https://openreview.net/forum?id=d7KBjmI3GmQ.
- Neural code search revisited: Enhancing code snippet retrieval through natural language intent. CoRR, abs/2008.12193, 2020. URL https://arxiv.org/abs/2008.12193.
- On the naturalness of software. In Martin Glinz, Gail C. Murphy, and Mauro Pezzè (eds.), 34th International Conference on Software Engineering, ICSE 2012, June 2-9, 2012, Zurich, Switzerland, pp. 837–847. IEEE Computer Society, 2012. doi: 10.1109/ICSE.2012.6227135. URL https://doi.org/10.1109/ICSE.2012.6227135.
- Denoising diffusion probabilistic models. In Hugo Larochelle, Marc’Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (eds.), Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, 2020. URL https://proceedings.neurips.cc/paper/2020/hash/4c5bcfec8584af0d967f1ab10179ca4b-Abstract.html.
- Cc2vec: distributed representations of code changes. In Gregg Rothermel and Doo-Hwan Bae (eds.), ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, pp. 518–529. ACM, 2020. doi: 10.1145/3377811.3380361. URL https://doi.org/10.1145/3377811.3380361.
- Long short-term memory. Neural Comput., 9(8):1735–1780, 1997. doi: 10.1162/neco.1997.9.8.1735. URL https://doi.org/10.1162/neco.1997.9.8.1735.
- Training compute-optimal large language models. CoRR, abs/2203.15556, 2022. doi: 10.48550/arXiv.2203.15556. URL https://doi.org/10.48550/arXiv.2203.15556.
- Metagpt: Meta programming for multi-agent collaborative framework. CoRR, abs/2308.00352, 2023. doi: 10.48550/ARXIV.2308.00352. URL https://doi.org/10.48550/arXiv.2308.00352.
- Commentfinder: a simpler, faster, more accurate code review comments recommendation. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 507–519. ACM, 2022. doi: 10.1145/3540250.3549119. URL https://doi.org/10.1145/3540250.3549119.
- Unnatural instructions: Tuning language models with (almost) no human labor. In Anna Rogers, Jordan L. Boyd-Graber, and Naoaki Okazaki (eds.), Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pp. 14409–14428. Association for Computational Linguistics, 2023. doi: 10.18653/v1/2023.acl-long.806. URL https://doi.org/10.18653/v1/2023.acl-long.806.
- Large language models for software engineering: A systematic literature review. CoRR, abs/2308.10620, 2023. doi: 10.48550/ARXIV.2308.10620. URL https://doi.org/10.48550/arXiv.2308.10620.
- Neural joint attention code search over structure embeddings for software q&a sites. J. Syst. Softw., 170:110773, 2020. doi: 10.1016/J.JSS.2020.110773. URL https://doi.org/10.1016/j.jss.2020.110773.
- Deep code comment generation. In Foutse Khomh, Chanchal K. Roy, and Janet Siegmund (eds.), Proceedings of the 26th Conference on Program Comprehension, ICPC 2018, Gothenburg, Sweden, May 27-28, 2018, pp. 200–210. ACM, 2018a. doi: 10.1145/3196321.3196334. URL https://doi.org/10.1145/3196321.3196334.
- Summarizing source code with transferred API knowledge. In Jérôme Lang (ed.), Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI 2018, July 13-19, 2018, Stockholm, Sweden, pp. 2269–2275. ijcai.org, 2018b. doi: 10.24963/ijcai.2018/314. URL https://doi.org/10.24963/ijcai.2018/314.
- Re-factoring based program repair applied to programming assignments. In 34th IEEE/ACM International Conference on Automated Software Engineering, ASE 2019, San Diego, CA, USA, November 11-15, 2019, pp. 388–398. IEEE, 2019. doi: 10.1109/ASE.2019.00044. URL https://doi.org/10.1109/ASE.2019.00044.
- Neural program synthesis with query. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022a. URL https://openreview.net/forum?id=NyJ2KIN8P17.
- Cosqa: 20, 000+ web queries for code search and question answering. In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (eds.), Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021, pp. 5690–5700. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.acl-long.442. URL https://doi.org/10.18653/v1/2021.acl-long.442.
- Execution-based evaluation for data science code generation models. CoRR, abs/2211.09374, 2022b. doi: 10.48550/ARXIV.2211.09374. URL https://doi.org/10.48550/arXiv.2211.09374.
- A survey on automated program repair techniques. CoRR, abs/2303.18184, 2023a. doi: 10.48550/ARXIV.2303.18184. URL https://doi.org/10.48550/arXiv.2303.18184.
- API method recommendation without worrying about the task-api knowledge gap. In Marianne Huchard, Christian Kästner, and Gordon Fraser (eds.), Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, ASE 2018, Montpellier, France, September 3-7, 2018, pp. 293–304. ACM, 2018. doi: 10.1145/3238147.3238191. URL https://doi.org/10.1145/3238147.3238191.
- Learning human-written commit messages to document code changes. J. Comput. Sci. Technol., 35(6):1258–1277, 2020. doi: 10.1007/S11390-020-0496-0. URL https://doi.org/10.1007/s11390-020-0496-0.
- C-eval: A multi-level multi-discipline chinese evaluation suite for foundation models. CoRR, abs/2305.08322, 2023b. doi: 10.48550/arXiv.2305.08322. URL https://doi.org/10.48550/arXiv.2305.08322.
- Review4repair: Code review aided automatic program repairing. Inf. Softw. Technol., 143:106765, 2022. doi: 10.1016/J.INFSOF.2021.106765. URL https://doi.org/10.1016/j.infsof.2021.106765.
- Codesearchnet challenge: Evaluating the state of semantic code search. CoRR, abs/1909.09436, 2019. URL http://arxiv.org/abs/1909.09436.
- A comprehensive exploration on wikisql with table-aware word contextualization. CoRR, abs/1902.01069, 2019. URL http://arxiv.org/abs/1902.01069.
- Summarizing source code using a neural attention model. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7-12, 2016, Berlin, Germany, Volume 1: Long Papers. The Association for Computer Linguistics, 2016. doi: 10.18653/v1/p16-1195. URL https://doi.org/10.18653/v1/p16-1195.
- Learning a neural semantic parser from user feedback. In Regina Barzilay and Min-Yen Kan (eds.), Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pp. 963–973. Association for Computational Linguistics, 2017. doi: 10.18653/V1/P17-1089. URL https://doi.org/10.18653/v1/P17-1089.
- Mapping language to code in programmatic context. In Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun’ichi Tsujii (eds.), Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018, pp. 1643–1652. Association for Computational Linguistics, 2018. doi: 10.18653/v1/d18-1192. URL https://doi.org/10.18653/v1/d18-1192.
- OPT-IML: scaling language model instruction meta learning through the lens of generalization. CoRR, abs/2212.12017, 2022. doi: 10.48550/arXiv.2212.12017. URL https://doi.org/10.48550/arXiv.2212.12017.
- Jigsaw: Large language models meet program synthesis. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 1219–1231. ACM, 2022. doi: 10.1145/3510003.3510203. URL https://doi.org/10.1145/3510003.3510203.
- Attention, compilation, and solver-based symbolic analysis are all you need. CoRR, abs/2306.06755, 2023. doi: 10.48550/ARXIV.2306.06755. URL https://doi.org/10.48550/arXiv.2306.06755.
- Predicting variable types in dynamically typed programming languages. CoRR, abs/1901.05138, 2019. URL http://arxiv.org/abs/1901.05138.
- Manytypes4typescript: A comprehensive typescript dataset for sequence-based type inference. In 19th IEEE/ACM International Conference on Mining Software Repositories, MSR 2022, Pittsburgh, PA, USA, May 23-24, 2022, pp. 294–298. ACM, 2022. doi: 10.1145/3524842.3528507. URL https://doi.org/10.1145/3524842.3528507.
- Learning type annotation: is big data enough? In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 1483–1486. ACM, 2021. doi: 10.1145/3468264.3473135. URL https://doi.org/10.1145/3468264.3473135.
- Machine learning based recommendation of method names: How far are we. In 34th IEEE/ACM International Conference on Automated Software Engineering, ASE 2019, San Diego, CA, USA, November 11-15, 2019, pp. 602–614. IEEE, 2019. doi: 10.1109/ASE.2019.00062. URL https://doi.org/10.1109/ASE.2019.00062.
- DECKARD: scalable and accurate tree-based detection of code clones. In 29th International Conference on Software Engineering (ICSE 2007), Minneapolis, MN, USA, May 20-26, 2007, pp. 96–105. IEEE Computer Society, 2007. doi: 10.1109/ICSE.2007.30. URL https://doi.org/10.1109/ICSE.2007.30.
- CURE: code-aware neural machine translation for automatic program repair. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22-30 May 2021, pp. 1161–1173. IEEE, 2021a. doi: 10.1109/ICSE43902.2021.00107. URL https://doi.org/10.1109/ICSE43902.2021.00107.
- Impact of code language models on automated program repair. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 1430–1442. IEEE, 2023a. doi: 10.1109/ICSE48619.2023.00125. URL https://doi.org/10.1109/ICSE48619.2023.00125.
- Towards automatic generation of short summaries of commits. In Giuseppe Scanniello, David Lo, and Alexander Serebrenik (eds.), Proceedings of the 25th International Conference on Program Comprehension, ICPC 2017, Buenos Aires, Argentina, May 22-23, 2017, pp. 320–323. IEEE Computer Society, 2017. doi: 10.1109/ICPC.2017.12. URL https://doi.org/10.1109/ICPC.2017.12.
- Automatically generating commit messages from diffs using neural machine translation. In Grigore Rosu, Massimiliano Di Penta, and Tien N. Nguyen (eds.), Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, ASE 2017, Urbana, IL, USA, October 30 - November 03, 2017, pp. 135–146. IEEE Computer Society, 2017. doi: 10.1109/ASE.2017.8115626. URL https://doi.org/10.1109/ASE.2017.8115626.
- Treebert: A tree-based pre-trained model for programming language. In Cassio P. de Campos, Marloes H. Maathuis, and Erik Quaeghebeur (eds.), Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, UAI 2021, Virtual Event, 27-30 July 2021, volume 161 of Proceedings of Machine Learning Research, pp. 54–63. AUAI Press, 2021b. URL https://proceedings.mlr.press/v161/jiang21a.html.
- Llmparser: A llm-based log parsing framework. CoRR, abs/2310.01796, 2023b. doi: 10.48550/ARXIV.2310.01796. URL https://doi.org/10.48550/arXiv.2310.01796.
- A large-scale benchmark for log parsing. CoRR, abs/2308.10828, 2023c. doi: 10.48550/ARXIV.2308.10828. URL https://doi.org/10.48550/arXiv.2308.10828.
- On the evaluation of neural code translation: Taxonomy and benchmark. CoRR, abs/2308.08961, 2023. doi: 10.48550/ARXIV.2308.08961. URL https://doi.org/10.48550/arXiv.2308.08961.
- Swe-bench: Can language models resolve real-world github issues? CoRR, abs/2310.06770, 2023. doi: 10.48550/ARXIV.2310.06770. URL https://doi.org/10.48550/arXiv.2310.06770.
- Repair is nearly generation: Multilingual program repair with llms. In Brian Williams, Yiling Chen, and Jennifer Neville (eds.), Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023, pp. 5131–5140. AAAI Press, 2023. doi: 10.1609/AAAI.V37I4.25642. URL https://doi.org/10.1609/aaai.v37i4.25642.
- Tae-Hwan Jung. Commitbert: Commit message generation using pre-trained programming language model. CoRR, abs/2105.14242, 2021. URL https://arxiv.org/abs/2105.14242.
- René Just. The major mutation framework: efficient and scalable mutation analysis for java. In Corina S. Pasareanu and Darko Marinov (eds.), International Symposium on Software Testing and Analysis, ISSTA ’14, San Jose, CA, USA - July 21 - 26, 2014, pp. 433–436. ACM, 2014. doi: 10.1145/2610384.2628053. URL https://doi.org/10.1145/2610384.2628053.
- Defects4j: a database of existing faults to enable controlled testing studies for java programs. In Corina S. Pasareanu and Darko Marinov (eds.), International Symposium on Software Testing and Analysis, ISSTA ’14, San Jose, CA, USA - July 21 - 26, 2014, pp. 437–440. ACM, 2014. doi: 10.1145/2610384.2628055. URL https://doi.org/10.1145/2610384.2628055.
- Code authorship attribution: Methods and challenges. ACM Comput. Surv., 52(1):3:1–3:36, 2019. doi: 10.1145/3292577. URL https://doi.org/10.1145/3292577.
- Neural-guided deductive search for real-time program synthesis from examples. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018. URL https://openreview.net/forum?id=rywDjg-RW.
- Learning and evaluating contextual embedding of source code. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event, volume 119 of Proceedings of Machine Learning Research, pp. 5110–5121. PMLR, 2020. URL http://proceedings.mlr.press/v119/kanade20a.html.
- Scaling laws for neural language models. CoRR, abs/2001.08361, 2020. URL https://arxiv.org/abs/2001.08361.
- Phrase-based statistical translation of programming languages. In Andrew P. Black, Shriram Krishnamurthi, Bernd Bruegge, and Joseph N. Ruskiewicz (eds.), Onward! 2014, Proceedings of the 2014 ACM International Symposium on New Ideas, New Paradigms, and Reflections on Programming & Software, part of SPLASH ’14, Portland, OR, USA, October 20-24, 2014, pp. 173–184. ACM, 2014. doi: 10.1145/2661136.2661148. URL https://doi.org/10.1145/2661136.2661148.
- How often do single-statement bugs occur?: The manysstubs4j dataset. In Sunghun Kim, Georgios Gousios, Sarah Nadi, and Joseph Hejderup (eds.), MSR ’20: 17th International Conference on Mining Software Repositories, Seoul, Republic of Korea, 29-30 June, 2020, pp. 573–577. ACM, 2020. doi: 10.1145/3379597.3387491. URL https://doi.org/10.1145/3379597.3387491.
- Big code != big vocabulary: open-vocabulary models for source code. In Gregg Rothermel and Doo-Hwan Bae (eds.), ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, pp. 1073–1085. ACM, 2020. doi: 10.1145/3377811.3380342. URL https://doi.org/10.1145/3377811.3380342.
- Automated customized bug-benchmark generation. In 19th International Working Conference on Source Code Analysis and Manipulation, SCAM 2019, Cleveland, OH, USA, September 30 - October 1, 2019, pp. 103–114. IEEE, 2019. doi: 10.1109/SCAM.2019.00020. URL https://doi.org/10.1109/SCAM.2019.00020.
- A survey on deep learning approaches for text-to-sql. VLDB J., 32(4):905–936, 2023. doi: 10.1007/S00778-022-00776-8. URL https://doi.org/10.1007/s00778-022-00776-8.
- Bertrand-dr: Improving text-to-sql using a discriminative re-ranker. CoRR, abs/2002.00557, 2020. URL https://arxiv.org/abs/2002.00557.
- Evaluating few shot and contrastive learning methods for code clone detection. CoRR, abs/2204.07501, 2022. doi: 10.48550/ARXIV.2204.07501. URL https://doi.org/10.48550/arXiv.2204.07501.
- xcodeeval: A large scale multilingual multitask benchmark for code understanding, generation, translation and retrieval. CoRR, abs/2303.03004, 2023. doi: 10.48550/ARXIV.2303.03004. URL https://doi.org/10.48550/arXiv.2303.03004.
- Efficient mutation testing via pre-trained language models. CoRR, abs/2301.03543, 2023a. doi: 10.48550/arXiv.2301.03543. URL https://doi.org/10.48550/arXiv.2301.03543.
- ibir: Bug-report-driven fault injection. ACM Trans. Softw. Eng. Methodol., 32(2):33:1–33:31, 2023b. doi: 10.1145/3542946. URL https://doi.org/10.1145/3542946.
- Facoy: a code-to-code search engine. In Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (eds.), Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, pp. 946–957. ACM, 2018. doi: 10.1145/3180155.3180187. URL https://doi.org/10.1145/3180155.3180187.
- Reformer: The efficient transformer. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020. URL https://openreview.net/forum?id=rkgNKkHtvB.
- The stack: 3 TB of permissively licensed source code. CoRR, abs/2211.15533, 2022. doi: 10.48550/arXiv.2211.15533. URL https://doi.org/10.48550/arXiv.2211.15533.
- Code comment generation based on graph neural network enhanced transformer model for code understanding in open-source software ecosystems. Autom. Softw. Eng., 29(2):43, 2022. doi: 10.1007/S10515-022-00341-1. URL https://doi.org/10.1007/s10515-022-00341-1.
- Spoc: Search-based pseudocode to code. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 11883–11894, 2019. URL https://proceedings.neurips.cc/paper/2019/hash/7298332f04ac004a0ca44cc69ecf6f6b-Abstract.html.
- Deep learning driven natural languages text to SQL query conversion: A survey. CoRR, abs/2208.04415, 2022. doi: 10.48550/ARXIV.2208.04415. URL https://doi.org/10.48550/arXiv.2208.04415.
- DOBF: A deobfuscation pre-training objective for programming languages. In Marc’Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, and Jennifer Wortman Vaughan (eds.), Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, pp. 14967–14979, 2021. URL https://proceedings.neurips.cc/paper/2021/hash/7d6548bdc0082aacc950ed35e91fcccb-Abstract.html.
- DIRE: A neural approach to decompiled identifier naming. In 34th IEEE/ACM International Conference on Automated Software Engineering, ASE 2019, San Diego, CA, USA, November 11-15, 2019, pp. 628–639. IEEE, 2019. doi: 10.1109/ASE.2019.00064. URL https://doi.org/10.1109/ASE.2019.00064.
- Interactive code generation via test-driven user-intent formalization. CoRR, abs/2208.05950, 2022. doi: 10.48550/arXiv.2208.05950. URL https://doi.org/10.48550/arXiv.2208.05950.
- DS-1000: A natural and reliable benchmark for data science code generation. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (eds.), International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA, volume 202 of Proceedings of Machine Learning Research, pp. 18319–18345. PMLR, 2023. URL https://proceedings.mlr.press/v202/lai23b.html.
- ALBERT: A lite BERT for self-supervised learning of language representations. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020. URL https://openreview.net/forum?id=H1eA7AEtvS.
- Deep learning for anomaly detection in log data: A survey. CoRR, abs/2207.03820, 2022. doi: 10.48550/ARXIV.2207.03820. URL https://doi.org/10.48550/arXiv.2207.03820.
- LLVM: A compilation framework for lifelong program analysis & transformation. In 2nd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2004), 20-24 March 2004, San Jose, CA, USA, pp. 75–88. IEEE Computer Society, 2004. doi: 10.1109/CGO.2004.1281665. URL https://doi.org/10.1109/CGO.2004.1281665.
- The bigscience ROOTS corpus: A 1.6tb composite multilingual dataset. In NeurIPS, 2022. URL http://papers.nips.cc/paper_files/paper/2022/hash/ce9e92e3de2372a4b93353eb7f3dc0bd-Abstract-Datasets_and_Benchmarks.html.
- Coderl: Mastering code generation through pretrained models and deep reinforcement learning. In NeurIPS, 2022. URL http://papers.nips.cc/paper_files/paper/2022/hash/8636419dea1aa9fbd25fc4248e702da4-Abstract-Conference.html.
- Log-based anomaly detection without log parsing. In 36th IEEE/ACM International Conference on Automated Software Engineering, ASE 2021, Melbourne, Australia, November 15-19, 2021, pp. 492–504. IEEE, 2021. doi: 10.1109/ASE51524.2021.9678773. URL https://doi.org/10.1109/ASE51524.2021.9678773.
- Log parsing with prompt-based few-shot learning. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 2438–2449. IEEE, 2023a. doi: 10.1109/ICSE48619.2023.00204. URL https://doi.org/10.1109/ICSE48619.2023.00204.
- Log parsing: How far can chatgpt go? In 38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023, Luxembourg, September 11-15, 2023, pp. 1699–1704. IEEE, 2023b. doi: 10.1109/ASE56229.2023.00206. URL https://doi.org/10.1109/ASE56229.2023.00206.
- A neural model for generating natural language summaries of program subroutines. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 795–806. IEEE / ACM, 2019. doi: 10.1109/ICSE.2019.00087. URL https://doi.org/10.1109/ICSE.2019.00087.
- CS1QA: A dataset for assisting code-based question answering in an introductory programming course. In Marine Carpuat, Marie-Catherine de Marneffe, and Iván Vladimir Meza Ruíz (eds.), Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022, Seattle, WA, United States, July 10-15, 2022, pp. 2026–2040. Association for Computational Linguistics, 2022. doi: 10.18653/V1/2022.NAACL-MAIN.148. URL https://doi.org/10.18653/v1/2022.naacl-main.148.
- Kaggledbqa: Realistic evaluation of text-to-sql parsers. In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (eds.), Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021, pp. 2261–2273. Association for Computational Linguistics, 2021. doi: 10.18653/V1/2021.ACL-LONG.176. URL https://doi.org/10.18653/v1/2021.acl-long.176.
- Accelerating search-based program synthesis using learned probabilistic models. In Jeffrey S. Foster and Dan Grossman (eds.), Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2018, Philadelphia, PA, USA, June 18-22, 2018, pp. 436–449. ACM, 2018. doi: 10.1145/3192366.3192410. URL https://doi.org/10.1145/3192366.3192410.
- Re-examining the role of schema linking in text-to-sql. In Bonnie Webber, Trevor Cohn, Yulan He, and Yang Liu (eds.), Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, Online, November 16-20, 2020, pp. 6943–6954. Association for Computational Linguistics, 2020. doi: 10.18653/V1/2020.EMNLP-MAIN.564. URL https://doi.org/10.18653/v1/2020.emnlp-main.564.
- Fairfuzz: a targeted mutation strategy for increasing greybox fuzz testing coverage. In Marianne Huchard, Christian Kästner, and Gordon Fraser (eds.), Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, ASE 2018, Montpellier, France, September 3-7, 2018, pp. 475–485. ACM, 2018. doi: 10.1145/3238147.3238176. URL https://doi.org/10.1145/3238147.3238176.
- Codamosa: Escaping coverage plateaus in test generation with pre-trained large language models. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 919–931. IEEE, 2023. doi: 10.1109/ICSE48619.2023.00085. URL https://doi.org/10.1109/ICSE48619.2023.00085.
- Unsupervised pivot translation for distant languages. In Anna Korhonen, David R. Traum, and Lluís Màrquez (eds.), Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28- August 2, 2019, Volume 1: Long Papers, pp. 175–183. Association for Computational Linguistics, 2019. doi: 10.18653/v1/p19-1017. URL https://doi.org/10.18653/v1/p19-1017.
- The power of scale for parameter-efficient prompt tuning. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (eds.), Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, pp. 3045–3059. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.emnlp-main.243. URL https://doi.org/10.18653/v1/2021.emnlp-main.243.
- BART: denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel R. Tetreault (eds.), Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020, pp. 7871–7880. Association for Computational Linguistics, 2020. doi: 10.18653/v1/2020.acl-main.703. URL https://doi.org/10.18653/v1/2020.acl-main.703.
- Solving quantitative reasoning problems with language models. In NeurIPS, 2022. URL http://papers.nips.cc/paper_files/paper/2022/hash/18abbeef8cfe9203fdf9053c9c4fe191-Abstract-Conference.html.
- Constructing an interactive natural language interface for relational databases. Proc. VLDB Endow., 8(1):73–84, 2014. doi: 10.14778/2735461.2735468. URL http://www.vldb.org/pvldb/vol8/p73-li.pdf.
- SAGA: efficient and large-scale detection of near-miss clones with GPU acceleration. In Kostas Kontogiannis, Foutse Khomh, Alexander Chatzigeorgiou, Marios-Eleftherios Fokaefs, and Minghui Zhou (eds.), 27th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2020, London, ON, Canada, February 18-21, 2020, pp. 272–283. IEEE, 2020a. doi: 10.1109/SANER48275.2020.9054832. URL https://doi.org/10.1109/SANER48275.2020.9054832.
- Exploring representation-level augmentation for code search. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 4924–4936. Association for Computational Linguistics, 2022a. doi: 10.18653/v1/2022.emnlp-main.327. URL https://doi.org/10.18653/v1/2022.emnlp-main.327.
- CMMLU: measuring massive multitask language understanding in chinese. CoRR, abs/2306.09212, 2023a. doi: 10.48550/ARXIV.2306.09212. URL https://doi.org/10.48550/arXiv.2306.09212.
- RESDSQL: decoupling schema linking and skeleton parsing for text-to-sql. In Brian Williams, Yiling Chen, and Jennifer Neville (eds.), Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023, pp. 13067–13075. AAAI Press, 2023b. doi: 10.1609/AAAI.V37I11.26535. URL https://doi.org/10.1609/aaai.v37i11.26535.
- Deepreview: Automatic code review using deep multi-instance learning. In Qiang Yang, Zhi-Hua Zhou, Zhiguo Gong, Min-Ling Zhang, and Sheng-Jun Huang (eds.), Advances in Knowledge Discovery and Data Mining - 23rd Pacific-Asia Conference, PAKDD 2019, Macau, China, April 14-17, 2019, Proceedings, Part II, volume 11440 of Lecture Notes in Computer Science, pp. 318–330. Springer, 2019a. doi: 10.1007/978-3-030-16145-3_25. URL https://doi.org/10.1007/978-3-030-16145-3_25.
- Neural code search evaluation dataset. CoRR, abs/1908.09804, 2019b. URL http://arxiv.org/abs/1908.09804.
- ZC3: zero-shot cross-language code clone detection. CoRR, abs/2308.13754, 2023c. doi: 10.48550/ARXIV.2308.13754. URL https://doi.org/10.48550/arXiv.2308.13754.
- Acecoder: Utilizing existing code to enhance code generation, 2023d.
- Code completion with neural attention and pointer networks. In Jérôme Lang (ed.), Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI 2018, July 13-19, 2018, Stockholm, Sweden, pp. 4159–4165. ijcai.org, 2018a. doi: 10.24963/ijcai.2018/578. URL https://doi.org/10.24963/ijcai.2018/578.
- Graphix-t5: Mixing pre-trained transformers with graph-aware layers for text-to-sql parsing. In Brian Williams, Yiling Chen, and Jennifer Neville (eds.), Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023, pp. 13076–13084. AAAI Press, 2023e. doi: 10.1609/AAAI.V37I11.26536. URL https://doi.org/10.1609/aaai.v37i11.26536.
- Can LLM already serve as A database interface? A big bench for large-scale database grounded text-to-sqls. CoRR, abs/2305.03111, 2023f. doi: 10.48550/ARXIV.2305.03111. URL https://doi.org/10.48550/arXiv.2305.03111.
- AUGER: automatically generating review comments with pre-training models. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 1009–1021. ACM, 2022b. doi: 10.1145/3540250.3549099. URL https://doi.org/10.1145/3540250.3549099.
- Starcoder: may the source be with you! CoRR, abs/2305.06161, 2023g. doi: 10.48550/arXiv.2305.06161. URL https://doi.org/10.48550/arXiv.2305.06161.
- Hierarchical embedding for code search in software q&a sites. In 2020 International Joint Conference on Neural Networks, IJCNN 2020, Glasgow, United Kingdom, July 19-24, 2020, pp. 1–10. IEEE, 2020b. doi: 10.1109/IJCNN48605.2020.9207101. URL https://doi.org/10.1109/IJCNN48605.2020.9207101.
- Diffusion-lm improves controllable text generation. In NeurIPS, 2022c. URL http://papers.nips.cc/paper_files/paper/2022/hash/1be5bc25d50895ee656b8c2d9eb89d6a-Abstract-Conference.html.
- Coderetriever: A large scale contrastive pre-training method for code search. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 2898–2910. Association for Computational Linguistics, 2022d. doi: 10.18653/v1/2022.emnlp-main.187. URL https://doi.org/10.18653/v1/2022.emnlp-main.187.
- Soft-labeled contrastive pre-training for function-level code representation. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Findings of the Association for Computational Linguistics: EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 118–129. Association for Computational Linguistics, 2022e. doi: 10.18653/v1/2022.findings-emnlp.9. URL https://doi.org/10.18653/v1/2022.findings-emnlp.9.
- Relationship-aware code search for javascript frameworks. In Thomas Zimmermann, Jane Cleland-Huang, and Zhendong Su (eds.), Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE 2016, Seattle, WA, USA, November 13-18, 2016, pp. 690–701. ACM, 2016. doi: 10.1145/2950290.2950341. URL https://doi.org/10.1145/2950290.2950341.
- Improving bug detection via context-based code representation learning and attention-based neural networks. Proc. ACM Program. Lang., 3(OOPSLA):162:1–162:30, 2019c. doi: 10.1145/3360588. URL https://doi.org/10.1145/3360588.
- Dlfix: context-based code transformation learning for automated program repair. In Gregg Rothermel and Doo-Hwan Bae (eds.), ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, pp. 602–614. ACM, 2020c. doi: 10.1145/3377811.3380345. URL https://doi.org/10.1145/3377811.3380345.
- Vulnerability detection with fine-grained interpretations. In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 292–303. ACM, 2021a. doi: 10.1145/3468264.3468597. URL https://doi.org/10.1145/3468264.3468597.
- A context-based automated approach for method name consistency checking and suggestion. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22-30 May 2021, pp. 574–586. IEEE, 2021b. doi: 10.1109/ICSE43902.2021.00060. URL https://doi.org/10.1109/ICSE43902.2021.00060.
- DEAR: A novel deep learning-based approach for automated program repair. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 511–523. ACM, 2022f. doi: 10.1145/3510003.3510177. URL https://doi.org/10.1145/3510003.3510177.
- Textbooks are all you need II: phi-1.5 technical report. CoRR, abs/2309.05463, 2023h. doi: 10.48550/arXiv.2309.05463. URL https://doi.org/10.48550/arXiv.2309.05463.
- Competition-level code generation with alphacode. Science, 378(6624):1092–1097, 2022g. doi: 10.1126/science.abq1158. URL https://www.science.org/doi/abs/10.1126/science.abq1158.
- Sysevr: A framework for using deep learning to detect software vulnerabilities. https://doi.org/10.21227/fhg0-1b35, November 2018b. URL https://doi.org/10.21227/fhg0-1b35. Accessed on YYYY-MM-DD.
- Vuldeepecker: A deep learning-based system for vulnerability detection. In 25th Annual Network and Distributed System Security Symposium, NDSS 2018, San Diego, California, USA, February 18-21, 2018. The Internet Society, 2018c. URL https://www.ndss-symposium.org/wp-content/uploads/2018/02/ndss2018_03A-2_Li_paper.pdf.
- Vuldeelocator: A deep learning-based fine-grained vulnerability detector. IEEE Trans. Dependable Secur. Comput., 19(4):2821–2837, 2022h. doi: 10.1109/TDSC.2021.3076142. URL https://doi.org/10.1109/TDSC.2021.3076142.
- Automating code review activities by large-scale pre-training. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 1035–1047. ACM, 2022i. doi: 10.1145/3540250.3549081. URL https://doi.org/10.1145/3540250.3549081.
- Quixbugs: a multi-lingual program repair benchmark set based on the quixey challenge. In Gail C. Murphy (ed.), Proceedings Companion of the 2017 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity, SPLASH 2017, Vancouver, BC, Canada, October 23 - 27, 2017, pp. 55–56. ACM, 2017. doi: 10.1145/3135932.3135941. URL https://doi.org/10.1145/3135932.3135941.
- Cross-project transfer representation learning for vulnerable function discovery. IEEE Trans. Ind. Informatics, 14(7):3289–3297, 2018a. doi: 10.1109/TII.2018.2821768. URL https://doi.org/10.1109/TII.2018.2821768.
- Deep learning-based vulnerable function detection: A benchmark. In Jianying Zhou, Xiapu Luo, Qingni Shen, and Zhen Xu (eds.), Information and Communications Security - 21st International Conference, ICICS 2019, Beijing, China, December 15-17, 2019, Revised Selected Papers, volume 11999 of Lecture Notes in Computer Science, pp. 219–232. Springer, 2019. doi: 10.1007/978-3-030-41579-2_13. URL https://doi.org/10.1007/978-3-030-41579-2_13.
- Software vulnerability discovery via learning multi-domain knowledge bases. IEEE Trans. Dependable Secur. Comput., 18(5):2469–2485, 2021. doi: 10.1109/TDSC.2019.2954088. URL https://doi.org/10.1109/TDSC.2019.2954088.
- Nl2bash: A corpus and semantic parser for natural language interface to the linux operating system. In Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Kôiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga (eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018. European Language Resources Association (ELRA), 2018b. URL http://www.lrec-conf.org/proceedings/lrec2018/summaries/1021.html.
- Text generation with diffusion language models: A pre-training approach with continuous paragraph denoise. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (eds.), International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA, volume 202 of Proceedings of Machine Learning Research, pp. 21051–21064. PMLR, 2023. URL https://proceedings.mlr.press/v202/lin23d.html.
- Latent predictor networks for code generation. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7-12, 2016, Berlin, Germany, Volume 1: Long Papers. The Association for Computer Linguistics, 2016. doi: 10.18653/V1/P16-1057. URL https://doi.org/10.18653/v1/p16-1057.
- Deep graph matching and searching for semantic code retrieval. ACM Trans. Knowl. Discov. Data, 15(5):88:1–88:21, 2021. doi: 10.1145/3447571. URL https://doi.org/10.1145/3447571.
- A comprehensive evaluation of chatgpt’s zero-shot text-to-sql capability. CoRR, abs/2303.13547, 2023a. doi: 10.48550/ARXIV.2303.13547. URL https://doi.org/10.48550/arXiv.2303.13547.
- Mftcoder: Boosting code llms with multitask fine-tuning. CoRR, abs/2311.02303, 2023b. doi: 10.48550/ARXIV.2311.02303. URL https://doi.org/10.48550/arXiv.2311.02303.
- Codematcher: Searching code based on sequential semantics of important query words. ACM Trans. Softw. Eng. Methodol., 31(1):12:1–12:37, 2022a. doi: 10.1145/3465403. URL https://doi.org/10.1145/3465403.
- Codeqa: A question answering dataset for source code comprehension. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (eds.), Findings of the Association for Computational Linguistics: EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 16-20 November, 2021, pp. 2618–2632. Association for Computational Linguistics, 2021. doi: 10.18653/V1/2021.FINDINGS-EMNLP.223. URL https://doi.org/10.18653/v1/2021.findings-emnlp.223.
- Multi-task learning based pre-trained language model for code completion. In 35th IEEE/ACM International Conference on Automated Software Engineering, ASE 2020, Melbourne, Australia, September 21-25, 2020, pp. 473–485. IEEE, 2020. doi: 10.1145/3324884.3416591. URL https://doi.org/10.1145/3324884.3416591.
- Learning to recommend method names with global context. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 1294–1306. ACM, 2022b. doi: 10.1145/3510003.3510154. URL https://doi.org/10.1145/3510003.3510154.
- Syntax and domain aware model for unsupervised program translation. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 755–767. IEEE, 2023c. doi: 10.1109/ICSE48619.2023.00072. URL https://doi.org/10.1109/ICSE48619.2023.00072.
- RLTF: reinforcement learning from unit test feedback. CoRR, abs/2307.04349, 2023d. doi: 10.48550/ARXIV.2307.04349. URL https://doi.org/10.48550/arXiv.2307.04349.
- Nnsmith: Generating diverse and valid test cases for deep learning compilers. In Tor M. Aamodt, Natalie D. Enright Jerger, and Michael M. Swift (eds.), Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, ASPLOS 2023, Vancouver, BC, Canada, March 25-29, 2023, pp. 530–543. ACM, 2023e. doi: 10.1145/3575693.3575707. URL https://doi.org/10.1145/3575693.3575707.
- Is your code generated by chatgpt really correct? rigorous evaluation of large language models for code generation. CoRR, abs/2305.01210, 2023f. doi: 10.48550/arXiv.2305.01210. URL https://doi.org/10.48550/arXiv.2305.01210.
- Learning to spot and refactor inconsistent method names. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 1–12. IEEE / ACM, 2019a. doi: 10.1109/ICSE.2019.00019. URL https://doi.org/10.1109/ICSE.2019.00019.
- Tbar: revisiting template-based automated program repair. In Dongmei Zhang and Anders Møller (eds.), Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2019, Beijing, China, July 15-19, 2019, pp. 31–42. ACM, 2019b. doi: 10.1145/3293882.3330577. URL https://doi.org/10.1145/3293882.3330577.
- Verilogeval: Evaluating large language models for verilog code generation. CoRR, abs/2309.07544, 2023g. doi: 10.48550/ARXIV.2309.07544. URL https://doi.org/10.48550/arXiv.2309.07544.
- Generating commit messages from diffs using pointer-generator network. In Margaret-Anne D. Storey, Bram Adams, and Sonia Haiduc (eds.), Proceedings of the 16th International Conference on Mining Software Repositories, MSR 2019, 26-27 May 2019, Montreal, Canada, pp. 299–309. IEEE / ACM, 2019c. doi: 10.1109/MSR.2019.00056. URL https://doi.org/10.1109/MSR.2019.00056.
- ATOM: commit message generation based on abstract syntax tree and hybrid ranking. IEEE Trans. Software Eng., 48(5):1800–1817, 2022c. doi: 10.1109/TSE.2020.3038681. URL https://doi.org/10.1109/TSE.2020.3038681.
- Graphsearchnet: Enhancing gnns via capturing global dependencies for semantic code search. IEEE Trans. Software Eng., 49(4):2839–2855, 2023h. doi: 10.1109/TSE.2022.3233901. URL https://doi.org/10.1109/TSE.2022.3233901.
- Repobench: Benchmarking repository-level code auto-completion systems. CoRR, abs/2306.03091, 2023i. doi: 10.48550/ARXIV.2306.03091. URL https://doi.org/10.48550/arXiv.2306.03091.
- Adaptivepaste: Code adaptation through learning semantics-aware variable usage representations. CoRR, abs/2205.11023, 2022d. doi: 10.48550/arXiv.2205.11023. URL https://doi.org/10.48550/arXiv.2205.11023.
- Argugpt: evaluating, understanding and identifying argumentative essays generated by GPT models. CoRR, abs/2304.07666, 2023j. doi: 10.48550/ARXIV.2304.07666. URL https://doi.org/10.48550/arXiv.2304.07666.
- Logprompt: Prompt engineering towards zero-shot and interpretable log analysis. CoRR, abs/2308.07610, 2023k. doi: 10.48550/ARXIV.2308.07610. URL https://doi.org/10.48550/arXiv.2308.07610.
- Roberta: A robustly optimized BERT pretraining approach. CoRR, abs/1907.11692, 2019d. URL http://arxiv.org/abs/1907.11692.
- Uniparser: A unified log parser for heterogeneous log data. In Frédérique Laforest, Raphaël Troncy, Elena Simperl, Deepak Agarwal, Aristides Gionis, Ivan Herman, and Lionel Médini (eds.), WWW ’22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25 - 29, 2022, pp. 1893–1901. ACM, 2022e. doi: 10.1145/3485447.3511993. URL https://doi.org/10.1145/3485447.3511993.
- Neural-machine-translation-based commit message generation: how far are we? In Marianne Huchard, Christian Kästner, and Gordon Fraser (eds.), Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, ASE 2018, Montpellier, France, September 3-7, 2018, pp. 373–384. ACM, 2018. doi: 10.1145/3238147.3238190. URL https://doi.org/10.1145/3238147.3238190.
- Automatic patch generation by learning correct code. In Rastislav Bodík and Rupak Majumdar (eds.), Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL 2016, St. Petersburg, FL, USA, January 20 - 22, 2016, pp. 298–312. ACM, 2016. doi: 10.1145/2837614.2837617. URL https://doi.org/10.1145/2837614.2837617.
- A neural architecture for generating natural language descriptions from source code changes. In Regina Barzilay and Min-Yen Kan (eds.), Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 2: Short Papers, pp. 287–292. Association for Computational Linguistics, 2017. doi: 10.18653/V1/P17-2045. URL https://doi.org/10.18653/v1/P17-2045.
- Content aware source code change description generation. In Emiel Krahmer, Albert Gatt, and Martijn Goudbeek (eds.), Proceedings of the 11th International Conference on Natural Language Generation, Tilburg University, The Netherlands, November 5-8, 2018, pp. 119–128. Association for Computational Linguistics, 2018. doi: 10.18653/V1/W18-6513. URL https://doi.org/10.18653/v1/w18-6513.
- Llama-reviewer: Advancing code review automation with large language models through parameter-efficient fine-tuning (practical experience report). CoRR, abs/2308.11148, 2023. doi: 10.48550/arXiv.2308.11148. URL https://doi.org/10.48550/arXiv.2308.11148.
- Query expansion via wordnet for effective code search. In Yann-Gaël Guéhéneuc, Bram Adams, and Alexander Serebrenik (eds.), 22nd IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2015, Montreal, QC, Canada, March 2-6, 2015, pp. 545–549. IEEE Computer Society, 2015. doi: 10.1109/SANER.2015.7081874. URL https://doi.org/10.1109/SANER.2015.7081874.
- Codexglue: A machine learning benchmark dataset for code understanding and generation. In Joaquin Vanschoren and Sai-Kit Yeung (eds.), Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, NeurIPS Datasets and Benchmarks 2021, December 2021, virtual, 2021. URL https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/hash/c16a5320fa475530d9583c34fd356ef5-Abstract-round1.html.
- Detecting anomaly in big data system logs using convolutional neural network. In 2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress, DASC/PiCom/DataCom/CyberSciTech 2018, Athens, Greece, August 12-15, 2018, pp. 151–158. IEEE Computer Society, 2018. doi: 10.1109/DASC/PICOM/DATACOM/CYBERSCITEC.2018.00037. URL https://doi.org/10.1109/DASC/PiCom/DataCom/CyberSciTec.2018.00037.
- Learning to infer API mappings from API documents. In Gang Li, Yong Ge, Zili Zhang, Zhi Jin, and Michael Blumenstein (eds.), Knowledge Science, Engineering and Management - 10th International Conference, KSEM 2017, Melbourne, VIC, Australia, August 19-20, 2017, Proceedings, volume 10412 of Lecture Notes in Computer Science, pp. 237–248. Springer, 2017. doi: 10.1007/978-3-319-63558-3_20. URL https://doi.org/10.1007/978-3-319-63558-3_20.
- Aroma: code recommendation via structural code search. Proc. ACM Program. Lang., 3(OOPSLA):152:1–152:28, 2019. doi: 10.1145/3360578. URL https://doi.org/10.1145/3360578.
- Wizardcoder: Empowering code large language models with evol-instruct. CoRR, abs/2306.08568, 2023. doi: 10.48550/arXiv.2306.08568. URL https://doi.org/10.48550/arXiv.2306.08568.
- Coconut: combining context-aware neural translation models using ensemble for program repair. In Sarfraz Khurshid and Corina S. Pasareanu (eds.), ISSTA ’20: 29th ACM SIGSOFT International Symposium on Software Testing and Analysis, Virtual Event, USA, July 18-22, 2020, pp. 101–114. ACM, 2020. doi: 10.1145/3395363.3397369. URL https://doi.org/10.1145/3395363.3397369.
- Codehow: Effective code search based on API understanding and extended boolean model (E). In Myra B. Cohen, Lars Grunske, and Michael Whalen (eds.), 30th IEEE/ACM International Conference on Automated Software Engineering, ASE 2015, Lincoln, NE, USA, November 9-13, 2015, pp. 260–270. IEEE Computer Society, 2015. doi: 10.1109/ASE.2015.42. URL https://doi.org/10.1109/ASE.2015.42.
- Hybrid ranking network for text-to-sql. CoRR, abs/2008.04759, 2020. URL https://arxiv.org/abs/2008.04759.
- BEARS: an extensible java bug benchmark for automatic program repair studies. In Xinyu Wang, David Lo, and Emad Shihab (eds.), 26th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2019, Hangzhou, China, February 24-27, 2019, pp. 468–478. IEEE, 2019. doi: 10.1109/SANER.2019.8667991. URL https://doi.org/10.1109/SANER.2019.8667991.
- Authorship identification of source code segments written by multiple authors using stacking ensemble method. CoRR, abs/2212.05610, 2022. doi: 10.48550/arXiv.2212.05610. URL https://doi.org/10.48550/arXiv.2212.05610.
- Nl2type: inferring javascript function types from natural language information. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 304–315. IEEE / ACM, 2019. doi: 10.1109/ICSE.2019.00045. URL https://doi.org/10.1109/ICSE.2019.00045.
- On ml-based program translation: Perils and promises. In 45th IEEE/ACM International Conference on Software Engineering: New Ideas and Emerging Results, NIER@ICSE, Melbourne, Australia, May 14-20, 2023, pp. 60–65. IEEE, 2023. doi: 10.1109/ICSE-NIER58687.2023.00017. URL https://doi.org/10.1109/ICSE-NIER58687.2023.00017.
- Studying the usage of text-to-text transfer transformer to support code-related tasks. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22-30 May 2021, pp. 336–347. IEEE, 2021. doi: 10.1109/ICSE43902.2021.00041. URL https://doi.org/10.1109/ICSE43902.2021.00041.
- Cross-language code search using static and dynamic analyses. In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 205–217. ACM, 2021. doi: 10.1145/3468264.3468538. URL https://doi.org/10.1145/3468264.3468538.
- Loganomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs. In Sarit Kraus (ed.), Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019, pp. 4739–4745. ijcai.org, 2019. doi: 10.24963/IJCAI.2019/658. URL https://doi.org/10.24963/ijcai.2019/658.
- A machine learning framework for programming by example. In Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16-21 June 2013, volume 28 of JMLR Workshop and Conference Proceedings, pp. 187–195. JMLR.org, 2013. URL http://proceedings.mlr.press/v28/menon13.html.
- Mixed precision training. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018. URL https://openreview.net/forum?id=r1gs9JgRZ.
- Manytypes4py: A benchmark python dataset for machine learning-based type inference. In 18th IEEE/ACM International Conference on Mining Software Repositories, MSR 2021, Madrid, Spain, May 17-19, 2021, pp. 585–589. IEEE, 2021. doi: 10.1109/MSR52588.2021.00079. URL https://doi.org/10.1109/MSR52588.2021.00079.
- Type4py: Practical deep similarity learning-based type inference for python. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 2241–2252. ACM, 2022. doi: 10.1145/3510003.3510124. URL https://doi.org/10.1145/3510003.3510124.
- Detectgpt: Zero-shot machine-generated text detection using probability curvature. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (eds.), International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA, volume 202 of Proceedings of Machine Learning Research, pp. 24950–24962. PMLR, 2023. URL https://proceedings.mlr.press/v202/mitchell23a.html.
- Fuzzing class specifications. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 1008–1020. ACM, 2022. doi: 10.1145/3510003.3510120. URL https://doi.org/10.1145/3510003.3510120.
- Martin Monperrus. Automatic software repair: A bibliography. ACM Comput. Surv., 51(1):17:1–17:24, 2018. doi: 10.1145/3105906. URL https://doi.org/10.1145/3105906.
- Megadiff: A dataset of 600k java source code changes categorized by diff size. CoRR, abs/2108.04631, 2021. URL https://arxiv.org/abs/2108.04631.
- Convolutional neural networks over tree structures for programming language processing. In Dale Schuurmans and Michael P. Wellman (eds.), Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17, 2016, Phoenix, Arizona, USA, pp. 1287–1293. AAAI Press, 2016. doi: 10.1609/aaai.v30i1.10139. URL https://doi.org/10.1609/aaai.v30i1.10139.
- Clarifygpt: Empowering llm-based code generation with intention clarification. CoRR, abs/2310.10996, 2023. doi: 10.48550/ARXIV.2310.10996. URL https://doi.org/10.48550/arXiv.2310.10996.
- An assessment of chatgpt on log data. CoRR, abs/2309.07938, 2023. doi: 10.48550/ARXIV.2309.07938. URL https://doi.org/10.48550/arXiv.2309.07938.
- Octopack: Instruction tuning code large language models. CoRR, abs/2308.07124, 2023. doi: 10.48550/arXiv.2308.07124. URL https://doi.org/10.48550/arXiv.2308.07124.
- Evading virus detection using code obfuscation. In Tai-Hoon Kim, Young-Hoon Lee, Byeong Ho Kang, and Dominik Slezak (eds.), Future Generation Information Technology - Second International Conference, FGIT 2010, Jeju Island, Korea, December 13-15, 2010. Proceedings, volume 6485 of Lecture Notes in Computer Science, pp. 394–401. Springer, 2010. doi: 10.1007/978-3-642-17569-5_39. URL https://doi.org/10.1007/978-3-642-17569-5_39.
- CLCDSA: cross language code clone detection using syntactical features and API documentation. In 34th IEEE/ACM International Conference on Automated Software Engineering, ASE 2019, San Diego, CA, USA, November 11-15, 2019, pp. 1026–1037. IEEE, 2019. doi: 10.1109/ASE.2019.00099. URL https://doi.org/10.1109/ASE.2019.00099.
- NIL: large-scale detection of large-variance clones. In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 830–841. ACM, 2021. doi: 10.1145/3468264.3468564. URL https://doi.org/10.1145/3468264.3468564.
- Enhancing few-shot text-to-sql capabilities of large language models: A study on prompt design strategies. CoRR, abs/2305.12586, 2023. doi: 10.48550/arXiv.2305.12586. URL https://doi.org/10.48550/arXiv.2305.12586.
- Self-supervised log parsing. In Yuxiao Dong, Dunja Mladenic, and Craig Saunders (eds.), Machine Learning and Knowledge Discovery in Databases: Applied Data Science Track - European Conference, ECML PKDD 2020, Ghent, Belgium, September 14-18, 2020, Proceedings, Part IV, volume 12460 of Lecture Notes in Computer Science, pp. 122–138. Springer, 2020a. doi: 10.1007/978-3-030-67667-4_8. URL https://doi.org/10.1007/978-3-030-67667-4_8.
- Self-attentive classification-based anomaly detection in unstructured logs. In Claudia Plant, Haixun Wang, Alfredo Cuzzocrea, Carlo Zaniolo, and Xindong Wu (eds.), 20th IEEE International Conference on Data Mining, ICDM 2020, Sorrento, Italy, November 17-20, 2020, pp. 1196–1201. IEEE, 2020b. doi: 10.1109/ICDM50108.2020.00148. URL https://doi.org/10.1109/ICDM50108.2020.00148.
- Lexical statistical machine translation for language migration. In Bertrand Meyer, Luciano Baresi, and Mira Mezini (eds.), Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering, ESEC/FSE’13, Saint Petersburg, Russian Federation, August 18-26, 2013, pp. 651–654. ACM, 2013. doi: 10.1145/2491411.2494584. URL https://doi.org/10.1145/2491411.2494584.
- Divide-and-conquer approach for multi-phase statistical migration for source code (T). In Myra B. Cohen, Lars Grunske, and Michael Whalen (eds.), 30th IEEE/ACM International Conference on Automated Software Engineering, ASE 2015, Lincoln, NE, USA, November 9-13, 2015, pp. 585–596. IEEE Computer Society, 2015. doi: 10.1109/ASE.2015.74. URL https://doi.org/10.1109/ASE.2015.74.
- Graph-based mining of in-the-wild, fine-grained, semantic code change patterns. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 819–830. IEEE / ACM, 2019. doi: 10.1109/ICSE.2019.00089. URL https://doi.org/10.1109/ICSE.2019.00089.
- Suggesting natural method names to check name consistencies. In Gregg Rothermel and Doo-Hwan Bae (eds.), ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, pp. 1372–1384. ACM, 2020. doi: 10.1145/3377811.3380926. URL https://doi.org/10.1145/3377811.3380926.
- Logdtl: Network log template generation with deep transfer learning. In Toufik Ahmed, Olivier Festor, Yacine Ghamri-Doudane, Joon-Myung Kang, Alberto E. Schaeffer Filho, Abdelkader Lahmadi, and Edmundo R. M. Madeira (eds.), 17th IFIP/IEEE International Symposium on Integrated Network Management, IM 2021, Bordeaux, France, May 17-21, 2021, pp. 848–853. IEEE, 2021. URL https://ieeexplore.ieee.org/document/9464068.
- Mapping API elements for code migration with vector representations. In Laura K. Dillon, Willem Visser, and Laurie A. Williams (eds.), Proceedings of the 38th International Conference on Software Engineering, ICSE 2016, Austin, TX, USA, May 14-22, 2016 - Companion Volume, pp. 756–758. ACM, 2016. doi: 10.1145/2889160.2892661. URL https://doi.org/10.1145/2889160.2892661.
- Regvd: Revisiting graph neural networks for vulnerability detection. In 44th IEEE/ACM International Conference on Software Engineering: Companion Proceedings, ICSE Companion 2022, Pittsburgh, PA, USA, May 22-24, 2022, pp. 178–182. ACM/IEEE, 2022. doi: 10.1145/3510454.3516865. URL https://doi.org/10.1145/3510454.3516865.
- Coregen: Contextualized code representation learning for commit message generation. Neurocomputing, 459:97–107, 2021. doi: 10.1016/J.NEUCOM.2021.05.039. URL https://doi.org/10.1016/j.neucom.2021.05.039.
- Learning deep semantics for test completion. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 2111–2123. IEEE, 2023. doi: 10.1109/ICSE48619.2023.00178. URL https://doi.org/10.1109/ICSE48619.2023.00178.
- Codegen2: Lessons for training llms on programming and natural languages. CoRR, abs/2305.02309, 2023a. doi: 10.48550/ARXIV.2305.02309. URL https://doi.org/10.48550/arXiv.2305.02309.
- Codegen: An open large language model for code with multi-turn program synthesis. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023b. URL https://openreview.net/pdf?id=iaYcJKpY2B_.
- Crossvul: a cross-language vulnerability dataset with commit data. In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 1565–1569. ACM, 2021. doi: 10.1145/3468264.3473122. URL https://doi.org/10.1145/3468264.3473122.
- DIRECT : A transformer-based model for decompiled identifier renaming. In Royi Lachmy, Ziyu Yao, Greg Durrett, Milos Gligoric, Junyi Jessy Li, Ray Mooney, Graham Neubig, Yu Su, Huan Sun, and Reut Tsarfaty (eds.), Proceedings of the 1st Workshop on Natural Language Processing for Programming (NLP4Prog 2021), pp. 48–57, Online, August 2021. Association for Computational Linguistics. doi: 10.18653/v1/2021.nlp4prog-1.6. URL https://aclanthology.org/2021.nlp4prog-1.6.
- Spt-code: Sequence-to-sequence pre-training for learning source code representations. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 1–13. ACM, 2022. doi: 10.1145/3510003.3510096. URL https://doi.org/10.1145/3510003.3510096.
- An empirical comparison of pre-trained models of source code. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 2136–2148. IEEE, 2023. doi: 10.1109/ICSE48619.2023.00180. URL https://doi.org/10.1109/ICSE48619.2023.00180.
- Open science in software engineering: A study on deep learning-based vulnerability detection. IEEE Trans. Software Eng., 49(4):1983–2005, 2023. doi: 10.1109/TSE.2022.3207149. URL https://doi.org/10.1109/TSE.2022.3207149.
- Tensorfuzz: Debugging neural networks with coverage-guided fuzzing. In Kamalika Chaudhuri and Ruslan Salakhutdinov (eds.), Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, volume 97 of Proceedings of Machine Learning Research, pp. 4901–4911. PMLR, 2019. URL http://proceedings.mlr.press/v97/odena19a.html.
- Pyter: effective program repair for python type errors. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 922–934. ACM, 2022. doi: 10.1145/3540250.3549130. URL https://doi.org/10.1145/3540250.3549130.
- Syntactic vs. semantic similarity of artificial and real faults in mutation testing studies. CoRR, abs/2112.14508, 2021. URL https://arxiv.org/abs/2112.14508.
- On comparing mutation testing tools through learning-based mutant selection. In IEEE/ACM International Conference on Automation of Software Test, AST 2023, Melbourne, Australia, May 15-16, 2023, pp. 35–46. IEEE, 2023. doi: 10.1109/AST58925.2023.00008. URL https://doi.org/10.1109/AST58925.2023.00008.
- OpenAI. GPT-4 technical report. CoRR, abs/2303.08774, 2023. doi: 10.48550/arXiv.2303.08774. URL https://doi.org/10.48550/arXiv.2303.08774.
- Training language models to follow instructions with human feedback. In NeurIPS, 2022. URL http://papers.nips.cc/paper_files/paper/2022/hash/b1efde53be364a73914f58805a001731-Abstract-Conference.html.
- Understanding the effectiveness of large language models in code translation. CoRR, abs/2308.03109, 2023. doi: 10.48550/ARXIV.2308.03109. URL https://doi.org/10.48550/arXiv.2308.03109.
- Opttyper: Probabilistic type inference by optimising logical and natural constraints. CoRR, abs/2004.00348, 2020. URL https://arxiv.org/abs/2004.00348.
- Automated test case generation as a many-objective optimisation problem with dynamic selection of the targets. IEEE Trans. Software Eng., 44(2):122–158, 2018. doi: 10.1109/TSE.2017.2663435. URL https://doi.org/10.1109/TSE.2017.2663435.
- Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, July 6-12, 2002, Philadelphia, PA, USA, pp. 311–318. ACL, 2002. doi: 10.3115/1073083.1073135. URL https://aclanthology.org/P02-1040/.
- Neuro-symbolic program synthesis. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings. OpenReview.net, 2017. URL https://openreview.net/forum?id=rJ0JwFcex.
- Building language models for text with named entities. In Iryna Gurevych and Yusuke Miyao (eds.), Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15-20, 2018, Volume 1: Long Papers, pp. 2373–2383. Association for Computational Linguistics, 2018. doi: 10.18653/v1/P18-1221. URL https://aclanthology.org/P18-1221/.
- Retrieval augmented code generation and summarization. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (eds.), Findings of the Association for Computational Linguistics: EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 16-20 November, 2021, pp. 2719–2734. Association for Computational Linguistics, 2021. doi: 10.18653/V1/2021.FINDINGS-EMNLP.232. URL https://doi.org/10.18653/v1/2021.findings-emnlp.232.
- Pytorch: An imperative style, high-performance deep learning library. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 8024–8035, 2019. URL https://proceedings.neurips.cc/paper/2019/hash/bdbca288fee7f92f2bfa9f7012727740-Abstract.html.
- Semantic bug seeding: a learning-based approach for creating realistic bugs. In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 906–918. ACM, 2021. doi: 10.1145/3468264.3468623. URL https://doi.org/10.1145/3468264.3468623.
- Automated program repair based on code review: How do pre-trained transformer models perform? CoRR, abs/2304.07840, 2023. doi: 10.48550/ARXIV.2304.07840. URL https://doi.org/10.48550/arXiv.2304.07840.
- Asleep at the keyboard? assessing the security of github copilot’s code contributions. In 43rd IEEE Symposium on Security and Privacy, SP 2022, San Francisco, CA, USA, May 22-26, 2022, pp. 754–768. IEEE, 2022. doi: 10.1109/SP46214.2022.9833571. URL https://doi.org/10.1109/SP46214.2022.9833571.
- The refinedweb dataset for falcon LLM: outperforming curated corpora with web data, and web data only. CoRR, abs/2306.01116, 2023. doi: 10.48550/arXiv.2306.01116. URL https://doi.org/10.48550/arXiv.2306.01116.
- Yarn: Efficient context window extension of large language models. CoRR, abs/2309.00071, 2023a. doi: 10.48550/ARXIV.2309.00071. URL https://doi.org/10.48550/arXiv.2309.00071.
- Static inference meets deep learning: A hybrid type inference approach for python. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 2019–2030. ACM, 2022. doi: 10.1145/3510003.3510038. URL https://doi.org/10.1145/3510003.3510038.
- Domain knowledge matters: Improving prompts with fix templates for repairing python type errors. CoRR, abs/2306.01394, 2023b. doi: 10.48550/ARXIV.2306.01394. URL https://doi.org/10.48550/arXiv.2306.01394.
- Generative type inference for python. CoRR, abs/2307.09163, 2023c. doi: 10.48550/ARXIV.2307.09163. URL https://doi.org/10.48550/arXiv.2307.09163.
- Cross-language clone detection by learning over abstract syntax trees. In Margaret-Anne D. Storey, Bram Adams, and Sonia Haiduc (eds.), Proceedings of the 16th International Conference on Mining Software Repositories, MSR 2019, 26-27 May 2019, Montreal, Canada, pp. 518–528. IEEE / ACM, 2019. doi: 10.1109/MSR.2019.00078. URL https://doi.org/10.1109/MSR.2019.00078.
- Evilcoder: automated bug insertion. In Stephen Schwab, William K. Robertson, and Davide Balzarotti (eds.), Proceedings of the 32nd Annual Conference on Computer Security Applications, ACSAC 2016, Los Angeles, CA, USA, December 5-9, 2016, pp. 214–225. ACM, 2016. URL http://dl.acm.org/citation.cfm?id=2991103.
- Statistical migration of API usages. In Sebastián Uchitel, Alessandro Orso, and Martin P. Robillard (eds.), Proceedings of the 39th International Conference on Software Engineering, ICSE 2017, Buenos Aires, Argentina, May 20-28, 2017 - Companion Volume, pp. 47–50. IEEE Computer Society, 2017. doi: 10.1109/ICSE-C.2017.17. URL https://doi.org/10.1109/ICSE-C.2017.17.
- A manually-curated dataset of fixes to vulnerabilities of open-source software. In Margaret-Anne D. Storey, Bram Adams, and Sonia Haiduc (eds.), Proceedings of the 16th International Conference on Mining Software Repositories, MSR 2019, 26-27 May 2019, Montreal, Canada, pp. 383–387. IEEE / ACM, 2019. doi: 10.1109/MSR.2019.00064. URL https://doi.org/10.1109/MSR.2019.00064.
- DIN-SQL: decomposed in-context learning of text-to-sql with self-correction. CoRR, abs/2304.11015, 2023. doi: 10.48550/ARXIV.2304.11015. URL https://doi.org/10.48550/arXiv.2304.11015.
- Deepbugs: a learning approach to name-based bug detection. Proc. ACM Program. Lang., 2(OOPSLA):147:1–147:25, 2018. doi: 10.1145/3276517. URL https://doi.org/10.1145/3276517.
- Typedevil: Dynamic type inconsistency analysis for javascript. In Antonia Bertolino, Gerardo Canfora, and Sebastian G. Elbaum (eds.), 37th IEEE/ACM International Conference on Software Engineering, ICSE 2015, Florence, Italy, May 16-24, 2015, Volume 1, pp. 314–324. IEEE Computer Society, 2015. doi: 10.1109/ICSE.2015.51. URL https://doi.org/10.1109/ICSE.2015.51.
- Typewriter: neural type prediction with search-based validation. In Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann (eds.), ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, pp. 209–220. ACM, 2020. doi: 10.1145/3368089.3409715. URL https://doi.org/10.1145/3368089.3409715.
- Runbugrun - an executable dataset for automated program repair. CoRR, abs/2304.01102, 2023. doi: 10.48550/ARXIV.2304.01102. URL https://doi.org/10.48550/arXiv.2304.01102.
- Train short, test long: Attention with linear biases enables input length extrapolation. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. URL https://openreview.net/forum?id=R8sQPpGCv0.
- sk_p: a neural program corrector for moocs. In Eelco Visser (ed.), Companion Proceedings of the 2016 ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity, SPLASH 2016, Amsterdam, Netherlands, October 30 - November 4, 2016, pp. 39–40. ACM, 2016. doi: 10.1145/2984043.2989222. URL https://doi.org/10.1145/2984043.2989222.
- Codenet: A large-scale AI for code dataset for learning a diversity of coding tasks. In Joaquin Vanschoren and Sai-Kit Yeung (eds.), Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, NeurIPS Datasets and Benchmarks 2021, December 2021, virtual, 2021. URL https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/hash/a5bfc9e07964f8dddeb95fc584cd965d-Abstract-round2.html.
- Loggpt: Exploring chatgpt for log-based anomaly detection. CoRR, abs/2309.01189, 2023. doi: 10.48550/ARXIV.2309.01189. URL https://doi.org/10.48550/arXiv.2309.01189.
- Communicative agents for software development. CoRR, abs/2307.07924, 2023. doi: 10.48550/ARXIV.2307.07924. URL https://doi.org/10.48550/arXiv.2307.07924.
- A survey on text-to-sql parsing: Concepts, methods, and future directions. CoRR, abs/2208.13629, 2022a. doi: 10.48550/ARXIV.2208.13629. URL https://doi.org/10.48550/arXiv.2208.13629.
- cosformer: Rethinking softmax in attention. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022b. URL https://openreview.net/forum?id=Bl8CQrx2Up4.
- Abstract syntax networks for code generation and semantic parsing. In Regina Barzilay and Min-Yen Kan (eds.), Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pp. 1139–1149. Association for Computational Linguistics, 2017. doi: 10.18653/V1/P17-1105. URL https://doi.org/10.18653/v1/P17-1105.
- Improving language understanding by generative pre-training, 2018.
- Language models are unsupervised multitask learners. OpenAI blog, 1(8):9, 2019.
- Scaling language models: Methods, analysis & insights from training gopher. CoRR, abs/2112.11446, 2021. URL https://arxiv.org/abs/2112.11446.
- Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res., 21:140:1–140:67, 2020. URL http://jmlr.org/papers/v21/20-074.html.
- Towards causal deep learning for vulnerability detection. CoRR, abs/2310.07958, 2023. doi: 10.48550/ARXIV.2310.07958. URL https://doi.org/10.48550/arXiv.2310.07958.
- On the "naturalness" of buggy code. In Laura K. Dillon, Willem Visser, and Laurie A. Williams (eds.), Proceedings of the 38th International Conference on Software Engineering, ICSE 2016, Austin, TX, USA, May 14-22, 2016, pp. 428–439. ACM, 2016. doi: 10.1145/2884781.2884848. URL https://doi.org/10.1145/2884781.2884848.
- Code completion with statistical language models. In Michael F. P. O’Boyle and Keshav Pingali (eds.), ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI ’14, Edinburgh, United Kingdom - June 09 - 11, 2014, pp. 419–428. ACM, 2014. doi: 10.1145/2594291.2594321. URL https://doi.org/10.1145/2594291.2594321.
- Predicting program properties from "big code". In Sriram K. Rajamani and David Walker (eds.), Proceedings of the 42nd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL 2015, Mumbai, India, January 15-17, 2015, pp. 111–124. ACM, 2015. doi: 10.1145/2676726.2677009. URL https://doi.org/10.1145/2676726.2677009.
- Probabilistic model for code with decision trees. In Eelco Visser and Yannis Smaragdakis (eds.), Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, OOPSLA 2016, part of SPLASH 2016, Amsterdam, The Netherlands, October 30 - November 4, 2016, pp. 731–747. ACM, 2016a. doi: 10.1145/2983990.2984041. URL https://doi.org/10.1145/2983990.2984041.
- Learning programs from noisy data. In Rastislav Bodík and Rupak Majumdar (eds.), Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL 2016, St. Petersburg, FL, USA, January 20 - 22, 2016, pp. 761–774. ACM, 2016b. doi: 10.1145/2837614.2837671. URL https://doi.org/10.1145/2837614.2837671.
- Codebleu: a method for automatic evaluation of code synthesis. CoRR, abs/2009.10297, 2020. URL https://arxiv.org/abs/2009.10297.
- TSSB-3M: mining single statement bugs at massive scale. In 19th IEEE/ACM International Conference on Mining Software Repositories, MSR 2022, Pittsburgh, PA, USA, May 23-24, 2022, pp. 418–422. ACM, 2022. doi: 10.1145/3524842.3528505. URL https://doi.org/10.1145/3524842.3528505.
- Limits of machine learning for automatic vulnerability detection. CoRR, abs/2306.17193, 2023. doi: 10.48550/ARXIV.2306.17193. URL https://doi.org/10.48550/arXiv.2306.17193.
- Bug synthesis: challenging bug-finding tools with deep faults. In Gary T. Leavens, Alessandro Garcia, and Corina S. Pasareanu (eds.), Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2018, Lake Buena Vista, FL, USA, November 04-09, 2018, pp. 224–234. ACM, 2018. doi: 10.1145/3236024.3236084. URL https://doi.org/10.1145/3236024.3236084.
- Unsupervised translation of programming languages. In Hugo Larochelle, Marc’Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (eds.), Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, 2020. URL https://proceedings.neurips.cc/paper/2020/hash/ed23fbf18c2cd35f8c7f8de44f85c08d-Abstract.html.
- Leveraging automated unit tests for unsupervised code translation. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. URL https://openreview.net/forum?id=cmt-6KtR4c4.
- Code llama: Open foundation models for code. CoRR, abs/2308.12950, 2023. doi: 10.48550/arXiv.2308.12950. URL https://doi.org/10.48550/arXiv.2308.12950.
- TPTU: task planning and tool usage of large language model-based AI agents. CoRR, abs/2308.03427, 2023. doi: 10.48550/ARXIV.2308.03427. URL https://doi.org/10.48550/arXiv.2308.03427.
- Smbop: Semi-autoregressive bottom-up semantic parsing. In Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tür, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, and Yichao Zhou (eds.), Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2021, Online, June 6-11, 2021, pp. 311–324. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.naacl-main.29. URL https://doi.org/10.18653/v1/2021.naacl-main.29.
- Automated vulnerability detection in source code using deep representation learning. In M. Arif Wani, Mehmed M. Kantardzic, Moamar Sayed Mouchaweh, João Gama, and Edwin Lughofer (eds.), 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018, Orlando, FL, USA, December 17-20, 2018, pp. 757–762. IEEE, 2018. doi: 10.1109/ICMLA.2018.00120. URL https://doi.org/10.1109/ICMLA.2018.00120.
- Retrieval on source code: a neural code search. In Justin Gottschlich and Alvin Cheung (eds.), Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, MAPL@PLDI 2018, Philadelphia, PA, USA, June 18-22, 2018, pp. 31–41. ACM, 2018. doi: 10.1145/3211346.3211353. URL https://doi.org/10.1145/3211346.3211353.
- Tricorder: Building a program analysis ecosystem. In Antonia Bertolino, Gerardo Canfora, and Sebastian G. Elbaum (eds.), 37th IEEE/ACM International Conference on Software Engineering, ICSE 2015, Florence, Italy, May 16-24, 2015, Volume 1, pp. 598–608. IEEE Computer Society, 2015. doi: 10.1109/ICSE.2015.76. URL https://doi.org/10.1109/ICSE.2015.76.
- Bugs.jar: a large-scale, diverse dataset of real-world java bugs. In Andy Zaidman, Yasutaka Kamei, and Emily Hill (eds.), Proceedings of the 15th International Conference on Mining Software Repositories, MSR 2018, Gothenburg, Sweden, May 28-29, 2018, pp. 10–13. ACM, 2018. doi: 10.1145/3196398.3196473. URL https://doi.org/10.1145/3196398.3196473.
- On contrastive learning of semantic similarity forcode to code search. CoRR, abs/2305.03843, 2023. doi: 10.48550/ARXIV.2305.03843. URL https://doi.org/10.48550/arXiv.2305.03843.
- Oreo: detection of clones in the twilight zone. In Gary T. Leavens, Alessandro Garcia, and Corina S. Pasareanu (eds.), Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2018, Lake Buena Vista, FL, USA, November 04-09, 2018, pp. 354–365. ACM, 2018. doi: 10.1145/3236024.3236026. URL https://doi.org/10.1145/3236024.3236026.
- Sourcerercc: scaling code clone detection to big-code. In Laura K. Dillon, Willem Visser, and Laurie A. Williams (eds.), Proceedings of the 38th International Conference on Software Engineering, ICSE 2016, Austin, TX, USA, May 14-22, 2016, pp. 1157–1168. ACM, 2016. doi: 10.1145/2884781.2884877. URL https://doi.org/10.1145/2884781.2884877.
- On the effectiveness of transfer learning for code search. IEEE Trans. Software Eng., 49(4):1804–1822, 2023. doi: 10.1109/TSE.2022.3192755. URL https://doi.org/10.1109/TSE.2022.3192755.
- Multitask prompted training enables zero-shot task generalization. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. URL https://openreview.net/forum?id=9Vrb9D0WI4.
- BLOOM: A 176b-parameter open-access multilingual language model. CoRR, abs/2211.05100, 2022. doi: 10.48550/arXiv.2211.05100. URL https://doi.org/10.48550/arXiv.2211.05100.
- Toolformer: Language models can teach themselves to use tools. CoRR, abs/2302.04761, 2023. doi: 10.48550/arXiv.2302.04761. URL https://doi.org/10.48550/arXiv.2302.04761.
- PICARD: parsing incrementally for constrained auto-regressive decoding from language models. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (eds.), Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, pp. 9895–9901. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.emnlp-main.779. URL https://doi.org/10.18653/v1/2021.emnlp-main.779.
- Proximal policy optimization algorithms. CoRR, abs/1707.06347, 2017. URL http://arxiv.org/abs/1707.06347.
- An empirical evaluation of using large language models for automated unit test generation, 2023.
- Test generation for higher-order functions in dynamic languages. Proc. ACM Program. Lang., 2(OOPSLA):161:1–161:27, 2018. doi: 10.1145/3276531. URL https://doi.org/10.1145/3276531.
- Improving neural machine translation models with monolingual data. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7-12, 2016, Berlin, Germany, Volume 1: Long Papers. The Association for Computer Linguistics, 2016. doi: 10.18653/v1/p16-1009. URL https://doi.org/10.18653/v1/p16-1009.
- Sina Shamshiri. Automated unit test generation for evolving software. In Elisabetta Di Nitto, Mark Harman, and Patrick Heymans (eds.), Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ESEC/FSE 2015, Bergamo, Italy, August 30 - September 4, 2015, pp. 1038–1041. ACM, 2015. doi: 10.1145/2786805.2803196. URL https://doi.org/10.1145/2786805.2803196.
- Compositional generalization and natural language variation: Can a semantic parsing approach handle both? In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (eds.), Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021, pp. 922–938. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.acl-long.75. URL https://doi.org/10.18653/v1/2021.acl-long.75.
- Noam Shazeer. Fast transformer decoding: One write-head is all you need. CoRR, abs/1911.02150, 2019. URL http://arxiv.org/abs/1911.02150.
- NEUZZ: efficient fuzzing with neural program smoothing. In 2019 IEEE Symposium on Security and Privacy, SP 2019, San Francisco, CA, USA, May 19-23, 2019, pp. 803–817. IEEE, 2019. doi: 10.1109/SP.2019.00052. URL https://doi.org/10.1109/SP.2019.00052.
- Mtfuzz: fuzzing with a multi-task neural network. In Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann (eds.), ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, pp. 737–749. ACM, 2020. doi: 10.1145/3368089.3409723. URL https://doi.org/10.1145/3368089.3409723.
- Pitfalls in language models for code intelligence: A taxonomy and survey. CoRR, abs/2310.17903, 2023. doi: 10.48550/ARXIV.2310.17903. URL https://doi.org/10.48550/arXiv.2310.17903.
- Pangu-coder2: Boosting large language models for code with ranking feedback. CoRR, abs/2307.14936, 2023. doi: 10.48550/arXiv.2307.14936. URL https://doi.org/10.48550/arXiv.2307.14936.
- Coss: Leveraging statement semantics for code summarization. IEEE Trans. Software Eng., 49(6):3472–3486, 2023a. doi: 10.1109/TSE.2023.3256362. URL https://doi.org/10.1109/TSE.2023.3256362.
- RACE: retrieval-augmented commit message generation. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 5520–5530. Association for Computational Linguistics, 2022. doi: 10.18653/V1/2022.EMNLP-MAIN.372. URL https://doi.org/10.18653/v1/2022.emnlp-main.372.
- Cocosoda: Effective contrastive learning for code search. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 2198–2210. IEEE, 2023b. doi: 10.1109/ICSE48619.2023.00185. URL https://doi.org/10.1109/ICSE48619.2023.00185.
- Learning contextual representations for semantic parsing with generation-augmented pre-training. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021, pp. 13806–13814. AAAI Press, 2021. doi: 10.1609/AAAI.V35I15.17627. URL https://doi.org/10.1609/aaai.v35i15.17627.
- Automatic code review by learning the revision of source code. In The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019, pp. 4910–4917. AAAI Press, 2019. doi: 10.1609/aaai.v33i01.33014910. URL https://doi.org/10.1609/aaai.v33i01.33014910.
- On the potential of lexico-logical alignments for semantic parsing to SQL queries. In Trevor Cohn, Yulan He, and Yang Liu (eds.), Findings of the Association for Computational Linguistics: EMNLP 2020, Online Event, 16-20 November 2020, volume EMNLP 2020 of Findings of ACL, pp. 1849–1864. Association for Computational Linguistics, 2020. doi: 10.18653/V1/2020.FINDINGS-EMNLP.167. URL https://doi.org/10.18653/v1/2020.findings-emnlp.167.
- Leveraging code-test co-evolution patterns for automated test case recommendation. In IEEE/ACM International Conference on Automation of Software Test, AST@ICSE 2022, Pittsburgh, PA, USA, May 21-22, 2022, pp. 65–76. ACM/IEEE, 2022. doi: 10.1145/3524481.3527222. URL https://doi.org/10.1145/3524481.3527222.
- Execution-based code generation using deep reinforcement learning. Transactions on Machine Learning Research, 2023. ISSN 2835-8856. URL https://openreview.net/forum?id=0XBuaxqEcG.
- Repofusion: Training code models to understand your repository. CoRR, abs/2306.10998, 2023a. doi: 10.48550/ARXIV.2306.10998. URL https://doi.org/10.48550/arXiv.2306.10998.
- Repository-level prompt generation for large language models of code. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (eds.), International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA, volume 202 of Proceedings of Machine Learning Research, pp. 31693–31715. PMLR, 2023b. URL https://proceedings.mlr.press/v202/shrivastava23a.html.
- Improving code search with co-attentive representation learning. In ICPC ’20: 28th International Conference on Program Comprehension, Seoul, Republic of Korea, July 13-15, 2020, pp. 196–207. ACM, 2020. doi: 10.1145/3387904.3389269. URL https://doi.org/10.1145/3387904.3389269.
- Codefusion: A pre-trained diffusion model for code generation, 2023.
- CORE: automating review recommendation for code changes. In Kostas Kontogiannis, Foutse Khomh, Alexander Chatzigeorgiou, Marios-Eleftherios Fokaefs, and Minghui Zhou (eds.), 27th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2020, London, ON, Canada, February 18-21, 2020, pp. 284–295. IEEE, 2020. doi: 10.1109/SANER48275.2020.9054794. URL https://doi.org/10.1109/SANER48275.2020.9054794.
- Mining idioms in the wild. In 44th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, ICSE (SEIP) 2022, Pittsburgh, PA, USA, May 22-24, 2022, pp. 187–196. IEEE, 2022. doi: 10.1109/ICSE-SEIP55303.2022.9794062. URL https://doi.org/10.1109/ICSE-SEIP55303.2022.9794062.
- Alexatm 20b: Few-shot learning using a large-scale multilingual seq2seq model. CoRR, abs/2208.01448, 2022. doi: 10.48550/arXiv.2208.01448. URL https://doi.org/10.48550/arXiv.2208.01448.
- Cct-code: Cross-consistency training for multilingual clone detection and code search. CoRR, abs/2305.11626, 2023. doi: 10.48550/ARXIV.2305.11626. URL https://doi.org/10.48550/arXiv.2305.11626.
- An empirical study of deep learning models for vulnerability detection. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 2237–2248. IEEE, 2023. doi: 10.1109/ICSE48619.2023.00188. URL https://doi.org/10.1109/ICSE48619.2023.00188.
- Semantic similarity loss for neural source code summarization. CoRR, abs/2308.07429, 2023a. doi: 10.48550/ARXIV.2308.07429. URL https://doi.org/10.48550/arXiv.2308.07429.
- Distilled GPT for source code summarization. CoRR, abs/2308.14731, 2023b. doi: 10.48550/ARXIV.2308.14731. URL https://doi.org/10.48550/arXiv.2308.14731.
- A language model of java methods with train/test deduplication. CoRR, abs/2305.08286, 2023. doi: 10.48550/ARXIV.2305.08286. URL https://doi.org/10.48550/arXiv.2305.08286.
- Roformer: Enhanced transformer with rotary position embedding. CoRR, abs/2104.09864, 2021. URL https://arxiv.org/abs/2104.09864.
- Learning to map context-dependent sentences to executable formal queries. In Marilyn A. Walker, Heng Ji, and Amanda Stent (eds.), Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1-6, 2018, Volume 1 (Long Papers), pp. 2238–2249. Association for Computational Linguistics, 2018. doi: 10.18653/v1/n18-1203. URL https://doi.org/10.18653/v1/n18-1203.
- Sql-palm: Improved large language model adaptation for text-to-sql. CoRR, abs/2306.00739, 2023a. doi: 10.48550/ARXIV.2306.00739. URL https://doi.org/10.48550/arXiv.2306.00739.
- Code search based on context-aware code translation. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 388–400. ACM, 2022. doi: 10.1145/3510003.3510140. URL https://doi.org/10.1145/3510003.3510140.
- Automatic code summarization via chatgpt: How far are we? CoRR, abs/2305.12865, 2023b. doi: 10.48550/ARXIV.2305.12865. URL https://doi.org/10.48550/arXiv.2305.12865.
- Treegen: A tree-based transformer architecture for code generation. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020, pp. 8984–8991. AAAI Press, 2020. doi: 10.1609/AAAI.V34I05.6430. URL https://doi.org/10.1609/aaai.v34i05.6430.
- Vipergpt: Visual inference via python execution for reasoning. CoRR, abs/2303.08128, 2023. doi: 10.48550/ARXIV.2303.08128. URL https://doi.org/10.48550/arXiv.2303.08128.
- Sequence to sequence learning with neural networks. In Zoubin Ghahramani, Max Welling, Corinna Cortes, Neil D. Lawrence, and Kilian Q. Weinberger (eds.), Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada, pp. 3104–3112, 2014. URL https://proceedings.neurips.cc/paper/2014/hash/a14ac55a4f27472c5d894ec1c3c743d2-Abstract.html.
- A survey on the evaluation of clone detection performance and benchmarking. CoRR, abs/2006.15682, 2020. URL https://arxiv.org/abs/2006.15682.
- Towards a big data curated benchmark of inter-project code clones. In 30th IEEE International Conference on Software Maintenance and Evolution, Victoria, BC, Canada, September 29 - October 3, 2014, pp. 476–480. IEEE Computer Society, 2014. doi: 10.1109/ICSME.2014.77. URL https://doi.org/10.1109/ICSME.2014.77.
- Pythia: Ai-assisted code completion system. In Ankur Teredesai, Vipin Kumar, Ying Li, Rómer Rosales, Evimaria Terzi, and George Karypis (eds.), Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD 2019, Anchorage, AK, USA, August 4-8, 2019, pp. 2727–2735. ACM, 2019. doi: 10.1145/3292500.3330699. URL https://doi.org/10.1145/3292500.3330699.
- Intellicode compose: code generation using transformer. In Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann (eds.), ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, pp. 1433–1443. ACM, 2020. doi: 10.1145/3368089.3417058. URL https://doi.org/10.1145/3368089.3417058.
- Code translation with compiler representations. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. URL https://openreview.net/pdf?id=XomEU3eNeSQ.
- Codeflaws: a programming competition benchmark for evaluating automated program repair tools. In Sebastián Uchitel, Alessandro Orso, and Martin P. Robillard (eds.), Proceedings of the 39th International Conference on Software Engineering, ICSE 2017, Buenos Aires, Argentina, May 20-28, 2017 - Companion Volume, pp. 180–182. IEEE Computer Society, 2017. doi: 10.1109/ICSE-C.2017.76. URL https://doi.org/10.1109/ICSE-C.2017.76.
- Automated construction of database interfaces: Intergrating statistical and relational learning for semantic parsing. In Hinrich Schütze and Keh-Yih Su (eds.), Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP 2000, Hong Kong, October 7-8, 2000, pp. 133–141. Association for Computational Linguistics, 2000. doi: 10.3115/1117794.1117811. URL https://aclanthology.org/W00-1317/.
- Ast-trans: Code summarization with efficient tree-structured attention. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 150–162. ACM, 2022. doi: 10.1145/3510003.3510224. URL https://doi.org/10.1145/3510003.3510224.
- Explain-then-translate: An analysis on improving program translation with self-generated explanations. CoRR, abs/2311.07070, 2023. doi: 10.48550/ARXIV.2311.07070. URL https://doi.org/10.48550/arXiv.2311.07070.
- Logstamp: Automatic online log parsing based on sequence labelling. SIGMETRICS Perform. Evaluation Rev., 49(4):93–98, 2022. doi: 10.1145/3543146.3543168. URL https://doi.org/10.1145/3543146.3543168.
- On the evaluation of commit message generation models: An experimental study. In IEEE International Conference on Software Maintenance and Evolution, ICSME 2021, Luxembourg, September 27 - October 1, 2021, pp. 126–136. IEEE, 2021. doi: 10.1109/ICSME52107.2021.00018. URL https://doi.org/10.1109/ICSME52107.2021.00018.
- Stanford alpaca: An instruction-following llama model. https://github.com/tatsu-lab/stanford_alpaca, 2023.
- UL2: unifying language learning paradigms. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. URL https://openreview.net/pdf?id=6ruVLB727MC.
- Lamda: Language models for dialog applications. CoRR, abs/2201.08239, 2022. URL https://arxiv.org/abs/2201.08239.
- Bugswarm: mining and continuously growing a dataset of reproducible failures and fixes. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 339–349. IEEE / ACM, 2019. doi: 10.1109/ICSE.2019.00048. URL https://doi.org/10.1109/ICSE.2019.00048.
- Llama: Open and efficient foundation language models. CoRR, abs/2302.13971, 2023a. doi: 10.48550/arXiv.2302.13971. URL https://doi.org/10.48550/arXiv.2302.13971.
- Llama 2: Open foundation and fine-tuned chat models. CoRR, abs/2307.09288, 2023b. doi: 10.48550/arXiv.2307.09288. URL https://doi.org/10.48550/arXiv.2307.09288.
- Recovering variable names for minified code with usage contexts. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 1165–1175. IEEE / ACM, 2019. doi: 10.1109/ICSE.2019.00119. URL https://doi.org/10.1109/ICSE.2019.00119.
- Immanuel Trummer. Codexdb: Synthesizing code for query processing from natural language instructions using GPT-3 codex. Proc. VLDB Endow., 15(11):2921–2928, 2022. URL https://www.vldb.org/pvldb/vol15/p2921-trummer.pdf.
- On the localness of software. In Shing-Chi Cheung, Alessandro Orso, and Margaret-Anne D. Storey (eds.), Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, (FSE-22), Hong Kong, China, November 16 - 22, 2014, pp. 269–280. ACM, 2014. doi: 10.1145/2635868.2635875. URL https://doi.org/10.1145/2635868.2635875.
- On learning meaningful code changes via neural machine translation. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 25–36. IEEE / ACM, 2019a. doi: 10.1109/ICSE.2019.00021. URL https://doi.org/10.1109/ICSE.2019.00021.
- An empirical study on learning bug-fixing patches in the wild via neural machine translation. ACM Trans. Softw. Eng. Methodol., 28(4):19:1–19:29, 2019b. doi: 10.1145/3340544. URL https://doi.org/10.1145/3340544.
- Learning how to mutate source code from bug-fixes. In 2019 IEEE International Conference on Software Maintenance and Evolution, ICSME 2019, Cleveland, OH, USA, September 29 - October 4, 2019, pp. 301–312. IEEE, 2019c. doi: 10.1109/ICSME.2019.00046. URL https://doi.org/10.1109/ICSME.2019.00046.
- Deepmutation: a neural mutation tool. In Gregg Rothermel and Doo-Hwan Bae (eds.), ICSE ’20: 42nd International Conference on Software Engineering, Companion Volume, Seoul, South Korea, 27 June - 19 July, 2020, pp. 29–32. ACM, 2020. doi: 10.1145/3377812.3382146. URL https://doi.org/10.1145/3377812.3382146.
- Unit test case generation with transformers and focal context, 2021a.
- Generating accurate assert statements for unit test cases using pretrained transformers. In IEEE/ACM International Conference on Automation of Software Test, AST@ICSE 2022, Pittsburgh, PA, USA, May 21-22, 2022, pp. 54–64. ACM/IEEE, 2022a. doi: 10.1145/3524481.3527220. URL https://doi.org/10.1145/3524481.3527220.
- Towards automating code review activities. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22-30 May 2021, pp. 163–174. IEEE, 2021b. doi: 10.1109/ICSE43902.2021.00027. URL https://doi.org/10.1109/ICSE43902.2021.00027.
- Using pre-trained models to boost code review automation. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 2291–2302. ACM, 2022b. doi: 10.1145/3510003.3510621. URL https://doi.org/10.1145/3510003.3510621.
- Natural Language Processing with Transformers: Building Language Applications with Hugging Face. O’Reilly Media, Incorporated, 2022. ISBN 1098103246. URL https://books.google.ch/books?id=7hhyzgEACAAJ.
- Neural program repair by jointly learning to localize and repair. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019. URL https://openreview.net/forum?id=ByloJ20qtm.
- Recovering clear, natural identifiers from obfuscated JS names. In Eric Bodden, Wilhelm Schäfer, Arie van Deursen, and Andrea Zisman (eds.), Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, ESEC/FSE 2017, Paderborn, Germany, September 4-8, 2017, pp. 683–693. ACM, 2017. doi: 10.1145/3106237.3106289. URL https://doi.org/10.1145/3106237.3106289.
- Changescribe: A tool for automatically generating commit messages. In Antonia Bertolino, Gerardo Canfora, and Sebastian G. Elbaum (eds.), 37th IEEE/ACM International Conference on Software Engineering, ICSE 2015, Florence, Italy, May 16-24, 2015, Volume 2, pp. 709–712. IEEE Computer Society, 2015. doi: 10.1109/ICSE.2015.229. URL https://doi.org/10.1109/ICSE.2015.229.
- Attention is all you need. In Isabelle Guyon, Ulrike von Luxburg, Samy Bengio, Hanna M. Wallach, Rob Fergus, S. V. N. Vishwanathan, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, pp. 5998–6008, 2017. URL https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html.
- ConTest: A unit test completion benchmark featuring context. In Royi Lachmy, Ziyu Yao, Greg Durrett, Milos Gligoric, Junyi Jessy Li, Ray Mooney, Graham Neubig, Yu Su, Huan Sun, and Reut Tsarfaty (eds.), Proceedings of the 1st Workshop on Natural Language Processing for Programming (NLP4Prog 2021), pp. 17–25, Online, August 2021. Association for Computational Linguistics. doi: 10.18653/v1/2021.nlp4prog-1.2. URL https://aclanthology.org/2021.nlp4prog-1.2.
- Improving automatic source code summarization via deep reinforcement learning. In Marianne Huchard, Christian Kästner, and Gordon Fraser (eds.), Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, ASE 2018, Montpellier, France, September 3-7, 2018, pp. 397–407. ACM, 2018. doi: 10.1145/3238147.3238206. URL https://doi.org/10.1145/3238147.3238206.
- GLUE: A multi-task benchmark and analysis platform for natural language understanding. In Tal Linzen, Grzegorz Chrupala, and Afra Alishahi (eds.), Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@EMNLP 2018, Brussels, Belgium, November 1, 2018, pp. 353–355. Association for Computational Linguistics, 2018a. doi: 10.18653/v1/w18-5446. URL https://doi.org/10.18653/v1/w18-5446.
- Superglue: A stickier benchmark for general-purpose language understanding systems. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 3261–3275, 2019. URL https://proceedings.neurips.cc/paper/2019/hash/4496bf24afe7fab6f046bf4923da8de6-Abstract.html.
- RAT-SQL: relation-aware schema encoding and linking for text-to-sql parsers. In Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel R. Tetreault (eds.), Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020, pp. 7567–7578. Association for Computational Linguistics, 2020a. doi: 10.18653/v1/2020.acl-main.677. URL https://doi.org/10.18653/v1/2020.acl-main.677.
- Learning to synthesize data for semantic parsing. In Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tür, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, and Yichao Zhou (eds.), Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2021, Online, June 6-11, 2021, pp. 2760–2766. Association for Computational Linguistics, 2021a. doi: 10.18653/V1/2021.NAACL-MAIN.220. URL https://doi.org/10.18653/v1/2021.naacl-main.220.
- GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model. https://github.com/kingoflolz/mesh-transformer-jax, May 2021.
- Bridging pre-trained models and downstream tasks for source code understanding. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 287–298. ACM, 2022a. doi: 10.1145/3510003.3510062. URL https://doi.org/10.1145/3510003.3510062.
- Context-aware retrieval-based deep commit message generation. ACM Trans. Softw. Eng. Methodol., 30(4):56:1–56:30, 2021b. doi: 10.1145/3464689. URL https://doi.org/10.1145/3464689.
- Ke Wang and Zhendong Su. Blended, precise semantic program embeddings. In Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2020, pp. 121–134, New York, NY, USA, 2020. Association for Computing Machinery. ISBN 9781450376136. doi: 10.1145/3385412.3385999. URL https://doi.org/10.1145/3385412.3385999.
- Delving into commit-issue correlation to enhance commit message generation models. In 38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023, Luxembourg, September 11-15, 2023, pp. 710–722. IEEE, 2023a. doi: 10.1109/ASE56229.2023.00050. URL https://doi.org/10.1109/ASE56229.2023.00050.
- Ccaligner: a token based large-gap clone detector. In Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (eds.), Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, pp. 1066–1077. ACM, 2018b. doi: 10.1145/3180155.3180179. URL https://doi.org/10.1145/3180155.3180179.
- Text-to-sql generation for question answering on electronic medical records. In Yennun Huang, Irwin King, Tie-Yan Liu, and Maarten van Steen (eds.), WWW ’20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, pp. 350–361. ACM / IW3C2, 2020b. doi: 10.1145/3366423.3380120. URL https://doi.org/10.1145/3366423.3380120.
- Pre-implementation method name prediction for object-oriented programming. ACM Trans. Softw. Eng. Methodol., 32(6), sep 2023b. ISSN 1049-331X. doi: 10.1145/3597203. URL https://doi.org/10.1145/3597203.
- Linformer: Self-attention with linear complexity. CoRR, abs/2006.04768, 2020c. URL https://arxiv.org/abs/2006.04768.
- Bugram: bug detection with n-gram language models. In David Lo, Sven Apel, and Sarfraz Khurshid (eds.), Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, ASE 2016, Singapore, September 3-7, 2016, pp. 708–719. ACM, 2016a. doi: 10.1145/2970276.2970341. URL https://doi.org/10.1145/2970276.2970341.
- Automatically learning semantic features for defect prediction. In Laura K. Dillon, Willem Visser, and Laurie A. Williams (eds.), Proceedings of the 38th International Conference on Software Engineering, ICSE 2016, Austin, TX, USA, May 14-22, 2016, pp. 297–308. ACM, 2016b. doi: 10.1145/2884781.2884804. URL https://doi.org/10.1145/2884781.2884804.
- What language model architecture and pretraining objective works best for zero-shot generalization? In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, and Sivan Sabato (eds.), International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, volume 162 of Proceedings of Machine Learning Research, pp. 22964–22984. PMLR, 2022b. URL https://proceedings.mlr.press/v162/wang22u.html.
- Detecting code clones with graph neural network and flow-augmented abstract syntax tree. In Kostas Kontogiannis, Foutse Khomh, Alexander Chatzigeorgiou, Marios-Eleftherios Fokaefs, and Minghui Zhou (eds.), 27th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2020, London, ON, Canada, February 18-21, 2020, pp. 261–271. IEEE, 2020d. doi: 10.1109/SANER48275.2020.9054857. URL https://doi.org/10.1109/SANER48275.2020.9054857.
- Trans^3: A transformer-based framework for unifying code summarization and code search. CoRR, abs/2003.03238, 2020e. URL https://arxiv.org/abs/2003.03238.
- Syncobert: Syntax-guided multi-modal contrastive pre-training for code representation, 2021c.
- Compilable neural code generation with compiler feedback. In Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (eds.), Findings of the Association for Computational Linguistics: ACL 2022, Dublin, Ireland, May 22-27, 2022, pp. 9–19. Association for Computational Linguistics, 2022c. doi: 10.18653/V1/2022.FINDINGS-ACL.2. URL https://doi.org/10.18653/v1/2022.findings-acl.2.
- CODE-MVP: learning to represent source code from multiple views with contrastive pre-training. In Marine Carpuat, Marie-Catherine de Marneffe, and Iván Vladimir Meza Ruíz (eds.), Findings of the Association for Computational Linguistics: NAACL 2022, Seattle, WA, United States, July 10-15, 2022, pp. 1066–1077. Association for Computational Linguistics, 2022d. doi: 10.18653/v1/2022.findings-naacl.80. URL https://doi.org/10.18653/v1/2022.findings-naacl.80.
- SPINE: a scalable log parser with feedback guidance. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 1198–1208. ACM, 2022e. doi: 10.1145/3540250.3549176. URL https://doi.org/10.1145/3540250.3549176.
- Cocosum: Contextual code summarization with multi-relational graph neural network. CoRR, abs/2107.01933, 2021d. URL https://arxiv.org/abs/2107.01933.
- Super-naturalinstructions: Generalization via declarative instructions on 1600+ NLP tasks. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 5085–5109. Association for Computational Linguistics, 2022f. URL https://aclanthology.org/2022.emnlp-main.340.
- Self-instruct: Aligning language models with self-generated instructions. In Anna Rogers, Jordan L. Boyd-Graber, and Naoaki Okazaki (eds.), Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pp. 13484–13508. Association for Computational Linguistics, 2023c. doi: 10.18653/v1/2023.acl-long.754. URL https://doi.org/10.18653/v1/2023.acl-long.754.
- Codet5: Identifier-aware unified pre-trained encoder-decoder models for code understanding and generation. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (eds.), Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, pp. 8696–8708. Association for Computational Linguistics, 2021e. doi: 10.18653/v1/2021.emnlp-main.685. URL https://doi.org/10.18653/v1/2021.emnlp-main.685.
- Codet5+: Open code large language models for code understanding and generation. CoRR, abs/2305.07922, 2023d. doi: 10.48550/arXiv.2305.07922. URL https://doi.org/10.48550/arXiv.2305.07922.
- Deep learning library testing via effective model generation. In Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann (eds.), ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, pp. 788–799. ACM, 2020f. doi: 10.1145/3368089.3409761. URL https://doi.org/10.1145/3368089.3409761.
- Execution-based evaluation for open-domain code generation. CoRR, abs/2212.10481, 2022g. doi: 10.48550/ARXIV.2212.10481. URL https://doi.org/10.48550/arXiv.2212.10481.
- Mconala: A benchmark for code generation from multiple natural languages. In Andreas Vlachos and Isabelle Augenstein (eds.), Findings of the Association for Computational Linguistics: EACL 2023, Dubrovnik, Croatia, May 2-6, 2023, pp. 265–273. Association for Computational Linguistics, 2023e. doi: 10.18653/V1/2023.FINDINGS-EACL.20. URL https://doi.org/10.18653/v1/2023.findings-eacl.20.
- On learning meaningful assert statements for unit test cases. In Gregg Rothermel and Doo-Hwan Bae (eds.), ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, pp. 1398–1409. ACM, 2020. doi: 10.1145/3377811.3380429. URL https://doi.org/10.1145/3377811.3380429.
- Free lunch for testing: Fuzzing deep-learning libraries from open source. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 995–1007. ACM, 2022a. doi: 10.1145/3510003.3510041. URL https://doi.org/10.1145/3510003.3510041.
- Code generation as a dual task of code summarization. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 6559–6569, 2019. URL https://proceedings.neurips.cc/paper/2019/hash/e52ad5c9f751f599492b4f087ed7ecfc-Abstract.html.
- Supervised deep features for software functional clone detection by exploiting lexical and syntactical information in source code. In Carles Sierra (ed.), Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, Melbourne, Australia, August 19-25, 2017, pp. 3034–3040. ijcai.org, 2017. doi: 10.24963/ijcai.2017/423. URL https://doi.org/10.24963/ijcai.2017/423.
- Finetuned language models are zero-shot learners. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022b. URL https://openreview.net/forum?id=gEZrGCozdqR.
- Emergent abilities of large language models. Transactions on Machine Learning Research, 2022c. ISSN 2835-8856. URL https://openreview.net/forum?id=yzkSU5zdwD. Survey Certification.
- Chain-of-thought prompting elicits reasoning in large language models. In NeurIPS, 2022d. URL http://papers.nips.cc/paper_files/paper/2022/hash/9d5609613524ecf4f15af0f7b31abca4-Abstract-Conference.html.
- Lambdanet: Probabilistic type inference using graph neural networks. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020. URL https://openreview.net/forum?id=Hkx6hANtwH.
- Typet5: Seq2seq type inference using static analysis. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. URL https://openreview.net/pdf?id=4TyNEhI2GdN.
- Babeltower: Learning to auto-parallelized program translation. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, and Sivan Sabato (eds.), International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, volume 162 of Proceedings of Machine Learning Research, pp. 23685–23700. PMLR, 2022. URL https://proceedings.mlr.press/v162/wen22b.html.
- Toward deep learning software repositories. In Massimiliano Di Penta, Martin Pinzger, and Romain Robbes (eds.), 12th IEEE/ACM Working Conference on Mining Software Repositories, MSR 2015, Florence, Italy, May 16-17, 2015, pp. 334–345. IEEE Computer Society, 2015. doi: 10.1109/MSR.2015.38. URL https://doi.org/10.1109/MSR.2015.38.
- Deep learning code fragments for code clone detection. In David Lo, Sven Apel, and Sarfraz Khurshid (eds.), Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, ASE 2016, Singapore, September 3-7, 2016, pp. 87–98. ACM, 2016. doi: 10.1145/2970276.2970326. URL https://doi.org/10.1145/2970276.2970326.
- Bugsinpy: a database of existing bugs in python programs to enable controlled testing and debugging studies. In Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann (eds.), ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, pp. 1556–1560. ACM, 2020. doi: 10.1145/3368089.3417943. URL https://doi.org/10.1145/3368089.3417943.
- Ronald J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn., 8:229–256, 1992. doi: 10.1007/BF00992696. URL https://doi.org/10.1007/BF00992696.
- Clocom: Mining existing source code for automatic comment generation. In Yann-Gaël Guéhéneuc, Bram Adams, and Alexander Serebrenik (eds.), 22nd IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2015, Montreal, QC, Canada, March 2-6, 2015, pp. 380–389. IEEE Computer Society, 2015. doi: 10.1109/SANER.2015.7081848. URL https://doi.org/10.1109/SANER.2015.7081848.
- Code summarization with structure-induced transformer. In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (eds.), Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, Online Event, August 1-6, 2021, volume ACL/IJCNLP 2021 of Findings of ACL, pp. 1078–1090. Association for Computational Linguistics, 2021. doi: 10.18653/v1/2021.findings-acl.93. URL https://doi.org/10.18653/v1/2021.findings-acl.93.
- Lvmapper: A large-variance clone detector using sequencing alignment approach. IEEE Access, 8:27986–27997, 2020. doi: 10.1109/ACCESS.2020.2971545. URL https://doi.org/10.1109/ACCESS.2020.2971545.
- Evaluating and improving neural program-smoothing-based fuzzing. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022, pp. 847–858. ACM, 2022. doi: 10.1145/3510003.3510089. URL https://doi.org/10.1145/3510003.3510089.
- Less training, more repairing please: revisiting automated program repair via zero-shot learning. In Abhik Roychoudhury, Cristian Cadar, and Miryung Kim (eds.), Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, pp. 959–971. ACM, 2022. doi: 10.1145/3540250.3549101. URL https://doi.org/10.1145/3540250.3549101.
- Conversational automated program repair. CoRR, abs/2301.13246, 2023. doi: 10.48550/ARXIV.2301.13246. URL https://doi.org/10.48550/arXiv.2301.13246.
- Automated program repair in the era of large pre-trained language models. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 1482–1494. IEEE, 2023. doi: 10.1109/ICSE48619.2023.00129. URL https://doi.org/10.1109/ICSE48619.2023.00129.
- Docter: documentation-guided fuzzing for testing deep learning API functions. In Sukyoung Ryu and Yannis Smaragdakis (eds.), ISSTA ’22: 31st ACM SIGSOFT International Symposium on Software Testing and Analysis, Virtual Event, South Korea, July 18 - 22, 2022, pp. 176–188. ACM, 2022a. doi: 10.1145/3533767.3534220. URL https://doi.org/10.1145/3533767.3534220.
- Exploiting method names to improve code summarization: A deliberation multi-task learning approach. In 29th IEEE/ACM International Conference on Program Comprehension, ICPC 2021, Madrid, Spain, May 20-21, 2021, pp. 138–148. IEEE, 2021. doi: 10.1109/ICPC52881.2021.00022. URL https://doi.org/10.1109/ICPC52881.2021.00022.
- Unifiedskg: Unifying and multi-tasking structured knowledge grounding with text-to-text language models. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022, pp. 602–631. Association for Computational Linguistics, 2022b. doi: 10.18653/v1/2022.emnlp-main.39. URL https://doi.org/10.18653/v1/2022.emnlp-main.39.
- A survey of deep code search. CoRR, abs/2305.05959, 2023a. doi: 10.48550/arXiv.2305.05959. URL https://doi.org/10.48550/arXiv.2305.05959.
- Chatunitest: a chatgpt-based automated unit test generation tool. CoRR, abs/2305.04764, 2023b. doi: 10.48550/ARXIV.2305.04764. URL https://doi.org/10.48550/arXiv.2305.04764.
- Wizardlm: Empowering large language models to follow complex instructions. CoRR, abs/2304.12244, 2023a. doi: 10.48550/arXiv.2304.12244. URL https://doi.org/10.48550/arXiv.2304.12244.
- A systematic evaluation of large language models of code. In Swarat Chaudhuri and Charles Sutton (eds.), MAPS@PLDI 2022: 6th ACM SIGPLAN International Symposium on Machine Programming, San Diego, CA, USA, 13 June 2022, pp. 1–10. ACM, 2022. doi: 10.1145/3520312.3534862. URL https://doi.org/10.1145/3520312.3534862.
- Two-stage attention-based model for code search with textual and structural features. In 28th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2021, Honolulu, HI, USA, March 9-12, 2021, pp. 342–353. IEEE, 2021. doi: 10.1109/SANER50967.2021.00039. URL https://doi.org/10.1109/SANER50967.2021.00039.
- Commit message generation for source code changes. In Sarit Kraus (ed.), Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019, pp. 3975–3981. ijcai.org, 2019a. doi: 10.24963/IJCAI.2019/552. URL https://doi.org/10.24963/ijcai.2019/552.
- Method name suggestion with hierarchical attention networks. In Manuel V. Hermenegildo and Atsushi Igarashi (eds.), Proceedings of the 2019 ACM SIGPLAN Workshop on Partial Evaluation and Program Manipulation, PEPM@POPL 2019, Cascais, Portugal, January 14-15, 2019, pp. 10–21. ACM, 2019b. doi: 10.1145/3294032.3294079. URL https://doi.org/10.1145/3294032.3294079.
- Lmpa: Improving decompilation by synergy of large language model and program analysis. CoRR, abs/2306.02546, 2023b. doi: 10.48550/ARXIV.2306.02546. URL https://doi.org/10.48550/arXiv.2306.02546.
- Sqlnet: Generating structured queries from natural language without reinforcement learning. CoRR, abs/1711.04436, 2017. URL http://arxiv.org/abs/1711.04436.
- A survey on pretrained language models for neural code intelligence. CoRR, abs/2212.10079, 2022. doi: 10.48550/arXiv.2212.10079. URL https://doi.org/10.48550/arXiv.2212.10079.
- Python probabilistic type inference with natural language support. In Thomas Zimmermann, Jane Cleland-Huang, and Zhendong Su (eds.), Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE 2016, Seattle, WA, USA, November 13-18, 2016, pp. 607–618. ACM, 2016. doi: 10.1145/2950290.2950343. URL https://doi.org/10.1145/2950290.2950343.
- Sqlizer: query synthesis from natural language. Proc. ACM Program. Lang., 1(OOPSLA):63:1–63:26, 2017. doi: 10.1145/3133887. URL https://doi.org/10.1145/3133887.
- Cross-language source code clone detection using deep learning with infercode. CoRR, abs/2205.04913, 2022. doi: 10.48550/arXiv.2205.04913. URL https://doi.org/10.48550/arXiv.2205.04913.
- Are the code snippets what we are searching for? A benchmark and an empirical study on code search with natural-language queries. In Kostas Kontogiannis, Foutse Khomh, Alexander Chatzigeorgiou, Marios-Eleftherios Fokaefs, and Minghui Zhou (eds.), 27th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2020, London, ON, Canada, February 18-21, 2020, pp. 344–354. IEEE, 2020. doi: 10.1109/SANER48275.2020.9054840. URL https://doi.org/10.1109/SANER48275.2020.9054840.
- Codetransocean: A comprehensive multilingual benchmark for code translation. CoRR, abs/2310.04951, 2023. doi: 10.48550/ARXIV.2310.04951. URL https://doi.org/10.48550/arXiv.2310.04951.
- Baichuan 2: Open large-scale language models. CoRR, abs/2309.10305, 2023a. doi: 10.48550/arXiv.2309.10305. URL https://doi.org/10.48550/arXiv.2309.10305.
- White-box compiler fuzzing empowered by large language models. CoRR, abs/2310.15991, 2023b. doi: 10.48550/ARXIV.2310.15991. URL https://doi.org/10.48550/arXiv.2310.15991.
- Fuzzing automatic differentiation in deep-learning libraries. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14-20, 2023, pp. 1174–1186. IEEE, 2023c. doi: 10.1109/ICSE48619.2023.00105. URL https://doi.org/10.1109/ICSE48619.2023.00105.
- Intercode: Standardizing and benchmarking interactive coding with execution feedback. CoRR, abs/2306.14898, 2023d. doi: 10.48550/ARXIV.2306.14898. URL https://doi.org/10.48550/arXiv.2306.14898.
- Hierarchical neural data synthesis for semantic parsing. CoRR, abs/2112.02212, 2021. URL https://arxiv.org/abs/2112.02212.
- Zero-shot detection of machine-generated codes. CoRR, abs/2310.05103, 2023e. doi: 10.48550/ARXIV.2310.05103. URL https://doi.org/10.48550/arXiv.2310.05103.
- A naming pattern based approach for method name recommendation. In IEEE 33rd International Symposium on Software Reliability Engineering, ISSRE 2022, Charlotte, NC, USA, October 31 - Nov. 3, 2022, pp. 344–354. IEEE, 2022. doi: 10.1109/ISSRE55969.2022.00041. URL https://doi.org/10.1109/ISSRE55969.2022.00041.
- Staqc: A systematically mined question-code dataset from stack overflow. In Pierre-Antoine Champin, Fabien Gandon, Mounia Lalmas, and Panagiotis G. Ipeirotis (eds.), Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018, pp. 1693–1703. ACM, 2018. doi: 10.1145/3178876.3186081. URL https://doi.org/10.1145/3178876.3186081.
- Graph-based, self-supervised program repair from diagnostic feedback. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event, volume 119 of Proceedings of Machine Learning Research, pp. 10799–10808. PMLR, 2020. URL http://proceedings.mlr.press/v119/yasunaga20a.html.
- Break-it-fix-it: Unsupervised learning for program repair. In Marina Meila and Tong Zhang (eds.), Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, volume 139 of Proceedings of Machine Learning Research, pp. 11941–11952. PMLR, 2021. URL http://proceedings.mlr.press/v139/yasunaga21a.html.
- Do machine learning models produce typescript types that type check? In Karim Ali and Guido Salvaneschi (eds.), 37th European Conference on Object-Oriented Programming, ECOOP 2023, July 17-21, 2023, Seattle, Washington, United States, volume 263 of LIPIcs, pp. 37:1–37:28. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2023. doi: 10.4230/LIPIcs.ECOOP.2023.37. URL https://doi.org/10.4230/LIPIcs.ECOOP.2023.37.
- A syntactic neural model for general-purpose code generation. In Regina Barzilay and Min-Yen Kan (eds.), Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pp. 440–450. Association for Computational Linguistics, 2017. doi: 10.18653/v1/P17-1041. URL https://doi.org/10.18653/v1/P17-1041.
- Learning to mine aligned code and natural language pairs from stack overflow. In Andy Zaidman, Yasutaka Kamei, and Emily Hill (eds.), Proceedings of the 15th International Conference on Mining Software Repositories, MSR 2018, Gothenburg, Sweden, May 28-29, 2018, pp. 476–486. ACM, 2018. doi: 10.1145/3196398.3196408. URL https://doi.org/10.1145/3196398.3196408.
- Tabert: Pretraining for joint understanding of textual and tabular data. In Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel R. Tetreault (eds.), Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020, pp. 8413–8426. Association for Computational Linguistics, 2020. doi: 10.18653/V1/2020.ACL-MAIN.745. URL https://doi.org/10.18653/v1/2020.acl-main.745.
- Automatic code review by learning the structure information of code graph. Sensors, 23(5):2551, 2023. doi: 10.3390/s23052551. URL https://doi.org/10.3390/s23052551.
- Neural detection of semantic code clones via tree-based convolution. In Yann-Gaël Guéhéneuc, Foutse Khomh, and Federica Sarro (eds.), Proceedings of the 27th International Conference on Program Comprehension, ICPC 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 70–80. IEEE / ACM, 2019a. doi: 10.1109/ICPC.2019.00021. URL https://doi.org/10.1109/ICPC.2019.00021.
- Codereval: A benchmark of pragmatic code generation with generative pre-trained models. CoRR, abs/2302.00288, 2023a. doi: 10.48550/arXiv.2302.00288. URL https://doi.org/10.48550/arXiv.2302.00288.
- Self-supervised log parsing using semantic contribution difference. J. Syst. Softw., 200:111646, 2023b. doi: 10.1016/J.JSS.2023.111646. URL https://doi.org/10.1016/j.jss.2023.111646.
- Typesql: Knowledge-based type-aware neural text-to-sql generation. In Marilyn A. Walker, Heng Ji, and Amanda Stent (eds.), Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, New Orleans, Louisiana, USA, June 1-6, 2018, Volume 2 (Short Papers), pp. 588–594. Association for Computational Linguistics, 2018a. doi: 10.18653/v1/n18-2093. URL https://doi.org/10.18653/v1/n18-2093.
- Syntaxsqlnet: Syntax tree networks for complex and cross-domain text-to-sql task. In Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun’ichi Tsujii (eds.), Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018, pp. 1653–1663. Association for Computational Linguistics, 2018b. doi: 10.18653/v1/d18-1193. URL https://doi.org/10.18653/v1/d18-1193.
- Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-sql task. In Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun’ichi Tsujii (eds.), Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018, pp. 3911–3921. Association for Computational Linguistics, 2018c. doi: 10.18653/v1/d18-1425. URL https://doi.org/10.18653/v1/d18-1425.
- Cosql: A conversational text-to-sql challenge towards cross-domain natural language interfaces to databases. In Kentaro Inui, Jing Jiang, Vincent Ng, and Xiaojun Wan (eds.), Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, November 3-7, 2019, pp. 1962–1979. Association for Computational Linguistics, 2019b. doi: 10.18653/v1/D19-1204. URL https://doi.org/10.18653/v1/D19-1204.
- Sparc: Cross-domain semantic parsing in context. In Anna Korhonen, David R. Traum, and Lluís Màrquez (eds.), Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28- August 2, 2019, Volume 1: Long Papers, pp. 4511–4523. Association for Computational Linguistics, 2019c. doi: 10.18653/v1/p19-1443. URL https://doi.org/10.18653/v1/p19-1443.
- Grappa: Grammar-augmented pre-training for table semantic parsing. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. URL https://openreview.net/forum?id=kyaIeYj4zZ.
- Dataset and enhanced model for eligibility criteria-to-sql semantic parsing. In Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, and Stelios Piperidis (eds.), Proceedings of The 12th Language Resources and Evaluation Conference, LREC 2020, Marseille, France, May 11-16, 2020, pp. 5829–5837. European Language Resources Association, 2020. URL https://aclanthology.org/2020.lrec-1.714/.
- Evaluating instruction-tuned large language models on code comprehension and generation. CoRR, abs/2308.01240, 2023a. doi: 10.48550/arXiv.2308.01240. URL https://doi.org/10.48550/arXiv.2308.01240.
- No more manual tests? evaluating and improving chatgpt for unit test generation. CoRR, abs/2305.04207, 2023b. doi: 10.48550/ARXIV.2305.04207. URL https://doi.org/10.48550/arXiv.2305.04207.
- CERT: continual pre-training on sketches for library-oriented code generation. In Luc De Raedt (ed.), Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI 2022, Vienna, Austria, 23-29 July 2022, pp. 2369–2375. ijcai.org, 2022. doi: 10.24963/ijcai.2022/329. URL https://doi.org/10.24963/ijcai.2022/329.
- Large language models meet nl2code: A survey. In Anna Rogers, Jordan L. Boyd-Graber, and Naoaki Okazaki (eds.), Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pp. 7443–7464. Association for Computational Linguistics, 2023. doi: 10.18653/v1/2023.acl-long.411. URL https://doi.org/10.18653/v1/2023.acl-long.411.
- Learning to parse database queries using inductive logic programming. In William J. Clancey and Daniel S. Weld (eds.), Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, AAAI 96, IAAI 96, Portland, Oregon, USA, August 4-8, 1996, Volume 2, pp. 1050–1055. AAAI Press / The MIT Press, 1996. URL http://www.aaai.org/Library/AAAI/1996/aaai96-156.php.
- Photon: A robust cross-domain text-to-sql system. In Asli Celikyilmaz and Tsung-Hsien Wen (eds.), Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, ACL 2020, Online, July 5-10, 2020, pp. 204–214. Association for Computational Linguistics, 2020. doi: 10.18653/V1/2020.ACL-DEMOS.24. URL https://doi.org/10.18653/v1/2020.acl-demos.24.
- Prompt-enhanced software vulnerability detection using chatgpt. CoRR, abs/2308.12697, 2023a. doi: 10.48550/ARXIV.2308.12697. URL https://doi.org/10.48550/arXiv.2308.12697.
- A survey of automatic source code summarization. Symmetry, 14(3):471, 2022. doi: 10.3390/SYM14030471. URL https://doi.org/10.3390/sym14030471.
- Repocoder: Repository-level code completion through iterative retrieval and generation. CoRR, abs/2303.12570, 2023b. doi: 10.48550/ARXIV.2303.12570. URL https://doi.org/10.48550/arXiv.2303.12570.
- A survey of software clone detection from security perspective. IEEE Access, 9:48157–48173, 2021. doi: 10.1109/ACCESS.2021.3065872. URL https://doi.org/10.1109/ACCESS.2021.3065872.
- A novel neural source code representation based on abstract syntax tree. In Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (eds.), Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 783–794. IEEE / ACM, 2019a. doi: 10.1109/ICSE.2019.00086. URL https://doi.org/10.1109/ICSE.2019.00086.
- Recommending apis for API related questions in stack overflow. IEEE Access, 6:6205–6219, 2018. doi: 10.1109/ACCESS.2017.2777845. URL https://doi.org/10.1109/ACCESS.2017.2777845.
- A survey of learning-based automated program repair. CoRR, abs/2301.03270, 2023c. doi: 10.48550/arXiv.2301.03270. URL https://doi.org/10.48550/arXiv.2301.03270.
- Editing-based SQL query generation for cross-domain context-dependent questions. In Kentaro Inui, Jing Jiang, Vincent Ng, and Xiaojun Wan (eds.), Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, November 3-7, 2019, pp. 5337–5348. Association for Computational Linguistics, 2019b. doi: 10.18653/v1/D19-1537. URL https://doi.org/10.18653/v1/D19-1537.
- System log parsing: A survey. IEEE Trans. Knowl. Data Eng., 35(8):8596–8614, 2023d. doi: 10.1109/TKDE.2022.3222417. URL https://doi.org/10.1109/TKDE.2022.3222417.
- Robust log-based anomaly detection on unstable log data. In Marlon Dumas, Dietmar Pfahl, Sven Apel, and Alessandra Russo (eds.), Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019, Tallinn, Estonia, August 26-30, 2019, pp. 807–817. ACM, 2019c. doi: 10.1145/3338906.3338931. URL https://doi.org/10.1145/3338906.3338931.
- Deepsim: deep learning code functional similarity. In Gary T. Leavens, Alessandro Garcia, and Corina S. Pasareanu (eds.), Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2018, Lake Buena Vista, FL, USA, November 04-09, 2018, pp. 141–151. ACM, 2018. doi: 10.1145/3236024.3236068. URL https://doi.org/10.1145/3236024.3236068.
- GP: context-free grammar pre-training for text-to-sql parsers. CoRR, abs/2101.09901, 2021. URL https://arxiv.org/abs/2101.09901.
- Codegeex: A pre-trained model for code generation with multilingual evaluations on humaneval-x. CoRR, abs/2303.17568, 2023a. doi: 10.48550/arXiv.2303.17568. URL https://doi.org/10.48550/arXiv.2303.17568.
- D2A: A dataset built for ai-based vulnerability detection methods using differential analysis. In 43rd IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, ICSE (SEIP) 2021, Madrid, Spain, May 25-28, 2021, pp. 111–120. IEEE, 2021. doi: 10.1109/ICSE-SEIP52600.2021.00020. URL https://doi.org/10.1109/ICSE-SEIP52600.2021.00020.
- Towards an understanding of large language models in software engineering tasks. CoRR, abs/2308.11396, 2023b. doi: 10.48550/ARXIV.2308.11396. URL https://doi.org/10.48550/arXiv.2308.11396.
- Seq2sql: Generating structured queries from natural language using reinforcement learning. CoRR, abs/1709.00103, 2017. URL http://arxiv.org/abs/1709.00103.
- Grounded adaptation for zero-shot executable semantic parsing. In Bonnie Webber, Trevor Cohn, Yulan He, and Yang Liu (eds.), Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, Online, November 16-20, 2020, pp. 6869–6882. Association for Computational Linguistics, 2020. doi: 10.18653/V1/2020.EMNLP-MAIN.558. URL https://doi.org/10.18653/v1/2020.emnlp-main.558.
- Neural program repair : Systems, challenges and solutions. In Internetware 2022: 13th Asia-Pacific Symposium on Internetware, Hohhot, China, June 11 - 12, 2022, pp. 96–106. ACM, 2022. doi: 10.1145/3545258.3545268. URL https://doi.org/10.1145/3545258.3545268.
- Solving challenging math word problems using GPT-4 code interpreter with code-based self-verification. CoRR, abs/2308.07921, 2023a. doi: 10.48550/ARXIV.2308.07921. URL https://doi.org/10.48550/arXiv.2308.07921.
- Hybrid API migration: A marriage of small API mapping models and large language models. In Hong Mei, Jian Lv, Zhi Jin, Xuandong Li, Xiaohu Yang, and Xin Xia (eds.), Proceedings of the 14th Asia-Pacific Symposium on Internetware, Internetware 2023, Hangzhou, China, August 4-6, 2023, pp. 12–21. ACM, 2023b. doi: 10.1145/3609437.3609466. URL https://doi.org/10.1145/3609437.3609466.
- Devign: Effective vulnerability identification by learning comprehensive program semantics via graph neural networks. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett (eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 10197–10207, 2019. URL https://proceedings.neurips.cc/paper/2019/hash/49265d2447bc3bbfe9e76306ce40a31f-Abstract.html.
- Towards retrieval-based neural code summarization: A meta-learning approach. IEEE Trans. Software Eng., 49(4):3008–3031, 2023c. doi: 10.1109/TSE.2023.3238161. URL https://doi.org/10.1109/TSE.2023.3238161.
- Tools and benchmarks for automated log parsing. In Helen Sharp and Mike Whalen (eds.), Proceedings of the 41st International Conference on Software Engineering: Software Engineering in Practice, ICSE (SEIP) 2019, Montreal, QC, Canada, May 25-31, 2019, pp. 121–130. IEEE / ACM, 2019. doi: 10.1109/ICSE-SEIP.2019.00021. URL https://doi.org/10.1109/ICSE-SEIP.2019.00021.
- Xlcost: A benchmark dataset for cross-lingual code intelligence. CoRR, abs/2206.08474, 2022a. doi: 10.48550/ARXIV.2206.08474. URL https://doi.org/10.48550/arXiv.2206.08474.
- Multilingual code snippets training for program translation. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022, pp. 11783–11790. AAAI Press, 2022b. doi: 10.1609/AAAI.V36I10.21434. URL https://doi.org/10.1609/aaai.v36i10.21434.
- A syntax-guided edit decoder for neural program repair. In Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (eds.), ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, pp. 341–353. ACM, 2021. doi: 10.1145/3468264.3468544. URL https://doi.org/10.1145/3468264.3468544.
- $\mu$μ𝜇\muitalic_μvuldeepecker: A deep learning-based system for multiclass vulnerability detection. IEEE Trans. Dependable Secur. Comput., 18(5):2224–2236, 2021. doi: 10.1109/TDSC.2019.2942930. URL https://doi.org/10.1109/TDSC.2019.2942930.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.