Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency
Abstract: The ever-increasing workload of digital forensic labs raises concerns about law enforcement's ability to conduct both cyber-related and non-cyber-related investigations promptly. Consequently, this article explores the potential and usefulness of integrating LLMs into digital forensic investigations to address challenges such as bias, explainability, censorship, resource-intensive infrastructure, and ethical and legal considerations. A comprehensive literature review is carried out, encompassing existing digital forensic models, tools, LLMs, deep learning techniques, and the use of LLMs in investigations. The review identifies current challenges within existing digital forensic processes and explores both the obstacles and the possibilities of incorporating LLMs. In conclusion, the study states that the adoption of LLMs in digital forensics, with appropriate constraints, has the potential to improve investigation efficiency, improve traceability, and alleviate the technical and judicial barriers faced by law enforcement entities.
- Transformer models for text-based emotion detection: a review of BERT-based approaches. Artificial Intelligence Review (2021), 1–41.
- GPT-4 Technical Report. CoRR abs/2303.08774 (2023). https://doi.org/10.48550/ARXIV.2303.08774 arXiv:2303.08774
- A Review of Mobile Forensic Investigation Process Models. IEEE Access 8 (2020), 173359–173375. https://doi.org/10.1109/ACCESS.2020.3014615
- Character-level language modeling with deeper self-attention. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 3159–3166.
- Flamingo: a Visual Language Model for Few-Shot Learning. In Advances in Neural Information Processing Systems, Vol. 35. Curran Associates, Inc., 23716–23736.
- Liaqat Ali. 2019. Cyber crimes-A constant threat for the business sectors and its growth (A study of the online banking sectors in GCC). The Journal of Developing Areas 53, 1 (2019).
- SantaCoder: don’t reach for the stars! Deep Learning for Code (DL4C) Workshop ([n. d.]). https://par.nsf.gov/biblio/10416454
- Structural Language Models of Code. In International Conference on Machine Learning. PMLR, 245–256.
- ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing. arXiv:cs.HC/2309.09128
- Meskó B. 2022. Prompt Engineering as an Important Emerging Skill for Medical Professionals: Tutorial. J Med Internet Res 2023;25:e50638 (2022). https://doi.org/10.2196/50638
- Venansius Baryamureeba and Florence Tushabe. 2004. The enhanced digital investigation process model. Digital Investigation (2004).
- On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (Virtual Event, Canada) (FAccT ’21). Association for Computing Machinery, New York, NY, USA, 610–623. https://doi.org/10.1145/3442188.3445922
- On the Opportunities and Risks of Foundation Models. (2022). arXiv:cs.LG/2108.07258
- Large Language Model-Based Artificial Intelligence in the Language Classroom: Practical Ideas for Teaching. Teaching English with Technology 23, 1 (2023).
- Andres M Bran and Philippe Schwaller. 2023. Transformers and Large Language Models for Chemistry and Drug Discovery. (2023). arXiv:cs.LG/2310.06083
- DFRWS EU 10-Year Review and Future Directions in Digital Forensic Research. Forensic Science International: Digital Investigation (03 2024).
- Language Models Are Few-Shot Learners. In Proceedings of the 34th International Conference on Neural Information Processing Systems (Vancouver, BC, Canada) (NIPS’20). Curran Associates Inc., Red Hook, NY, USA, Article 159.
- On the Application of Large Language Models for Language Teaching and Assessment Technology. (2023). arXiv:cs.CL/2307.08393
- Inês Carvalho and Stanislav Ivanov. 2023. ChatGPT for tourism: applications, benefits and risks. Tourism Review (2023).
- Eoghan Casey. 2011. Digital Evidence and Computer Crime: Forensic Science, Computers, and the Internet. Academic press.
- ENFSI guideline for evaluative reporting in forensic science: A primer for legal practitioners. Criminal Law and Justice Weekly 180, 10 (2016), 189–193.
- A Survey on Evaluation of Large Language Models. ACM Trans. Intell. Syst. Technol. (jan 2024). https://doi.org/10.1145/3641289
- CodeT: Code Generation with Generated Tests. In The Eleventh International Conference on Learning Representations.
- Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models. arXiv:cs.CV/2308.13437
- GameGPT: Multi-agent Collaborative Framework for Game Development. arXiv:cs.AI/2310.08067
- Evaluating Large Language Models Trained on Code. ArXiv abs/2107.03374 (2021).
- Cheng-Han Chiang and Hung yi Lee. 2023. Can Large Language Models Be an Alternative to Human Evaluations? arXiv:cs.CL/2305.01937
- PaLM: Scaling Language Modeling with Pathways. Journal of Machine Learning Research 24, 240 (2023), 1–113. http://jmlr.org/papers/v24/22-1144.html
- PanGu-Coder: Program Synthesis with Function-Level Language Modeling. ArXiv abs/2207.11280 (2022).
- Robert Dale. 2021. GPT-3: What’s it good for? Natural Language Engineering 27, 1 (2021), 113–118.
- Erik Derner and Kristina Batistič. 2023. Beyond the Safeguards: Exploring the Security Risks of ChatGPT. (2023). arXiv:cs.CR/2305.08005
- QLoRA: Efficient Finetuning of Quantized LLMs. (2023). arXiv:cs.LG/2305.14314
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short Papers), Jill Burstein, Christy Doran, and Thamar Solorio (Eds.). Association for Computational Linguistics, 4171–4186. https://doi.org/10.18653/V1/N19-1423
- Evaluation of Digital Forensic Process Models with Respect to Digital Forensics as a Service. In Proceedings of the 16th European Conference on Cyber Warfare and Security (ECCWS 2017). ACPI, Dublin, Ireland, 573–581.
- Xiaoyu Du and Mark Scanlon. 2019. Methodology for the Automated Metadata-Based Classification of Incriminating Digital Forensic Artefacts. In Proceedings of the 14th International Conference on Availability, Reliability and Security (Canterbury, CA, United Kingdom) (ARES ’19). Association for Computing Machinery, New York, NY, USA, Article 43. https://doi.org/10.1145/3339252.3340517
- Digital Forensics Techniques and Trends: A Review. The International Arab Journal of Information Technology (IAJIT) 20, 4 (2023), 644–654.
- Implications of large language models such as ChatGPT for dental medicine. Journal of Esthetic and Restorative Dentistry (2023).
- A Bibliometric Review of Large Language Models Research from 2017 to 2023. ArXiv abs/2304.02020 (2023).
- InCoder: A Generative Model for Code Infilling and Synthesis. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net. https://openreview.net/pdf?id=hQwb-lbM6EL
- Leon Fröhling and Arkaitz Zubiaga. 2021. Feature-Based Detection of Automated Language Models: Tackling GPT-2, GPT-3, and Grover. PeerJ Computer Science 7 (2021), e443.
- LLM Censorship: A Machine Learning Challenge or a Computer Security Problem? (2023). arXiv:cs.AI/2307.10719
- Large Language Models: A Comprehensive Survey of its Applications, Challenges, Limitations, and Future Prospects. (11 2023). https://doi.org/10.36227/techrxiv.23589741.v4
- Hans Henseler and Harm van Beek. 2023. ChatGPT as a Copilot for Investigating Digital Evidence. In Proceedings of the Third International Workshop on Artificial Intelligence and Intelligent Assistance for Legal Professionals in the Digital Workplace (LegalAIIA 2023) co-located with the 19th International Conference on Artificial Intelligence and Law (ICAIL 2023), Braga, Portugal, June 19, 2023 (CEUR Workshop Proceedings), Jack G. Conrad, Daniel W. Linna Jr., Jason R. Baron, Hans Henseler, Paheli Bhattacharya, Aileen Nielsen, Jyothi K. Vinjumur, Jeremy Pickens, and Amanda Jones (Eds.), Vol. 3423. CEUR-WS.org, 58–69. https://ceur-ws.org/Vol-3423/paper6.pdf
- Large Language Models for Software Engineering: A Systematic Literature Review. ArXiv abs/2308.10620 (2023).
- Joshua James and Pavel Gladyshev. 2013. Challenges with Automation in Digital Forensic Investigations. ArXiv abs/1303.4498 (2013).
- Maximizing Investigation Effectiveness in Digital Forensic Cases. In 2013 International Conference on Social Computing. 618–623. https://doi.org/10.1109/SocialCom.2013.93
- Mert Karabacak and Konstantinos Margetis. 2023. Embracing Large Language Models for Medical Applications: Opportunities and Challenges. Cureus 15, 5 (2023).
- On the Importance of Standardizing the Process of Generating Digital Forensic Reports. Forensic Science International: Reports 1 (2019), 100008. https://doi.org/10.1016/j.fsir.2019.100008
- Rethinking Positional Encoding in Language Pre-training. (2021). https://openreview.net/forum?id=09-528y2Fgf
- Multimodal Neural Language Models. In Proceedings of the 31st International Conference on Machine Learning (Proceedings of Machine Learning Research), Vol. 32. PMLR, Bejing, China, 595–603. https://proceedings.mlr.press/v32/kiros14.html
- Barbara Kitchenham. 2004. Procedures for Performing Systematic Reviews. Keele, UK, Keele University 33, 2004 (2004), 1–26.
- Generating Images with Multimodal Language Models. In Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023, Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, and Sergey Levine (Eds.). http://papers.nips.cc/paper_files/paper/2023/hash/43a69d143273bd8215578bde887bb552-Abstract-Conference.html
- Optimizing the Use of Technology in Policing: Results and Implications from a Multi-Site Study of the Social, Organizational, and Behavioural Aspects of Implementing Police Technologies. Policing: A Journal of Policy and Practice 8, 2 (05 2014), 212–221. https://doi.org/10.1093/police/pau015
- How to generate a good word embedding. IEEE Intelligent Systems 31, 6 (2016), 5–14.
- Deep learning. Nature 521, 7553 (2015), 436–444.
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. In Advances in Neural Information Processing Systems, Vol. 33. Curran Associates, Inc., 9459–9474.
- BLIP-2: bootstrapping language-image pre-training with frozen image encoders and large language models. , Article 814 (2023).
- StarCoder: May the Source Be with You! Transactions on Machine Learning Research (2023). https://openreview.net/forum?id=KoFOg41haE Reproducibility Certification.
- Competition-Level Code Generation with AlphaCode. Science 378, 6624 (2022), 1092–1097. https://doi.org/10.1126/science.abq1158 arXiv:https://www.science.org/doi/pdf/10.1126/science.abq1158
- Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection. arXiv:cs.CV/2310.19070
- AgentSims: An Open-Source Sandbox for Large Language Model Evaluation. arXiv:cs.AI/2308.04026
- Improved Baselines with Visual Instruction Tuning. CoRR abs/2310.03744 (2023). https://doi.org/10.48550/ARXIV.2310.03744 arXiv:2310.03744
- Visual Instruction Tuning. In Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023, Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, and Sergey Levine (Eds.). http://papers.nips.cc/paper_files/paper/2023/hash/6dcf277ea32ce3288914faf369fe6de0-Abstract-Conference.html
- Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation. arXiv:cs.SE/2305.01210
- InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language. (2023). arXiv:cs.CV/2305.05662
- Brady D Lund and Ting Wang. 2023. Chatting About ChatGPT: How AI and GPT May Impact Academia and Libraries? Library Hi Tech News 40, 3 (2023), 26–29.
- UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation. ArXiv abs/2002.06353 (2020).
- WizardCoder: Empowering Code Large Language Models with Evol-Instruct. (2024). https://openreview.net/forum?id=UnUwSIgK5W
- Raymond Lutui. 2016. A multidisciplinary digital forensic investigation process model. Business Horizons 59, 6 (2016), 593–604.
- Gaëtan Michelet and Frank Breitinger. 2023. ChatGPT, Llama, can you write my report? An experiment on assisted digital forensics reports written using (Local) Large Language Models. CoRR abs/2312.14607 (2023). https://doi.org/10.48550/ARXIV.2312.14607 arXiv:2312.14607
- Automation for Digital Forensics: Towards a definition for the community. Forensic Science International (2023), 111769.
- Efficient Estimation of Word Representations in Vector Space. In 1st International Conference on Learning Representations, ICLR 2013, Scottsdale, Arizona, USA, May 2-4, 2013, Workshop Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1301.3781
- Analysis of digital forensic investigation models. Int. J. Comput. Sci. Inform. Secur 14, 11 (2016).
- Empowering Education with LLMs: The Next-Gen Interface and Content Generation. In International Conference on Artificial Intelligence in Education. Springer, 32–37.
- Shaaswat Mukherjee and Shazia Haque. 2018. Review Paper on Digital Forensics Practices: A Road Map for Building Digital Forensics Capability. Iconic Research and Engineering Journals 1, 9 (2018), 96–99.
- CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis. In The Eleventh International Conference on Learning Representations.
- Taking AI Risks Seriously: A New Assessment Model for the AI Act. AI & Society (12 07 2023). https://doi.org/10.1007/s00146-023-01723-z
- Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals. In Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion. Association for Computational Linguistics.
- Program Synthesis with Large Language Models. (2021). arXiv:2108.07732
- DialogBench: Evaluating LLMs as Human-like Dialogue Systems. (2023). arXiv:cs.CL/2311.01677
- The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data Only. (2023). https://openreview.net/forum?id=kM5eGcdCzq
- Kosmos-2: Grounding Multimodal Large Language Models to the World. CoRR abs/2306.14824 (2023). https://doi.org/10.48550/ARXIV.2306.14824 arXiv:2306.14824
- Maciej P. Polak and Dane Morgan. 2023. Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering - Example of ChatGPT. CoRR abs/2303.05352 (2023). https://doi.org/10.48550/ARXIV.2303.05352 arXiv:2303.05352
- The Framework to Support the Digital Evidence Handling: A Case Study of Procedures for the Management of Evidence in Indonesia. Journal of Cases on Information Technology (JCIT) 22, 3 (2020), 51–71.
- What Is the Limitation of Multimodal LLMs? A Deeper Look into Multimodal LLMs Through Prompt Probing. Information Processing & Management 60, 6 (2023), 103510. https://doi.org/10.1016/j.ipm.2023.103510
- Communicative Agents for Software Development. ArXiv abs/2307.07924 (2023).
- Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models. arXiv:cs.CV/2207.07646
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs. https://openreview.net/forum?id=dHng2O0Jjr
- Learning Transferable Visual Models From Natural Language Supervision. In Proceedings of the 38th International Conference on Machine Learning (Proceedings of Machine Learning Research), Vol. 139. PMLR, 8748–8763.
- Noorjahan Rahman and Eduardo Santacana. 2023. Beyond Fair Use: Legal Risk Evaluation for Training LLMs on Copyrighted Text. In ICML Workshop on Generative AI and Law.
- Evaluating the Text-to-SQL Capabilities of Large Language Models. arXiv:cs.CL/2204.00498
- Zero-Shot Text-to-Image Generation. In Proceedings of the 38th International Conference on Machine Learning (Proceedings of Machine Learning Research), Vol. 139. PMLR, 8821–8831.
- Partha Pratim Ray. 2023. ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems 3 (2023), 121–154. https://doi.org/10.1016/j.iotcps.2023.04.003
- Review on natural language processing. IRACST Engineering Science and Technology: An International Journal (ESTIJ) 3, 1 (2013), 113–116.
- Risks and Benefits of Large Language Models for the Environment. Environmental Science & Technology 57, 9 (2023), 3464–3466. https://doi.org/10.1021/acs.est.3c01106 PMID: 36821477.
- The performance of ChatGPT on orthopaedic in-service training exams: A comparative study of the GPT-3.5 turbo and GPT-4 models in orthopaedic education. Journal of Orthopaedics 50 (2024), 70–75. https://doi.org/10.1016/j.jor.2023.11.056
- Code Llama: Open Foundation Models for Code. (2024). arXiv:cs.CL/2308.12950
- ChatGPT for digital forensic investigation: The good, the bad, and the unknown. Forensic Science International: Digital Investigation 46 (2023), 301609. https://doi.org/10.1016/j.fsidi.2023.301609
- Digital forensic investigation in the age of ChatGPT. Forensic Science International: Digital Investigation 44 (03 2023), 301543. https://doi.org/10.1016/j.fsidi.2023.301543
- Protecting digital evidence integrity and preserving chain of custody. Journal of Digital Forensics, Security and Law 12, 2 (2017), 12.
- PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback. (2023). arXiv:cs.CL/2307.14936
- ChatGPT and Other Large Language Models Are Double-edged Swords. Radiology 307, 2 (2023), e230163. https://doi.org/10.1148/radiol.230163 PMID: 36700838.
- Reflexion: Language Agents with Verbal Reinforcement Learning. In Thirty-seventh Conference on Neural Information Processing Systems.
- Transformer-based Sentiment Analysis for Anomaly Detection on Drone Forensic Timeline. In 2023 11th International Symposium on Digital Forensics and Security (ISDFS). 1–6. https://doi.org/10.1109/ISDFS58141.2023.10131749
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Transactions on Machine Learning Research (2023). https://openreview.net/forum?id=uyTL5Bvosj
- The Science of Detecting LLM-Generated Texts. (2023). arXiv:cs.CL/2303.07205
- ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 17918–17928.
- From Humans to Machines: Can ChatGPT-like LLMs Effectively Replace Human Annotators in NLP Tasks. In Workshop Proceedings of the 17th International AAAI Conference on Web and Social Media.
- Large language models in medicine. Nature Medicine 29, 8 (2023), 1930–1940.
- LaMDA: Language Models for Dialog Applications. ArXiv abs/2201.08239 (2022).
- LLaMA: Open and Efficient Foundation Language Models. ArXiv abs/2302.13971 (2023).
- Exploring the use of large language models (LLMs) in chemical engineering education: Building core course problem models with Chat-GPT. Education for Chemical Engineers 44 (2023), 71–95.
- Multimodal research in vision and language: A review of current and emerging trends. Information Fusion 77 (2022), 149–171. https://doi.org/10.1016/j.inffus.2021.07.009
- ISO/IEC 27043:2015 — Role and Application. In 2016 24th Telecommunications Forum (TELFOR). 1–4. https://doi.org/10.1109/TELFOR.2016.7818718
- Digital Forensics as a Service: A game changer. Digital Investigation 11 (2014), S54–S62. https://doi.org/10.1016/j.diin.2014.03.007 Proceedings of the First Annual DFRWS Europe.
- Digital Forensics as a Service: Game on. Digital Investigation 15 (2015), 20–38. https://doi.org/10.1016/j.diin.2015.07.004 Special Issue: Big Data and Intelligent Data Analysis.
- Digital forensics as a service: Stepping up the game. Forensic Science International: Digital Investigation 35 (2020), 301021.
- Digital forensics as a service: Game on. Digital Investigation 15 (2015), 20–38.
- Attention is All you Need. In Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.), Vol. 30. Curran Associates, Inc.
- Large Language Models for Business Process Management: Opportunities and Challenges. In Business Process Management Forum, Chiara Di Francescomarino, Andrea Burattin, Christian Janiesch, and Shazia Sadiq (Eds.). Springer Nature Switzerland, Cham, 107–123.
- Voyager: An Open-Ended Embodied Agent with Large Language Models. arXiv:cs.AI/2305.16291
- On the Origin of Deep Learning. CoRR abs/1702.07800 (2017). arXiv:1702.07800 http://arxiv.org/abs/1702.07800
- A Survey on Large Language Model based Autonomous Agents. CoRR abs/2308.11432 (2023). https://doi.org/10.48550/ARXIV.2308.11432 arXiv:2308.11432
- VisionLLM: Large Language Model Is Also an Open-Ended Decoder for Vision-Centric Tasks. arXiv:cs.CV/2305.11175
- RecMind: Large Language Model Powered Agent For Recommendation. arXiv:cs.IR/2308.14296
- CodeT5+: Open Code Large Language Models for Code Understanding and Generation. (Dec. 2023), 1069–1088. https://doi.org/10.18653/v1/2023.emnlp-main.68
- Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners. In Advances in Neural Information Processing Systems, Vol. 35. Curran Associates, Inc., 8483–8497.
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. Advances in Neural Information Processing Systems 35 (2022), 24824–24837.
- A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT. (2023). arXiv:cs.SE/2302.11382
- Fundamental Limitations of Alignment in Large Language Models. (2024). arXiv:cs.CL/2304.11082
- Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models. ArXiv abs/2303.04671 (2023).
- AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation. Technical Report MSR-TR-2023-33. Microsoft. https://www.microsoft.com/en-us/research/publication/autogen-enabling-next-gen-llm-applications-via-multi-agent-conversation-framework/
- Digital forensic tools: Recent advances and enhancing the status quo. Forensic Science International: Digital Investigation 34 (2020), 300999.
- A Systematic Evaluation of Large Language Models of Code. In Proceedings of the 6th ACM SIGPLAN International Symposium on Machine Programming (San Diego, CA, USA) (MAPS 2022). Association for Computing Machinery, New York, NY, USA, 1–10. https://doi.org/10.1145/3520312.3534862
- LLMs’ Capabilities at the High School Level in Chemistry: Cases of ChatGPT and Microsoft Bing Chat. (2023). https://doi.org/10.26434/chemrxiv-2023-kxxpd
- Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization. arXiv:cs.CL/2302.08081
- A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models. ArXiv abs/2303.10420 (2023).
- Building Cooperative Embodied Agents Modularly with Large Language Models. https://openreview.net/forum?id=oBQVCTpKXW
- Vision-Language Models for Vision Tasks: A Survey. arXiv:cs.CV/2304.00685
- GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest. (2024). https://openreview.net/forum?id=DzxaRFVsgC
- DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. Association for Computational Linguistics, Online, 270–278. https://doi.org/10.18653/v1/2020.acl-demos.30
- SVIT: Scaling up Visual Instruction Tuning. arXiv:cs.CV/2307.04087
- Learning Video Representations From Large Language Models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 6586–6597.
- CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X. (2023), 5673–5684. https://doi.org/10.1145/3580305.3599790
- Least-to-Most Prompting Enables Complex Reasoning in Large Language Models. In The Eleventh International Conference on Learning Representations.
- Large Language Models are Human-Level Prompt Engineers. (2023). https://openreview.net/forum?id=92gvk82DE-
- MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models. https://openreview.net/forum?id=1tZbq88f27
- Large Language Models for Information Retrieval: A Survey. ArXiv abs/2308.07107 (2023).
- Fine-Tuning Language Models from Human Preferences. (2020). arXiv:cs.CL/1909.08593
- Universal and Transferable Adversarial Attacks on Aligned Language Models. CoRR abs/2307.15043 (2023). https://doi.org/10.48550/ARXIV.2307.15043 arXiv:2307.15043
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.