Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

U-CREAT: Unsupervised Case Retrieval using Events extrAcTion (2307.05260v1)

Published 11 Jul 2023 in cs.IR, cs.AI, cs.CL, and cs.LG

Abstract: The task of Prior Case Retrieval (PCR) in the legal domain is about automatically citing relevant (based on facts and precedence) prior legal cases in a given query case. To further promote research in PCR, in this paper, we propose a new large benchmark (in English) for the PCR task: IL-PCR (Indian Legal Prior Case Retrieval) corpus. Given the complex nature of case relevance and the long size of legal documents, BM25 remains a strong baseline for ranking the cited prior documents. In this work, we explore the role of events in legal case retrieval and propose an unsupervised retrieval method-based pipeline U-CREAT (Unsupervised Case Retrieval using Events Extraction). We find that the proposed unsupervised retrieval method significantly increases performance compared to BM25 and makes retrieval faster by a considerable margin, making it applicable to real-time case retrieval systems. Our proposed system is generic, we show that it generalizes across two different legal systems (Indian and Canadian), and it shows state-of-the-art performance on the benchmarks for both the legal systems (IL-PCR and COLIEE corpora).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (52)
  1. Improving BERT-Based Query-by-Document Retrieval with Multi-Task Optimization. In Advances in Information Retrieval: 44th European Conference on IR Research, (ECIR).
  2. A Machine Learning Approach to Prior Case Retrieval. In Proceedings of the Eighth International Conference on Artificial Intelligence and Law (ICAIL).
  3. Predicting Judicial Decisions of the European Court of Human Rights: a Natural Language Processing Perspective. PeerJ Computer Science.
  4. DoSSIER@COLIEE 2021: Leveraging Dense Retrieval and Summarization-based Re-ranking for Case Law Retrieval. arXiv preprint arXiv:2108.03937.
  5. Combining Lexical and Neural Retrieval with Longformer-Based Summarization for Effective Case Law retrieva. In Proceedings of the Second International Conference on Design of Experimental Search & Information REtrieval Systems (DESIRES).
  6. Methods for Computing Legal Document Similarity: A Comparative Study. arXiv preprint arXiv:2004.12307.
  7. Shivangi Bithel and Sumitra S Malagi. 2021. Unsupervised Identification of Relevant Prior Cases. arXiv preprint arXiv:2107.08973.
  8. Neural Legal Judgment Prediction in English. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics(ACL).
  9. Nathanael Chambers and Dan Jurafsky. 2008. Unsupervised Learning of Narrative Event Chains. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-08:HLT).
  10. Nathanael Chambers and Dan Jurafsky. 2009. Unsupervised Learning of Narrative Schemas and Their Participants. In Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics (ACL).
  11. Charge-Based Prison Term Prediction with Deep Gating Network. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, (EMNLP-IJCNLP).
  12. Event-Centric Natural Language Processing. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Tutorial Abstracts (ACL-IJCNLP).
  13. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT).
  14. Legal Judgment Prediction: A Survey of the State of the Art. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (IJCAI).
  15. Towards Automatic Generation of Catchphrases for Legal Case Reports. In Computational Linguistics and Intelligent Text Processing - 13th International Conference, (CICLing).
  16. Simcse: Simple contrastive learning of sentence embeddings. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP).
  17. Goran Glavaš and Jan Šnajder. 2014. Event Graphs for Information Retrieval and Multi-Document Summarization. Expert Systems with Applications, Elsevier.
  18. Montani Ines Honnibal Matthew and Boyd Adriane Van Landeghem Sofie. 2020. spaCy: Industrial-Strength Natural Language Processing in Python.
  19. Information Extraction from Case Law and Retrieval of Prior Cases. Artificial Intelligence, Elsevier.
  20. Corpus for Automatic Structuring of Legal Documents. In Proceedings of the 13th Language Resources and Evaluation Conference -Association for Computational Linguistics (ACL-LREC).
  21. HLDC: Hindi Legal Documents Corpus. In Findings of the Association for Computational Linguistics (ACL).
  22. Statute Law Information Retrieval and Entailment. In Proceedings of the Seventeenth International Conference on Artificial Intelligence and Law (ICAIL).
  23. Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. arXiv preprint arXiv:1412.6980.
  24. Dependency parsing. Synthesis Lectures on Human Language Technologies (SLHLT), Springer.
  25. Similarity Analysis of Legal Judgments. In COMPUTE ’11: Proceedings of the 4th Annual Association for Computing Machinery.
  26. Finding Similar Legal Judgements under Common Law System. In Databases in Networked Information Systems (DNIS),Springer.
  27. Automatic Judgment Prediction via Legal Reading Comprehension. In Chinese Computational Linguistics - 18th China National Conference, (CCL) Springer.
  28. Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In International Conference on Learning Representations (ICLR).
  29. Retrieving Legal Cases from a Large-scale Candidate Corpus. In Proceedings of the Eighth International Competition on Legal Information Extraction/Entailment (COLIEE).
  30. Semantic Segmentation of Legal Documents via Rhetorical Roles. In Proceedings of the Natural Legal Language Processing Workshop (NLLP) EMNLP.
  31. ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP).
  32. Measuring Similarity among Legal Court Case Documents. In Compute ’17: Proceedings of the 10th Annual ACM India Compute Conference.
  33. Finding Relevant Indian Judgments Using Dispersion of Citation Network. In Proceedings of the 24th International Conference on World Wide Web.
  34. Ashutosh Modi. 2016. Event Embeddings for Semantic Script Modeling. In Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL).
  35. Ashutosh Modi and Ivan Titov. 2014. Inducing Neural Models of Script Knowledge. In Proceedings of the 18th SIGNLL Conference on Computational Natural Language Learning (CoNLL).
  36. Modeling semantic expectation: Using script knowledge for referent prediction. Transactions of the Association for Computational Linguistics (TACL).
  37. National Judicial Data Grid. 2021. National judicial data grid statistics. https://www.njdg.ecourts.gov.in/njdgnew/index.php.
  38. JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021. arXiv preprint arXiv:2106.13405.
  39. Pre-training Transformers on Indian Legal Text. arXiv preprint arXiv:2209.06049.
  40. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research (JMLR).
  41. Overview and Discussion of the Competition on Legal Information Extraction/Entailment (COLIEE) 2021. The Review of Socionetwork Strategies.
  42. Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP).
  43. Yes, BM25 is a strong baseline for legal case retrieval. arXiv preprint arXiv:2105.05686.
  44. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
  45. BERT-PLI: Modeling Paragraph-Level Interactions for Legal Case Retrieval. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI).
  46. Building Legal Case Retrieval Systems with Lexical Matching and Summarization Using A Pre-Trained Phrase Scoring Model. In Proceedings of the Seventeenth International Conference on Artificial Intelligence and Law (ICAIL).
  47. Hierarchical Matching Network for Crime Classification. In Proceedings of the 42nd International ACM Conference on Research and Development in Information Retrieval, (SIGIR).
  48. Modeling Dynamic Pairwise Attention for Crime Classification over Legal Articles. In The 41st International ACM Conference on Research & Development in Information Retrieval (SIGIR) .
  49. Transformers: State-of-the-Art Natural Language Processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP).
  50. Distinguish Confusing Law Articles for Legal Judgment Prediction. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL) .
  51. Legal Judgment Prediction via Multi-Perspective Bi-Feedback Network. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI).
  52. Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference (IAAI), The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI).
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Abhinav Joshi (14 papers)
  2. Akshat Sharma (4 papers)
  3. Sai Kiran Tanikella (2 papers)
  4. Ashutosh Modi (60 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.