Combining topic modelling and citation network analysis to study case law from the European Court on Human Rights on the right to respect for private and family life (2401.16429v1)
Abstract: As legal case law databases such as HUDOC continue to grow rapidly, it has become essential for legal researchers to find efficient methods to handle such large-scale data sets. Such case law databases usually consist of the textual content of cases together with the citations between them. This paper focuses on case law from the European Court of Human Rights on Article 8 of the European Convention of Human Rights, the right to respect private and family life, home and correspondence. In this study, we demonstrate and compare the potential of topic modelling and citation network to find and organize case law on Article 8 based on their general themes and citation patterns, respectively. Additionally, we explore whether combining these two techniques leads to better results compared to the application of only one of the methods. We evaluate the effectiveness of the combined method on a unique manually collected and annotated dataset of Aricle 8 case law on evictions. The results of our experiments show that our combined (text and citation-based) approach provides the best results in finding and grouping case law, providing scholars with an effective way to extract and analyse relevant cases on a specific issue.
- George Letsas. The echr as a living instrument: Its meaning and legitimacy. Constituting Europe: The European Court of Human Rights in a National, European and Global Context, 2:106, 2013.
- Helena Wray. Article 8 ECHR, Family Reunification and the UK’s Supreme Court: Family Matters? Bloomsbury Publishing, 2023.
- Common trends in eviction research: a systematic literature review, pages 1–88. Studies in Housing Law. Eleven International Publishing, 2019.
- Automatic assignment of section structure to texts of dutch court judgments. In Legal knowledge and information systems, pages 167–172. IOS Press, 2016.
- The supreme court and the judicial genre. Ariz. L. Rev., 59:837, 2017.
- Recognizing cited facts and principles in legal judgements. Artificial Intelligence and Law, 25(1):107–126, 2017.
- Claudette: an automated detector of potentially unfair clauses in online terms of service. Artificial Intelligence and Law, 27:117–139, 2019.
- Detecting and explaining unfairness in consumer contracts through memory networks. Artificial Intelligence and Law, 30(1):59–92, 2022.
- Predicting citations in dutch case law with natural language processing. Artificial Intelligence and Law, pages 1–31, 2023.
- Using machine learning to predict decisions of the European Court of Human Rights. Artificial Intelligence and Law, pages 1–30, 2020.
- Automatic judgement forecasting for pending applications of the European Court of Human Rights. In The Fifth Workshop on Automatec Semantic Analysis of Information in Legal Text, 2021.
- Rethinking the field of automatic prediction of court decisions. Artificial Intelligence and Law, pages 1–18, 2022.
- Ylja Remmits. Finding the topics of case law: Latent dirichlet allocation on supreme court decisions. 2017.
- Arthur Dyevre. Text-mining for lawyers: how machine learning techniques can advance our understanding of legal discourse. Erasmus Law Review, 14:7, 2021.
- Topic modelling of legal documents via legal-bert. Proceedings of the First International Workshop RELATED - Relations in the Legal Domain, 1613:0073, 2021.
- Topic modelling supreme court case decisions using latent dirichlet allocation. In The 13th International Conference on Information and Communication Technology Convergence (ICTC), pages 284–289. IEEE, 2022.
- Pedro Henrique Luz De Araujo and Teófilo De Campos. Topic modelling brazilian supreme court lawsuits. In Legal Knowledge and Information Systems, pages 113–122. IOS Press, 2020.
- The authority of supreme court precedent. Social networks, 30(1):16–30, 2008.
- Holding complexity: analysing the cjeu’s vat case law as a network. World Journal of VAT/GST Law, 3(3):141–165, 2014.
- Identification of case content with quantitative network analysis: An example from the ecthr. In Legal Knowledge and Information Systems, pages 53–62, 2016.
- Ryan Whalen. Legal networks: The promises and challenges of legal network analysis. Mich. St. L. Rev., page 539, 2016.
- Jens Frankenreiter. Network analysis and the use of precedent in the case law of the cjeu–a reply to derlén and lindholm. German Law Journal, 18(3):687–694, 2017.
- Finding hidden patterns in ecthr’s case law: On how citation network analysis can improve our knowledge of ecthr’s article 14 practice. International Journal of Discrimination and the Law, 17(1):4–22, 2017.
- Is it good law? network analysis and the cjeu’s internal market jurisprudence. Journal of International Economic Law, 20(2):257–277, 2017.
- Can quantitative methods complement doctrinal legal studies? using citation network and corpus linguistic analysis to understand international courts. Leiden Journal of International Law, 30(2):327–349, 2017.
- Scaling court decisions with citation networks. Journal of Law and Courts, 11(1):25–44, 2023.
- Łukasz Górski. Network science in law: A framework for polish case-law citation network analysis. IT Professional, 23(5):62–66, 2021.
- Mapping europe’s cosmopolitan legal order: A network analysis of the european court of human rights, the court of justice of the european union, and high national courts. Eur. J. Legal Stud., 13:45, 2021.
- ‘don’t use a sledgehammer to crack a nut’: less restrictive means in the case law of the european court of human rights. Human Rights Law Review, 15(1):139–168, 2015.
- Michel Vols. Legal Research: One Hundred Questions and Answers. Eleven, 2021.
- Michel Vols. The optional protocol to the icescr, homelessness and moral hazard: The alternative adequate housing requirement in the cescr’s jurisprudence–an incentive not to pay for housing? International Human Rights Law Review, 1(aop):1–25, 2023.
- Deconstructing the eviction protections under the revised european social charter: A systematic content analysis of the interplay between the right to housing and the right to property. Human Rights Law Review, 23(4):ngad022, 2023.
- Named entity recognition and resolution in legal text. Springer, 2010.
- A low-cost, high-coverage legal named entity recognizer, classifier and linker. In Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law, pages 9–18, 2017.
- From one to many: Identifying issues in cjeu jurisprudence. Journal of Law and Courts, 11(1):163–186, 2023.
- Varun Pandya. Automatic text summarization of legal cases: A hybrid approach. arXiv preprint arXiv:1908.09119, 2019.
- Text summarization from legal documents: a survey. Artificial Intelligence Review, 51:371–402, 2019.
- Summarization based on bi-directional citation analysis. Information processing & management, 51(1):1–24, 2015.
- Ravi Kumar and K Raghuveer. Legal document summarization using latent dirichlet allocation. International Journal of Computer Science and Telecommunications, 3(7):8–23, 2012.
- Summarization of legal texts with high cohesion and automatic compression rate. In New Frontiers in Artificial Intelligence: JSAI-isAI 2012 Workshops, LENLS, JURISIN, MiMI, Miyazaki, Japan, November 30 and December 1, 2012, Revised Selected Papers 4, pages 190–204. Springer, 2013.
- Predicting judicial decisions of the european court of human rights: A natural language processing perspective. PeerJ Computer Science, 2:e93, 2016.
- Automatically identifying eviction cases and outcomes within case law of Dutch courts of first instance. In Legal Knowledge and Information Systems, pages 13–22. IOS Press, 2021.
- Rethinking the field of automatic prediction of court decisions. Artificial Intelligence and Law, 31(1):195–212, 2023.
- Smart literature review: a practical topic modelling approach to exploratory literature review. Journal of Big Data, 6(1):1–18, 2019.
- A bibliometric analysis of topic modelling studies (2000–2017). Journal of Information Science, 47(2):161–175, 2021.
- Reading the high court at a distance: topic modelling the legal subject matter and judicial activity of the high court of australia, 1903-2015. The University of New South Wales Law Journal, 39(4):1300–1354, 2016.
- Using topic modeling in classification of brazilian lawsuits. In Computational Processing of the Portuguese Language: 15th International Conference, PROPOR 2022, Fortaleza, Brazil, March 21–23, 2022, Proceedings, pages 233–242. Springer, 2022.
- Topic modelling of the czech supreme court decisions. ASAIL 2020 Automated Semantic Analysis of Information in Legal Text, 2020.
- Why do tenants sue their landlords? answers from a topic model. In Legal Knowledge and Information Systems, pages 113–122. IOS Press, 2022.
- Kyra Wigard. Matter of opinion: Assessing the role of individual judicial opinions at the international criminal court. International Criminal Law Review, 1(aop):1–29, 2023.
- An analysis of topic modelling for legislative texts. CEUR Workshop Proceedings, 2016.
- Exploring the use of topic analysis in latvian legal documents. In Proceedings of the First International Workshop” CAiSE for Legal Documents”(COUrT 2020) co-located with the 32nd International Conference on Advanced Information Systems Engineering (CAiSE 2020), Grenoble, France, volume 2690, pages 39–47, 2020.
- Quantifying long-term impact of court decisions. Applied Network Science, 4(1):1–15, 2019.
- Emergence of network effects and predictability in the judicial system. Scientific reports, 11(1):1–10, 2021.
- Lewis Graham. Strategic admissibility decisions in the european court of human rights. International & Comparative Law Quarterly, 69(1):79–102, 2020.
- Lize R Glas. The age of subsidiarity? the ecthr’s approach to the admissibility requirement that applicants raise their convention complaint before domestic courts. Netherlands Quarterly of Human Rights, page 09240519231169837, 2023.
- Horizontality and housing rights: Protection against private evictions from a european and south african perspective. European Journal of Comparative Law and Governance, 9(2):118–151, 2022.
- Identifying latent toxic features on youtube using non-negative matrix factorization. In The Ninth International Conference on Social Media Technologies, Communication, and Informatics, IEEE, 2019.
- Latent dirichlet allocation. Journal of machine Learning research, 3(Jan):993–1022, 2003.
- Dimo Angelov. Top2vec: Distributed representations of topics. arXiv preprint arXiv:2008.09470, 2020.
- Maarten Grootendorst. Bertopic: Neural topic modeling with a class-based tf-idf procedure. arXiv preprint arXiv:2203.05794, 2022.
- spaCy 2: Natural language understanding with Bloom embeddings, convolutional neural networks and incremental parsing. To appear, 2017.
- Software Framework for Topic Modelling with Large Corpora. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, pages 45–50, Valletta, Malta, May 2010. ELRA. http://is.muni.cz/publication/884893/en.
- Full-text or abstract? examining topic coherence scores using latent dirichlet allocation. In The IEEE International conference on data science and advanced analytics (DSAA), pages 165–174. IEEE, 2017.
- Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 9(11), 2008.
- Albert-László Barabási. Network science. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 371(1987):20120375, 2013.
- Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment, 2008(10):P10008, 2008.