Improving Legal Case Retrieval with Brain Signals (2403.13242v1)
Abstract: Legal case retrieval has received growing attention from the IR community over the last decade. Relevance feedback techniques based on implicit user feedback (e.g., clicks) have proven effective in traditional search tasks such as Web search. In legal case retrieval, however, collecting relevance feedback faces two challenges that are difficult to resolve under existing feedback paradigms. First, legal case retrieval is a complex task: users often need to understand the relationship between legal cases in detail to judge their relevance correctly. Traditional feedback signals such as clicks are too coarse, because they do not reflect fine-grained relevance information. Second, legal case documents are usually long, and users often need tens of minutes to read and understand them. Simple behavioral signals such as clicks and eye-tracking fixations are of little use when users click on and examine almost every part of a document. In this paper, we explore the possibility of solving the feedback problem in legal case retrieval with brain signals. Recent advances in brain signal processing have shown that human emotional states can be captured at a fine granularity through Brain-Machine Interfaces (BMI) without interrupting users in their tasks. We therefore propose a framework for legal case retrieval that uses EEG signals to optimize retrieval results. We collect and construct a legal case retrieval dataset with users' EEG signals and propose several methods to extract effective EEG features for relevance feedback. Our proposed features achieve 71% accuracy for feedback prediction with an SVM-RFE model, and our proposed ranking method, which takes the diverse needs of users into account, significantly improves user satisfaction in legal case retrieval. Experimental results show that the re-ranked result lists leave users more satisfied.
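The abstract names SVM-RFE (recursive feature elimination driven by a linear SVM) as the model used for feedback prediction. The general idea can be sketched as follows: train a linear SVM, rank features by their squared weights, drop the weakest one, and repeat. This is only an illustrative sketch, not the authors' pipeline: it uses a small pure-Python hinge-loss trainer as a stand-in for a full SVM solver, and the toy data, the function names (`train_linear_svm`, `svm_rfe`, `make_toy_data`), and all hyperparameters are assumptions made for the example rather than details from the paper.

```python
import random


def train_linear_svm(X, y, lr=0.1, lam=0.01, epochs=20):
    """Fit a bias-free linear SVM by sub-gradient descent on the hinge loss.

    Hypothetical stand-in for a real SVM solver; labels must be +1 or -1.
    """
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * sum(wj * xj for wj, xj in zip(w, xi))
            # L2 regularisation: shrink every weight on every step.
            w = [wj * (1.0 - lr * lam) for wj in w]
            if margin < 1.0:
                # Hinge loss is active, so move the weights toward the sample.
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
    return w


def svm_rfe(X, y, n_keep):
    """Recursively drop the feature with the smallest squared SVM weight."""
    active = list(range(len(X[0])))  # original indices of surviving features
    while len(active) > n_keep:
        X_sub = [[row[j] for j in active] for row in X]
        w = train_linear_svm(X_sub, y)
        worst = min(range(len(active)), key=lambda k: w[k] ** 2)
        active.pop(worst)
    return active


def make_toy_data(n=40, seed=0):
    """Synthetic data (an assumption): features 0 and 3 carry the label,
    the other four features are low-amplitude noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = 1 if i % 2 == 0 else -1
        row = [rng.uniform(-0.3, 0.3) for _ in range(6)]
        for j in (0, 3):
            row[j] = label * rng.uniform(0.8, 1.2)
        X.append(row)
        y.append(label)
    return X, y


X, y = make_toy_data()
selected = svm_rfe(X, y, n_keep=2)
print("features kept:", selected)
```

In this toy setting the two label-carrying columns stand in for informative EEG features and the remaining columns for uninformative ones; with real recordings, the feature matrix would instead hold whatever EEG features the extraction step produces.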
Authors: Ruizhe Zhang, Qingyao Ai, Ziyi Ye, Yueyue Wu, Xiaohui Xie, Yiqun Liu