2000 character limit reached
Combining Counting Processes and Classification Improves a Stopping Rule for Technology Assisted Review (2312.03171v1)
Published 5 Dec 2023 in cs.IR and cs.CL
Abstract: Technology Assisted Review (TAR) stopping rules aim to reduce the cost of manually assessing documents for relevance by minimising the number of documents that need to be examined to ensure a desired level of recall. This paper extends an effective stopping rule using information derived from a text classifier that can be trained without the need for any additional annotation. Experiments on multiple data sets (CLEF e-Health, TREC Total Recall, TREC Legal and RCV1) showed that the proposed approach consistently improves performance and outperforms several alternative methods.
- Reem Bin-Hezam (4 papers)
- Mark Stevenson (30 papers)