Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation (1805.04617v1)

Published 11 May 2018 in cs.CL

Abstract: The field of NLP is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. In order to learn this dynamic field or stay up-to-date on the latest research, students as well as educators and researchers must constantly sift through multiple sources to find valuable, relevant information. To address this situation, we introduce TutorialBank, a new, publicly available dataset which aims to facilitate NLP education and research. We have manually collected and categorized over 6,300 resources on NLP as well as the related fields of AI, Machine Learning (ML) and Information Retrieval (IR). Our dataset is notably the largest manually-picked corpus of resources intended for NLP education which does not include only academic papers. Additionally, we have created both a search engine and a command-line tool for the resources and have annotated the corpus to include lists of research topics, relevant resources for each topic, prerequisite relations among topics, relevant sub-parts of individual resources, among other annotations. We are releasing the dataset and present several avenues for further research.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Alexander R. Fabbri (34 papers)
  2. Irene Li (47 papers)
  3. Prawat Trairatvorakul (1 paper)
  4. Yijiao He (1 paper)
  5. Wei Tai Ting (1 paper)
  6. Robert Tung (2 papers)
  7. Caitlin Westerfield (2 papers)
  8. Dragomir R. Radev (14 papers)
Citations (32)

Summary

We haven't generated a summary for this paper yet.