2000 character limit reached
K-12BERT: BERT for K-12 education (2205.12335v1)
Published 24 May 2022 in cs.CL and cs.LG
Abstract: Online education platforms are powered by various NLP pipelines, which utilize models like BERT to aid in content curation. Since the inception of the pre-trained LLMs like BERT, there have also been many efforts toward adapting these pre-trained models to specific domains. However, there has not been a model specifically adapted for the education domain (particularly K-12) across subjects to the best of our knowledge. In this work, we propose to train a LLM on a corpus of data curated by us across multiple subjects from various sources for K-12 education. We also evaluate our model, K12-BERT, on downstream tasks like hierarchical taxonomy tagging.
- Vasu Goel (3 papers)
- Dhruv Sahnan (8 papers)
- Venktesh V (23 papers)
- Gaurav Sharma (51 papers)
- Deep Dwivedi (2 papers)
- Mukesh Mohania (13 papers)