2000 character limit reached
SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph (2009.08801v1)
Published 16 Sep 2020 in cs.AI, cs.CL, cs.LG, and stat.ML
Abstract: As a novel contribution to the problem of semantifying biological assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions. Experimental evaluations, to this end, show promise as the neural-based semantification significantly outperforms a naive frequency-based baseline approach. Specifically, the neural method attains 72% F1 versus 47% F1 from the frequency-based method.