2000 character limit reached
Federated pretraining and fine tuning of BERT using clinical notes from multiple silos (2002.08562v1)
Published 20 Feb 2020 in cs.CL and cs.LG
Abstract: Large scale contextual representation models, such as BERT, have significantly advanced NLP in recently years. However, in certain area like healthcare, accessing diverse large scale text data from multiple institutions is extremely challenging due to privacy and regulatory reasons. In this article, we show that it is possible to both pretrain and fine tune BERT models in a federated manner using clinical texts from different silos without moving the data.
Sponsored by Paperpile, the PDF & BibTeX manager trusted by top AI labs.
Get 30 days freePaper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.