Visualizing attention zones in machine reading comprehension models (2410.20652v1)
Published 28 Oct 2024 in cs.CL
Abstract: The attention mechanism plays an important role in machine reading comprehension (MRC) models. Here, we describe a pipeline for building an MRC model with a pretrained LLM and visualizing the effect of each attention zone in different layers, which helps explain the model's behavior. With the presented protocol and accompanying code, researchers can easily visualize the relevance of each attention zone in an MRC model. This approach can be generalized to other pretrained LLMs.
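To make the idea of an "attention zone" concrete, here is a minimal sketch. The abstract does not define the zones, so this assumes that the input is a question followed by a passage and that the attention matrix splits into four blocks: question-to-question, question-to-passage, passage-to-question, and passage-to-passage. The function name `zone_attention_mass` and the toy matrix are hypothetical, and the quantity computed (share of attention weight per zone) is only a simple proxy, not the relevance measure used in the protocol.

```python
from typing import Dict, List


def zone_attention_mass(attn: List[List[float]], question_len: int) -> Dict[str, float]:
    """Share of total attention weight that falls into each of four zones.

    attn[i][j] is the weight that token i (query) places on token j (key).
    ASSUMPTION: tokens [0, question_len) are the question, the rest the passage.
    """
    n = len(attn)
    totals = {"Q->Q": 0.0, "Q->P": 0.0, "P->Q": 0.0, "P->P": 0.0}
    for i, row in enumerate(attn):
        src = "Q" if i < question_len else "P"
        for j, weight in enumerate(row):
            dst = "Q" if j < question_len else "P"
            totals[f"{src}->{dst}"] += weight
    # Each row of a softmax attention matrix sums to 1, so dividing by the
    # number of rows makes the four shares sum to 1.
    return {zone: total / n for zone, total in totals.items()}


# Toy single-head attention matrix (hypothetical values):
# 2 question tokens followed by 2 passage tokens, each row sums to 1.
toy_attn = [
    [0.5, 0.5, 0.0, 0.0],
    [0.25, 0.25, 0.25, 0.25],
    [0.0, 0.5, 0.5, 0.0],
    [0.0, 0.0, 0.5, 0.5],
]
shares = zone_attention_mass(toy_attn, question_len=2)
for zone, share in shares.items():
    print(f"{zone}: {share:.3f}")
```

In practice the matrix would come from one head of one layer of the pretrained model, and repeating the computation per layer gives a layer-by-zone grid that can be plotted as a heatmap.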