Improving Adverse Drug Event Extraction with SpanBERT on Different Text Typologies

In recent years, Internet users are reporting Adverse Drug Events (ADE) on social media, blogs and health forums. Because of the large volume of reports, pharmacovigilance is seeking to resort to NLP to monitor these outlets. We propose for the first time the use of the SpanBERT architecture for the...

Full description

Saved in:
Bibliographic Details
Main Authors: Portelli, Beatrice, Passabì, Daniele, Lenzi, Edoardo, Serra, Giuseppe, Santus, Enrico, Chersoni, Emmanuele
Format: Journal Article
Language:English
Published: 18-05-2021
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In recent years, Internet users are reporting Adverse Drug Events (ADE) on social media, blogs and health forums. Because of the large volume of reports, pharmacovigilance is seeking to resort to NLP to monitor these outlets. We propose for the first time the use of the SpanBERT architecture for the task of ADE extraction: this new version of the popular BERT transformer showed improved capabilities with multi-token text spans. We validate our hypothesis with experiments on two datasets (SMM4H and CADEC) with different text typologies (tweets and blog posts), finding that SpanBERT combined with a CRF outperforms all the competitors on both of them.
DOI:10.48550/arxiv.2105.08882