A comparative evaluation and analysis of three generations of Distributional Semantic Models

Bibliographic Details
Published in: Language Resources and Evaluation, Vol. 56, No. 4, pp. 1269-1313
Main Authors: Lenci, Alessandro; Sahlgren, Magnus; Jeuniaux, Patrick; Cuba Gyllensten, Amaru; Miliani, Martina
Format: Journal Article
Language: English
Published: Dordrecht: Springer Netherlands (Springer Nature B.V.), 01-12-2022
Description
Summary: Distributional semantics has changed deeply over the last few decades. First, predict models stole the thunder from traditional count models, and more recently both have been replaced in many NLP applications by the contextualized vectors produced by neural language models. Although an extensive body of research has been devoted to Distributional Semantic Model (DSM) evaluation, we still lack a thorough comparison with respect to tested models, semantic tasks, and benchmark datasets. Moreover, previous work has mostly focused on task-driven evaluation rather than on exploring the differences in how models represent the lexical semantic space. In this paper, we perform a large-scale evaluation of type distributional vectors, either produced by static DSMs or obtained by averaging the contextualized vectors generated by BERT. First, we investigate the performance of embeddings on several semantic tasks, carrying out an in-depth statistical analysis to identify the major factors influencing the behavior of DSMs. The results show that (i) the alleged superiority of predict-based models is more apparent than real, and certainly not ubiquitous, and (ii) static DSMs surpass BERT representations in most out-of-context semantic tasks and datasets. Furthermore, we borrow from cognitive neuroscience the methodology of Representational Similarity Analysis (RSA) to inspect the semantic spaces generated by distributional models. RSA reveals important differences related to the frequency and part of speech of lexical items.
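
The summary mentions two methodological ingredients: type-level vectors obtained by averaging BERT's contextualized vectors for a word across sentences, and Representational Similarity Analysis (RSA) for comparing semantic spaces. The sketch below is a minimal illustration of both, not the authors' pipeline: it assumes the Hugging Face transformers library with bert-base-uncased, and the example sentences and the random stand-in for a static DSM space are placeholder assumptions.

# Minimal sketch (not the paper's exact setup): (1) build a type-level vector
# for a word by averaging its contextualized BERT vectors over sentences, and
# (2) compare two semantic spaces with RSA, i.e. correlate their pairwise
# similarity (here, cosine-distance) structures.
import numpy as np
import torch
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
model.eval()

def type_vector(word, sentences):
    """Average the contextualized vectors of `word` over `sentences`."""
    vectors = []
    target_ids = tokenizer(word, add_special_tokens=False)["input_ids"]
    for sentence in sentences:
        enc = tokenizer(sentence, return_tensors="pt")
        ids = enc["input_ids"][0].tolist()
        # Locate the target word's subtoken span in the tokenized sentence.
        for i in range(len(ids) - len(target_ids) + 1):
            if ids[i:i + len(target_ids)] == target_ids:
                with torch.no_grad():
                    hidden = model(**enc).last_hidden_state[0]
                # Average over subtokens to get one token-in-context vector.
                vectors.append(hidden[i:i + len(target_ids)].mean(dim=0))
                break
    return torch.stack(vectors).mean(dim=0).numpy()

# Toy vocabulary and contexts (illustrative, not the paper's data).
contexts = {
    "bank":  ["She sat by the river bank.", "He deposited cash at the bank."],
    "money": ["Money was tight that year.", "They saved money for a trip."],
    "river": ["The river flooded the valley.", "A boat drifted down the river."],
    "water": ["The water was freezing cold.", "She poured water into the glass."],
}
bert_space = np.stack([type_vector(w, s) for w, s in contexts.items()])

# A second space over the same words; random here, standing in for a static DSM.
static_space = np.random.default_rng(0).normal(size=(len(contexts), 300))

# RSA: Spearman correlation between the two spaces' pairwise cosine distances.
rho, p = spearmanr(pdist(bert_space, "cosine"), pdist(static_space, "cosine"))
print(f"RSA (Spearman rho) between spaces: {rho:.3f} (p={p:.3f})")

With a real static DSM (e.g. word2vec or GloVe vectors) in place of the random matrix, the same correlation measures how similarly the two models organize the lexical semantic space for that vocabulary.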
ISSN: 1574-020X
EISSN: 1574-0218
DOI: 10.1007/s10579-021-09575-z