A Collection of Swedish Diachronic Word Embedding Models Trained on Historical Newspaper Data

Bibliographic Details
Published in: Journal of Open Humanities Data, Vol. 7, no. 2, p. 1
Main Authors: Hengchen, Simon; Tahmasebi, Nina
Format: Journal Article
Language: English
Published: Ubiquity Press 27-01-2021
Description
Summary: This paper describes the creation of several word embedding models based on a large collection of diachronic Swedish newspaper material available through Språkbanken Text, the Swedish language bank. This data was produced in the context of Språkbanken Text’s continued mission to collaborate with humanities and natural language processing (NLP) researchers and to provide freely available language resources for the development of state-of-the-art NLP methods and tools.
ISSN: 2059-481X
DOI: 10.5334/johd.22