Search Results - "Rueter, Jack"
-
1
On searchable Mordvin corpora at the Language Bank of Finland, EMERALD
Published in Journal of data mining and digital humanities (29-04-2024)“…Description of Mordvin language corpora development at the Language Bank of Finland. Description of development…”
Journal Article -
2
The Livonian-Estonian-Latvian Dictionary as a threshold to the era of language technological applications
Published in Eesti ja soome-ugri keeleteaduse ajakiri (01-07-2014)“…This article outlines the multiple use of electronic source materials from the Livonian-Estonian-Latvian Dictionary of 2012 in a “Kone Foundation” funded…”
Journal Article -
3
Establishing a Role for Minority Source Language in Multilingual Facilitation
Published in Nordlyd (Tromsø, Norway) (30-08-2022)“…This document is dedicated to a young man, who, despite the number of times he has traveled around the Sun, is always open to new thoughts on ways to include…”
Journal Article -
4
Documentación de lenguas amenazadas en la época digital
Published in Linha d'água (01-09-2021)“…We present our infrastructure for the documentation of Uralic languages, which consists of tools for compiling dictionaries in such a way that the…”
Journal Article -
5
Old Permic Universal Dependencies Treebank
Published in Journal of data mining and digital humanities (01-06-2024)“…Old Permic, also known as Old Komi, is an extinct variety of Komi that was spoken in the late Middle Ages in the lower Vychegda river basin in Northeastern…”
Journal Article -
6
Towards an Old Permic Universal Dependencies Treebank
Published in Journal of data mining and digital humanities (2024)“…Old Permic, also known as Old Komi, is an extinct variety of Komi that was spoken in the late Middle Ages in the lower Vychegda river basin in Northeastern…”
Journal Article -
7
Analyzing Pokémon and Mario Streamers' Twitch Chat with LLM-based User Embeddings
Published 16-11-2024“…We present a novel digital humanities method for representing our Twitch chatters as user embeddings created by a large language model (LLM). We cluster these…”
Journal Article -
8
Leveraging Transformer-Based Models for Predicting Inflection Classes of Words in an Endangered Sami Language
Published 04-11-2024“…This paper presents a methodology for training a transformer-based model to classify lexical and morphosyntactic features of Skolt Sami, an endangered Uralic…”
Journal Article -
9
FST Morphology for the Endangered Skolt Sami Language
Published 09-04-2020“…We present advances in the development of a FST-based morphological analyzer and generator for Skolt Sami. Like other minority Uralic languages, Skolt Sami…”
Journal Article -
10
Sentiment Analysis Using Aligned Word Embeddings for Uralic Languages
Published 24-05-2023“…In this paper, we present an approach for translating word embeddings from a majority language into 4 minority languages: Erzya, Moksha, Udmurt and…”
Journal Article -
11
Processing M.A. Castrén's Materials: Multilingual Typed and Handwritten Manuscripts
Published 28-12-2021“…The study forms a technical report of various tasks that have been performed on the materials collected and published by Finnish ethnographer and linguist,…”
Journal Article -
12
Detecting Depression in Thai Blog Posts: a Dataset and a Baseline
Published 08-11-2021“…We present the first openly available corpus for detecting depression in Thai. Our corpus is compiled by expert verified cases of depression in several online…”
Journal Article -
13
Finnish Dialect Identification: The Effect of Audio and Text
Published 06-11-2021“…Finnish is a language with multiple dialects that not only differ from each other in terms of accent (pronunciation) but also in terms of morphological forms…”
Journal Article -
14
Never guess what I heard... Rumor Detection in Finnish News: a Dataset and a Baseline
Published 07-06-2021“…This study presents a new dataset on rumor detection in Finnish language news headlines. We have evaluated two different LSTM based models and two different…”
Journal Article -
15
Neural Morphology Dataset and Models for Multiple Languages, from the Large to the Endangered
Published 26-05-2021“…We train neural models for morphological analysis, generation and lemmatization for morphologically rich languages. We present a method for automatically…”
Journal Article -
16
Ve'rdd. Narrowing the Gap between Paper Dictionaries, Low-Resource NLP and Community Involvement
Published 04-12-2020“…We present an open-source online dictionary editing system, Ve'rdd, that offers a chance to re-evaluate and edit grassroots dictionaries that have been exposed…”
Journal Article -
17
Automated Prediction of Medieval Arabic Diacritics
Published 11-10-2020“…This study uses a character level neural machine translation approach trained on a long short-term memory-based bi-directional recurrent neural network…”
Journal Article -
18
Automatic Dialect Adaptation in Finnish and its Effect on Perceived Creativity
Published 06-09-2020“…We present a novel approach for adapting text written in standard Finnish to different dialects. We experiment with character level NMT models both by using a…”
Journal Article -
19
Apurinã Universal Dependencies Treebank
Published 07-06-2021“…This paper presents and discusses the first Universal Dependencies treebank for the Apurinã language. The treebank contains 76 fully annotated sentences,…”
Journal Article