Search Results - "Jumelet, Jaap"
-
1
Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations
Published in Transactions of the Association for Computational Linguistics (19-09-2022)“…We investigate the extent to which modern neural language models are susceptible to structural priming, the phenomenon whereby the structure of a sentence…”
Get full text
Journal Article -
2
diagNNose: A Library for Neural Activation Analysis
Published 13-11-2020“…In this paper we introduce diagNNose, an open source library for analysing the activations of deep neural networks. diagNNose contains a wide array of…”
Get full text
Journal Article -
3
Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution
Published 23-10-2023“…We present a setup for training, evaluating and interpreting neural language models, that uses artificial, language-like data. The data is generated using a…”
Get full text
Journal Article -
4
Feature Interactions Reveal Linguistic Structure in Language Models
Published 21-06-2023“…We study feature interactions in the context of feature attribution methods for post-hoc interpretability. In interpretability research, getting to grips with…”
Get full text
Journal Article -
5
Do Language Models Exhibit Human-like Structural Priming Effects?
Published 07-06-2024“…We explore which linguistic factors -- at the sentence and token level -- play an important role in influencing language model predictions, and investigate…”
Get full text
Journal Article -
6
Black Big Boxes: Do Language Models Hide a Theory of Adjective Order?
Published 02-07-2024“…In English and other languages, multiple adjectives in a complex noun phrase show intricate ordering patterns that have been a target of much linguistic…”
Get full text
Journal Article -
7
Interpretability of Language Models via Task Spaces
Published 10-06-2024“…The usual way to interpret language models (LMs) is to test their performance on different benchmarks and subsequently infer their internal processes. In this…”
Get full text
Journal Article -
8
Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue
Published 21-11-2023“…Language models are often used as the backbone of modern dialogue systems. These models are pre-trained on large amounts of written fluent language. Repetition…”
Get full text
Journal Article -
9
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
Published 05-10-2023“…In recent years, many interpretability methods have been proposed to help interpret the internal states of Transformer-models, at different levels of precision…”
Get full text
Journal Article -
10
Curriculum Learning with Adam: The Devil Is in the Wrong Details
Published 23-08-2023“…Curriculum learning (CL) posits that machine learning models -- similar to humans -- may learn more efficiently from data that match their current learning…”
Get full text
Journal Article -
11
Do Language Models Understand Anything? On the Ability of LSTMs to Understand Negative Polarity Items
Published 31-08-2018“…In this paper, we attempt to link the inner workings of a neural language model to linguistic theory, focusing on a complex phenomenon well discussed in formal…”
Get full text
Journal Article -
12
The Birth of Bias: A case study on the evolution of gender bias in an English language model
Published 20-07-2022“…Detecting and mitigating harmful biases in modern language models are widely recognized as crucial, open problems. In this paper, we take a step back and…”
Get full text
Journal Article -
13
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation
Published 17-10-2023“…We present the submission of the ILLC at the University of Amsterdam to the BabyLM challenge (Warstadt et al., 2023), in the strict-small track. Our final…”
Get full text
Journal Article -
14
Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
Published 24-05-2024“…This paper introduces Filtered Corpus Training, a method that trains language models (LMs) on corpora with certain linguistic constructions filtered out from…”
Get full text
Journal Article -
15
Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations
Published 30-09-2021“…We investigate the extent to which modern, neural language models are susceptible to structural priming, the phenomenon whereby the structure of a sentence…”
Get full text
Journal Article -
16
Analysing Neural Language Models: Contextual Decomposition Reveals Default Reasoning in Number and Gender Assignment
Published 19-09-2019“…Extensive research has recently shown that recurrent neural language models are able to process a wide range of grammatical phenomena. How these models are…”
Get full text
Journal Article -
17
Attention vs non-attention for a Shapley-based explanation method
Published 26-04-2021“…The field of explainable AI has recently seen an explosion in the number of explanation methods for highly non-linear deep neural networks. The extent to which…”
Get full text
Journal Article -
18
Language Modelling as a Multi-Task Problem
Published 27-01-2021“…In this paper, we propose to study language modelling as a multi-task problem, bringing together three strands of research: multi-task learning, linguistics,…”
Get full text
Journal Article -
19
Language Models Use Monotonicity to Assess NPI Licensing
Published 28-05-2021“…We investigate the semantic knowledge of language models (LMs), focusing on (1) whether these LMs create categories of linguistic environments based on their…”
Get full text
Journal Article