Search Results - "Heinzerling, Benjamin"
-
1
Examining the effect of whitening on static and contextualized word embeddings
Published in Information processing & management (01-05-2023)“…Static word embeddings (SWE) and contextualized word embeddings (CWE) are the foundation of modern natural language processing. However, these embeddings…”
Get full text
Journal Article -
2
Monotonic Representation of Numeric Properties in Language Models
Published 15-03-2024“…Language models (LMs) can express factual knowledge involving numeric properties such as Karl Popper was born in 1902. However, how this information is encoded…”
Get full text
Journal Article -
3
Representational Analysis of Binding in Language Models
Published 09-09-2024“…Entity tracking is essential for complex reasoning. To perform in-context entity tracking, language models (LMs) must bind an entity to its attribute (e.g.,…”
Get full text
Journal Article -
4
Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries
Published 20-08-2020“…Pretrained language models have been suggested as a possible alternative or complement to structured knowledge bases. However, this emerging LM-as-KB paradigm…”
Get full text
Journal Article -
5
Cross-stitching Text and Knowledge Graph Encoders for Distantly Supervised Relation Extraction
Published 02-11-2022“…Bi-encoder architectures for distantly-supervised relation extraction are designed to make use of the complementary information found in text and knowledge…”
Get full text
Journal Article -
6
The Curse of Popularity: Popular Entities have Catastrophic Side Effects when Deleting Knowledge from Language Models
Published 10-06-2024“…Language models (LMs) encode world knowledge in their internal parameters through training. However, LMs may learn personal and confidential information from…”
Get full text
Journal Article -
7
ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
Published 08-05-2024“…Evaluating the quality of free-text explanations is a multifaceted, subjective, and labor-intensive task. Large language models (LLMs) present an appealing…”
Get full text
Journal Article -
8
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Published 04-06-2019“…Pretrained contextual and non-contextual subword embeddings have become available in over 250 languages, allowing massively multilingual NLP. However, while…”
Get full text
Journal Article -
9
Test-time Augmentation for Factual Probing
Published 25-10-2023“…Factual probing is a method that uses prompts to test if a language model "knows" certain world knowledge facts. A problem in factual probing is that small…”
Get full text
Journal Article -
10
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces
Published 16-10-2024“…This paper investigates whether large language models (LLMs) utilize numerical attributes encoded in a low-dimensional subspace of the embedding space when…”
Get full text
Journal Article -
11
Tracing and Manipulating Intermediate Values in Neural Math Problem Solvers
Published 17-01-2023“…How language models process complex input that requires multiple steps of inference is not well understood. Previous research has shown that information about…”
Get full text
Journal Article -
12
BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages
Published 05-10-2017“…We present BPEmb, a collection of pre-trained subword unit embeddings in 275 languages, based on Byte-Pair Encoding (BPE). In an evaluation using fine-grained…”
Get full text
Journal Article -
13
COPA-SSE: Semi-structured Explanations for Commonsense Reasoning
Published 18-01-2022“…We present Semi-Structured Explanations for COPA (COPA-SSE), a new crowdsourced dataset of 9,747 semi-structured, English common sense explanations for Choice…”
Get full text
Journal Article -
14
Learning to Learn to be Right for the Right Reasons
Published 23-04-2021“…Improving model generalization on held-out data is one of the core objectives in commonsense reasoning. Recent work has shown that models trained on the…”
Get full text
Journal Article -
15
Fine-Grained Entity Typing in Hyperbolic Space
Published 06-06-2019“…How can we represent hierarchical information present in large type inventories for entity typing? We study the ability of hyperbolic embeddings to capture…”
Get full text
Journal Article -
16
Riposte! A Large Corpus of Counter-Arguments
Published 08-10-2019“…Constructive feedback is an effective method for improving critical thinking skills. Counter-arguments (CAs), one form of constructive feedback, have been…”
Get full text
Journal Article -
17
Revisiting Selectional Preferences for Coreference Resolution
Published 20-07-2017“…Selectional preferences have long been claimed to be essential for coreference resolution. However, they are mainly modeled only implicitly by current…”
Get full text
Journal Article -
18
When Choosing Plausible Alternatives, Clever Hans can be Clever
Published 01-11-2019“…Pretrained language models, such as BERT and RoBERTa, have shown large improvements in the commonsense reasoning benchmark COPA. However, recent work found…”
Get full text
Journal Article -
19
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages
Published 26-09-2019“…Recent work has validated the importance of subword information for word representation learning. Since subwords increase parameter sharing ability in neural…”
Get full text
Journal Article