Search Results - "Tamkin, Alex"
-
1
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
Published 26-10-2023“…Understanding neural networks is challenging in part because of the dense, continuous nature of their hidden states. We explore whether we can train neural…”
Get full text
Journal Article -
2
Operationalising the Definition of General Purpose AI Systems: Assessing Four Approaches
Published 05-06-2023“…The European Union's Artificial Intelligence (AI) Act is set to be a landmark legal instrument for regulating AI technology. While stakeholders have primarily…”
Get full text
Journal Article -
3
Multispectral Contrastive Learning with Viewmaker Networks
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01-06-2023)“…Contrastive learning methods have been applied to a range of domains and modalities by training models to identify similar "views" of data points. However,…”
Get full text
Conference Proceeding -
4
Multispectral Contrastive Learning with Viewmaker Networks
Published 11-02-2023“…Contrastive learning methods have been applied to a range of domains and modalities by training models to identify similar "views" of data points. However,…”
Get full text
Journal Article -
5
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies
Published 24-02-2022“…When we transfer a pretrained language model to a new language, there are many axes of variation that change at once. To disentangle the impact of different…”
Get full text
Journal Article -
6
Eliciting Human Preferences with Language Models
Published 17-10-2023“…Language models (LMs) can be directed to perform target tasks by using labeled examples or natural language prompts. But selecting examples or writing prompts…”
Get full text
Journal Article -
7
Drone.io: A Gestural and Visual Interface for Human-Drone Interaction
Published in 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI) (01-03-2019)“…Drones are becoming ubiquitous and offer support to people in various tasks, such as photography, in increasingly interactive social contexts. We introduce…”
Get full text
Conference Proceeding -
8
Task Ambiguity in Humans and Language Models
Published 20-12-2022“…Language models have recently achieved strong performance across a wide range of NLP benchmarks. However, unlike benchmarks, real world tasks are often poorly…”
Get full text
Journal Article -
9
Feature Dropout: Revisiting the Role of Augmentations in Contrastive Learning
Published 16-12-2022“…What role do augmentations play in contrastive learning? Recent work suggests that good augmentations are label-preserving with respect to a specific…”
Get full text
Journal Article -
10
Collective Constitutional AI: Aligning a Language Model with Public Input
Published 12-06-2024“…Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency. 1395-1417 There is growing consensus that language model (LM) developers…”
Get full text
Journal Article -
11
Language Through a Prism: A Spectral Approach for Multiscale Language Representations
Published 09-11-2020“…Language exhibits structure at different scales, ranging from subwords to words, sentences, paragraphs, and documents. To what extent do deep models capture…”
Get full text
Journal Article -
12
Bayesian Preference Elicitation with Language Models
Published 08-03-2024“…Aligning AI systems to users' interests requires understanding and incorporating humans' complex values and preferences. Recently, language models (LMs) have…”
Get full text
Journal Article -
13
Viewmaker Networks: Learning Views for Unsupervised Representation Learning
Published 14-10-2020“…Many recent methods for unsupervised representation learning train models to be invariant to different "views," or distorted versions of an input. However,…”
Get full text
Journal Article -
14
Active Learning Helps Pretrained Models Learn the Intended Task
Published 18-04-2022“…Models can fail in unpredictable ways during deployment due to task ambiguity, when multiple behaviors are consistent with the provided training data. An…”
Get full text
Journal Article -
15
Tradeoffs Between Contrastive and Supervised Learning: An Empirical Study
Published 10-12-2021“…Contrastive learning has made considerable progress in computer vision, outperforming supervised pretraining on a range of downstream datasets. However, is…”
Get full text
Journal Article -
16
C5T5: Controllable Generation of Organic Molecules with Transformers
Published 23-08-2021“…Methods for designing organic materials with desired properties have high potential impact across fields such as medicine, renewable energy, petrochemical…”
Get full text
Journal Article -
17
Evaluating and Mitigating Discrimination in Language Model Decisions
Published 06-12-2023“…As language models (LMs) advance, interest is growing in applying them to high-stakes societal decisions, such as determining financing or housing eligibility…”
Get full text
Journal Article -
18
Social Contract AI: Aligning AI Assistants with Implicit Group Norms
Published 26-10-2023“…We explore the idea of aligning an AI assistant by inverting a model of users' (unknown) preferences from observed interactions. To validate our proposal, we…”
Get full text
Journal Article -
19
Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models
Published 04-02-2021“…On October 14th, 2020, researchers from OpenAI, the Stanford Institute for Human-Centered Artificial Intelligence, and other universities convened to discuss…”
Get full text
Journal Article -
20
Investigating Transferability in Pretrained Language Models
Published 30-04-2020“…How does language model pretraining help transfer learning? We consider a simple ablation technique for determining the impact of each pretrained layer on…”
Get full text
Journal Article