Search Results - "Webson, Albert"
-
1
Interactive and Visual Prompt Engineering for Ad-hoc Task Adaptation with Large Language Models
Published in IEEE transactions on visualization and computer graphics (01-01-2023)“…State-of-the-art neural language models can now be used to solve ad-hoc language tasks through zero-shot prompting without the need for supervised training…”
Get full text
Journal Article -
2
Do Prompt-Based Models Really Understand the Meaning of their Prompts?
Published 02-09-2021“…Recently, a boom of papers has shown extraordinary progress in zero-shot and few-shot learning with various prompt-based models. It is commonly argued that…”
Get full text
Journal Article -
3
In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax
Published 13-11-2023“…In-context learning (ICL) is now a common method for teaching large language models (LLMs) new tasks: given labeled examples in the input context, the LLM…”
Get full text
Journal Article -
4
Are Language Models Worse than Humans at Following Prompts? It's Complicated
Published 17-01-2023“…Prompts have been the center of progress in advancing language models' zero-shot and few-shot performance. However, recent work finds that models can perform…”
Get full text
Journal Article -
5
Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs
Published 14-03-2023“…Training data attribution (TDA) methods offer to trace a model's prediction on any given example back to specific influential training examples. Existing…”
Get full text
Journal Article -
6
Interactive and Visual Prompt Engineering for Ad-hoc Task Adaptation with Large Language Models
Published 16-08-2022“…State-of-the-art neural language models can now be used to solve ad-hoc language tasks through zero-shot prompting without the need for supervised training…”
Get full text
Journal Article -
7
Are "Undocumented Workers" the Same as "Illegal Aliens"? Disentangling Denotation and Connotation in Vector Spaces
Published 06-10-2020“…In politics, neologisms are frequently invented for partisan objectives. For example, "undocumented workers" and "illegal aliens" refer to the same group of…”
Get full text
Journal Article -
8
Larger language models do in-context learning differently
Published 07-03-2023“…We study how in-context learning (ICL) in language models is affected by semantic priors versus input-label mappings. We investigate two setups-ICL with…”
Get full text
Journal Article -
9
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Published 31-01-2023“…We study the design decisions of publicly available instruction tuning methods, and break down the development of Flan 2022 (Chung et al., 2022). Through…”
Get full text
Journal Article -
10
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models
Published 24-05-2023“…Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnable parameters to Large Language Models (LLMs) without…”
Get full text
Journal Article -
11
Evaluating Frontier Models for Dangerous Capabilities
Published 20-03-2024“…To understand the risks posed by a new AI system, we must understand what it can and cannot do. Building on prior work, we introduce a programme of new…”
Get full text
Journal Article -
12
Towards Conversational Diagnostic AI
Published 10-01-2024“…At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and…”
Get full text
Journal Article -
13
Capabilities of Gemini Models in Medicine
Published 29-04-2024“…Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge…”
Get full text
Journal Article -
14
Crosslingual Generalization through Multitask Finetuning
Published 03-11-2022“…Multitask prompted finetuning (MTF) has been shown to help large language models generalize to new tasks in a zero-shot setting, but so far explorations of MTF…”
Get full text
Journal Article -
15
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Published 08-03-2024“…In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of…”
Get full text
Journal Article -
16
Scaling Instruction-Finetuned Language Models
Published 20-10-2022“…Finetuning language models on a collection of datasets phrased as instructions has been shown to improve model performance and generalization to unseen tasks…”
Get full text
Journal Article -
17
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Published 02-02-2022“…PromptSource is a system for creating, sharing, and using natural language prompts. Prompts are functions that map an example from a dataset to a natural…”
Get full text
Journal Article -
18
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Published 09-11-2022“…Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these…”
Get full text
Journal Article -
19
Multitask Prompted Training Enables Zero-Shot Task Generalization
Published 15-10-2021“…Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been…”
Get full text
Journal Article