Search Results - "Teehan, Ryan"
-
1
Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle
Published 12-11-2024“…Many existing evaluation benchmarks for Large Language Models (LLMs) quickly become outdated due to the emergence of new models and training data. These…”
Get full text
Journal Article -
2
ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation
Published 05-08-2024“…In this paper, we propose ProCreate, a simple and easy-to-implement method to improve sample diversity and creativity of diffusion-based image generative…”
Get full text
Journal Article -
3
CoLLEGe: Concept Embedding Generation for Large Language Models
Published 22-03-2024“…Current language models are unable to quickly learn new concepts on the fly, often requiring a more involved finetuning process to learn robustly. Prompting…”
Get full text
Journal Article -
4
Can Language Models Employ the Socratic Method? Experiments with Code Debugging
Published 04-10-2023“…When employing the Socratic method of teaching, instructors guide students toward solving a problem on their own rather than providing the solution directly…”
Get full text
Journal Article -
5
Cut the CARP: Fishing for zero-shot story evaluation
Published 06-10-2021“…Recent advances in large-scale language models (Raffel et al., 2019; Brown et al., 2020) have brought significant qualitative and quantitative improvements in…”
Get full text
Journal Article -
6
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Published 2023“…Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj Language models demonstrate both quantitative improvement and…”
Get full text
Journal Article -
7
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Published 09-11-2022“…Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these…”
Get full text
Journal Article -
8
Multitask Prompted Training Enables Zero-Shot Task Generalization
Published 15-10-2021“…Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been…”
Get full text
Journal Article -
9
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Published 05-12-2021“…Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the…”
Get full text
Journal Article