Search Results - "Lhoest, Quentin"
-
1
Training Transformers Together
Published 07-07-2022“…The infrastructure necessary for training state-of-the-art models is becoming overly expensive, which makes training such models affordable only to large…”
Get full text
Journal Article -
2
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages
Published 22-03-2023“…The advancement of speech technologies has been remarkable, yet its integration with African languages remains limited due to the scarcity of African speech…”
Get full text
Journal Article -
3
Croissant: A Metadata Format for ML-Ready Datasets
Published 28-03-2024“…Data is a critical resource for Machine Learning (ML), yet working with data remains a key friction point. This paper introduces Croissant, a metadata format…”
Get full text
Journal Article -
4
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Published 30-09-2022“…Evaluation is a key part of machine learning (ML), yet there is a lack of support and tooling to enable its informed and systematic practice. We introduce…”
Get full text
Journal Article -
5
Distributed Deep Learning in Open Collaborations
Published 18-06-2021“…Modern deep learning applications require increasingly more compute to train state-of-the-art models. To address this demand, large corporations and…”
Get full text
Journal Article -
6
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Published 07-03-2023“…As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The…”
Get full text
Journal Article -
7
Datasets: A Community Library for Natural Language Processing
Published 06-09-2021“…The scale, variety, and quantity of publicly-available NLP datasets has grown rapidly as researchers propose new tasks, larger models, and novel benchmarks…”
Get full text
Journal Article -
8
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Published 08-10-2019“…Recent progress in natural language processing has been driven by advances in both model architecture and model pretraining. Transformer architectures have…”
Get full text
Journal Article