Search Results - "Lhoest, Quentin" :: Katalog Arama

1
Training Transformers Together by Borzunov, Alexander, Ryabinin, Max, Dettmers, Tim, Lhoest, Quentin, Saulnier, Lucile, Diskin, Michael, Jernite, Yacine, Wolf, Thomas

Published 07-07-2022
“…The infrastructure necessary for training state-of-the-art models is becoming overly expensive, which makes training such models affordable only to large…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages by Emezue, Chris Chinenye, Gandhi, Sanchit, Tunstall, Lewis, Abid, Abubakar, Meyer, Josh, Lhoest, Quentin, Allen, Pete, Von Platen, Patrick, Kiela, Douwe, Jernite, Yacine, Chaumond, Julien, Noyan, Merve, Sanseviero, Omar

Published 22-03-2023
“…The advancement of speech technologies has been remarkable, yet its integration with African languages remains limited due to the scarcity of African speech…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Croissant: A Metadata Format for ML-Ready Datasets by Akhtar, Mubashara, Benjelloun, Omar, Conforti, Costanza, Gijsbers, Pieter, Giner-Miguelez, Joan, Jain, Nitisha, Kuchnik, Michael, Lhoest, Quentin, Marcenac, Pierre, Maskey, Manil, Mattson, Peter, Oala, Luis, Ruyssen, Pierre, Shinde, Rajat, Simperl, Elena, Thomas, Goeffry, Tykhonov, Slava, Vanschoren, Joaquin, van der Velde, Jos, Vogler, Steffen, Wu, Carole-Jean

Published 28-03-2024
“…Data is a critical resource for Machine Learning (ML), yet working with data remains a key friction point. This paper introduces Croissant, a metadata format…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements by von Werra, Leandro, Tunstall, Lewis, Thakur, Abhishek, Luccioni, Alexandra Sasha, Thrush, Tristan, Piktus, Aleksandra, Marty, Felix, Rajani, Nazneen, Mustar, Victor, Ngo, Helen, Sanseviero, Omar, Šaško, Mario, Villanova, Albert, Lhoest, Quentin, Chaumond, Julien, Mitchell, Margaret, Rush, Alexander M, Wolf, Thomas, Kiela, Douwe

Published 30-09-2022
“…Evaluation is a key part of machine learning (ML), yet there is a lack of support and tooling to enable its informed and systematic practice. We introduce…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Distributed Deep Learning in Open Collaborations by Diskin, Michael, Bukhtiyarov, Alexey, Ryabinin, Max, Saulnier, Lucile, Lhoest, Quentin, Sinitsin, Anton, Popov, Dmitry, Pyrkin, Dmitry, Kashirin, Maxim, Borzunov, Alexander, del Moral, Albert Villanova, Mazur, Denis, Kobelev, Ilia, Jernite, Yacine, Wolf, Thomas, Pekhimenko, Gennady

Published 18-06-2021
“…Modern deep learning applications require increasingly more compute to train state-of-the-art models. To address this demand, large corporations and…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset by Laurençon, Hugo, Saulnier, Lucile, Wang, Thomas, Akiki, Christopher, del Moral, Albert Villanova, Scao, Teven Le, Von Werra, Leandro, Mou, Chenghao, Ponferrada, Eduardo González, Nguyen, Huu, Frohberg, Jörg, Šaško, Mario, Lhoest, Quentin, McMillan-Major, Angelina, Dupont, Gerard, Biderman, Stella, Rogers, Anna, allal, Loubna Ben, De Toni, Francesco, Pistilli, Giada, Nguyen, Olivier, Nikpoor, Somaieh, Masoud, Maraim, Colombo, Pierre, de la Rosa, Javier, Villegas, Paulo, Thrush, Tristan, Longpre, Shayne, Nagel, Sebastian, Weber, Leon, Muñoz, Manuel, Zhu, Jian, Van Strien, Daniel, Alyafeai, Zaid, Almubarak, Khalid, Vu, Minh Chien, Gonzalez-Dios, Itziar, Soroa, Aitor, Lo, Kyle, Dey, Manan, Suarez, Pedro Ortiz, Gokaslan, Aaron, Bose, Shamik, Adelani, David, Phan, Long, Tran, Hieu, Yu, Ian, Pai, Suhas, Chim, Jenny, Lepercq, Violette, Ilic, Suzana, Mitchell, Margaret, Luccioni, Sasha Alexandra, Jernite, Yacine

Published 07-03-2023
“…As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
Datasets: A Community Library for Natural Language Processing by Lhoest, Quentin, del Moral, Albert Villanova, Jernite, Yacine, Thakur, Abhishek, von Platen, Patrick, Patil, Suraj, Chaumond, Julien, Drame, Mariama, Plu, Julien, Tunstall, Lewis, Davison, Joe, Šaško, Mario, Chhablani, Gunjan, Malik, Bhavitvya, Brandeis, Simon, Scao, Teven Le, Sanh, Victor, Xu, Canwen, Patry, Nicolas, McMillan-Major, Angelina, Schmid, Philipp, Gugger, Sylvain, Delangue, Clément, Matussière, Théo, Debut, Lysandre, Bekman, Stas, Cistac, Pierric, Goehringer, Thibault, Mustar, Victor, Lagunas, François, Rush, Alexander M, Wolf, Thomas

Published 06-09-2021
“…The scale, variety, and quantity of publicly-available NLP datasets has grown rapidly as researchers propose new tasks, larger models, and novel benchmarks…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
HuggingFace's Transformers: State-of-the-art Natural Language Processing by Wolf, Thomas, Debut, Lysandre, Sanh, Victor, Chaumond, Julien, Delangue, Clement, Moi, Anthony, Cistac, Pierric, Rault, Tim, Louf, Rémi, Funtowicz, Morgan, Davison, Joe, Shleifer, Sam, von Platen, Patrick, Ma, Clara, Jernite, Yacine, Plu, Julien, Xu, Canwen, Scao, Teven Le, Gugger, Sylvain, Drame, Mariama, Lhoest, Quentin, Rush, Alexander M

Published 08-10-2019
“…Recent progress in natural language processing has been driven by advances in both model architecture and model pretraining. Transformer architectures have…”

Get full text

Journal Article
QR Code
Save to List

Saved in: