Search Results - "Vu, Minh Chien"
-
1
Influence of initial water, moisture, and geopolymer content on geopolymer modified sludge
Published in Construction & building materials (28-02-2020)“…•Modified sludge by fly ash based geopolymer at ambient temperature.•Initial curing time and initial water content are controlled in two methods.•Initial…”
Get full text
Journal Article -
2
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Published 22-06-2024“…Task automation has been greatly empowered by the recent advances in Large Language Models (LLMs) via Python code, where the tasks ranging from software…”
Get full text
Journal Article -
3
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Published 09-02-2024“…Datasets are foundational to many breakthroughs in modern artificial intelligence. Many recent achievements in the space of natural language processing (NLP)…”
Get full text
Journal Article -
4
Consent in Crisis: The Rapid Decline of the AI Data Commons
Published 20-07-2024“…General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma…”
Get full text
Journal Article -
5
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Published 30-03-2024“…Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and…”
Get full text
Journal Article -
6
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Published 12-10-2022“…Training and evaluating language models increasingly requires the construction of meta-datasets --diverse collections of curated data with clear provenance…”
Get full text
Journal Article -
7
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Published 07-03-2023“…As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The…”
Get full text
Journal Article