Search Results - "Nikpoor, Somaieh"
-
1
Data Governance in the Age of Large-Scale Data-Driven Language Technology
Published 02-11-2022“…Proceedings of 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22) The recent emergence and adoption of Machine Learning technology,…”
Get full text
Journal Article -
2
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Published 07-03-2023“…As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The…”
Get full text
Journal Article