Search Results - "Dey, Manan"
-
1
Assessing Viewer's Mental Health by Detecting Depression in YouTube Videos
Published 29-07-2020“…Depression is one of the most prevalent mental health issues around the world, proving to be one of the leading causes of suicide and placing large economic…”
Get full text
Journal Article -
2
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts
Published 22-05-2022“…Neural Machine Translation systems built on top of Transformer-based architectures are routinely improving the state-of-the-art in translation quality…”
Get full text
Journal Article -
3
Evaluating Gender Bias in Natural Language Inference
Published 12-05-2021“…Gender-bias stereotypes have recently raised significant ethical concerns in natural language processing. However, progress in detection and evaluation of…”
Get full text
Journal Article -
4
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Published 20-12-2021“…What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until…”
Get full text
Journal Article -
5
Consent in Crisis: The Rapid Decline of the AI Data Commons
Published 20-07-2024“…General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma…”
Get full text
Journal Article -
6
StarCoder 2 and The Stack v2: The Next Generation
Published 29-02-2024“…The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces…”
Get full text
Journal Article -
7
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Published 02-02-2022“…PromptSource is a system for creating, sharing, and using natural language prompts. Prompts are functions that map an example from a dataset to a natural…”
Get full text
Journal Article -
8
SantaCoder: don't reach for the stars
Published 09-01-2023“…The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes…”
Get full text
Journal Article -
9
StarCoder: may the source be with you
Published 09-05-2023“…The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces…”
Get full text
Journal Article -
10
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Published 07-03-2023“…As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The…”
Get full text
Journal Article -
11
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Published 09-11-2022“…Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these…”
Get full text
Journal Article -
12
Multitask Prompted Training Enables Zero-Shot Task Generalization
Published 15-10-2021“…Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been…”
Get full text
Journal Article