Search Results - "Jernite, Yacine"
-
1
Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning
Published in PloS one (06-04-2017) “…To demonstrate the incremental benefit of using free text data in addition to vital sign and demographic data to identify patients with suspected infection in…”
Journal Article -
2
Ten simple rules for building and maintaining a responsible data science workflow
Published in PLoS computational biology (18-07-2024)
Journal Article -
3
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Published in Transactions of the Association for Computational Linguistics (31-01-2022) “…With the success of large-scale pre-training and multilingual modeling in Natural Language Processing (NLP), recent years have seen a proliferation of large,…”
Journal Article -
4
Light bulbs have energy ratings — so why can’t AI chatbots?
Published in Nature (London) (22-08-2024) “…The rising energy and environmental cost of the artificial-intelligence boom is fuelling concern. Green policy mechanisms that already exist offer a path…”
Journal Article -
5
Improving documentation of presenting problems in the emergency department using a domain-specific ontology and machine learning-driven user interfaces
Published in International journal of medical informatics (Shannon, Ireland) (01-12-2019) “…• Machine Learning can be used prospectively to improve structured data capture. • Machine learning can make capture of structured data faster than unstructured…”
Journal Article -
6
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Published in Psychofenia (09-12-2022) “…The BigScience Workshop was a value-driven initiative that spanned one and a half years of interdisciplinary research and culminated in the creation of ROOTS, a…”
Conference Proceeding -
7
Learning Representations of Text through Language and Discourse Modeling: From Characters to Sentences
Published 01-01-2018 “…In this thesis, we consider the problem of obtaining a representation of the meaning expressed in a text. How to do so correctly remains a largely open…”
Dissertation -
8
Unsupervised Text Summarization via Mixed Model Back-Translation
Published 22-08-2019 “…Back-translation based approaches have recently led to significant progress in unsupervised sequence-to-sequence tasks such as machine translation or style…”
Journal Article -
9
THE RANDOM SUBGRAPH MODEL FOR THE ANALYSIS OF AN ECCLESIASTICAL NETWORK IN MEROVINGIAN GAUL
Published in The annals of applied statistics (01-03-2014) “…In the last two decades many random graph models have been proposed to extract knowledge from networks. Most of them look for communities or, more generally,…”
Journal Article -
10
Power Hungry Processing: Watts Driving the Cost of AI Deployment?
Published 28-11-2023 “…ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT '24), June 3-6, 2024, Rio de Janeiro, Brazil. Recent years have seen a surge in the…”
Journal Article -
11
Stable Bias: Analyzing Societal Representations in Diffusion Models
Published 20-03-2023 “…As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly prevalent and seeing growing adoption as commercial services, characterizing…”
Journal Article -
12
CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models
Published 22-05-2024 “…This paper introduces the "CIVICS: Culturally-Informed & Values-Inclusive Corpus for Societal impacts" dataset, designed to evaluate the social and cultural…”
Journal Article -
13
Stronger Together: on the Articulation of Ethical Charters, Legal Tools, and Technical Documentation in ML
Published 09-05-2023 “…The growing need for accountability of the people behind AI systems can be addressed by leveraging processes in three fields of study: ethics, law, and…”
Journal Article -
14
Towards Openness Beyond Open Access: User Journeys through 3 Open AI Collaboratives
Published 20-01-2023 “…Open Artificial Intelligence (Open source AI) collaboratives offer alternative pathways for how AI can be developed beyond well-resourced technology companies…”
Journal Article -
15
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Published 09-12-2022 “…The BigScience Workshop was a value-driven initiative that spanned one and a half years of interdisciplinary research and culminated in the creation of ROOTS, a…”
Journal Article -
16
Training Transformers Together
Published 07-07-2022 “…The infrastructure necessary for training state-of-the-art models is becoming overly expensive, which makes training such models affordable only to large…”
Journal Article -
17
On the Standardization of Behavioral Use Clauses and Their Adoption for Responsible Licensing of AI
Published 07-02-2024 “…Growing concerns over negligent or malicious uses of AI have increased the appetite for tools that help manage the risks of the technology. In 2018, licenses…”
Journal Article -
18
The ROOTS Search Tool: Data Transparency for LLMs
Published 27-02-2023 “…ROOTS is a 1.6TB multilingual text corpus developed for the training of BLOOM, currently the largest language model explicitly accompanied by commensurate data…”
Journal Article -
19
The BigCode Project Governance Card
Published 06-12-2023 “…This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support transparency by providing…”
Journal Article -
20
Improving Conditioning in Context-Aware Sequence to Sequence Models
Published 21-11-2019 “…Neural sequence to sequence models are well established for applications which can be cast as mapping a single input sequence into a single output sequence. In…”
Journal Article