Search Results - "Von Werra, Leandro"
-
1
Radiometric Characterization of a Water-Based Conical Blackbody Calibration Target for Millimeter-Wave Remote Sensing
Published in IEEE journal of selected topics in applied earth observations and remote sensing (01-06-2019)“…In this paper, we present the design and radiometric characterization of a water-based conical blackbody calibration target to be applied as a precise…”
Get full text
Journal Article -
2
DESIGN AND PERFORMANCE OF TWO ORTHOGONAL EXTRACTION TIME-OF-FLIGHT SECONDARY ION MASS SPECTROMETERS FOR FOCUSED ION BEAM INSTRUMENTS
Published in Instrumentation science & technology (04-07-2014)“…The design and performance of two orthogonal extraction time-of-flight mass spectrometers are reported that were adapted to existing focused ion beam…”
Get full text
Journal Article -
3
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations
Published 28-05-2024“…Scale has become a main ingredient in obtaining strong machine learning models. As a result, understanding a model's scaling properties is key to effectively…”
Get full text
Journal Article -
4
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Published 25-06-2024“…The performance of a large language model (LLM) depends heavily on the quality and size of its pretraining dataset. However, the pretraining datasets for…”
Get full text
Journal Article -
5
Unsupervised Anomaly Detection for Seasonal Time Series
Published in 2019 6th Swiss Conference on Data Science (SDS) (01-06-2019)“…We extend eBay's Atlas algorithm to automatically detect anomalies in unlabeled, seasonal time series data. Named MULDER, the algorithm involves deriving a…”
Get full text
Conference Proceeding -
6
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
Published 01-01-2024“…The high cost of full-parameter fine-tuning (FFT) of Large Language Models (LLMs) has led to a series of parameter-efficient fine-tuning (PEFT) methods…”
Get full text
Journal Article -
7
SelfCodeAlign: Self-Alignment for Code Generation
Published 31-10-2024“…Instruction tuning is a supervised fine-tuning approach that significantly improves the ability of large language models (LLMs) to follow human instructions…”
Get full text
Journal Article -
8
The BigCode Project Governance Card
Published 06-12-2023“…This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support transparency by providing…”
Get full text
Journal Article -
9
OctoPack: Instruction Tuning Code Large Language Models
Published 14-08-2023“…Finetuning large language models (LLMs) on instructions leads to vast performance improvements on natural language tasks. We apply instruction tuning using…”
Get full text
Journal Article -
10
Zephyr: Direct Distillation of LM Alignment
Published 25-10-2023“…We aim to produce a smaller language model that is aligned to user intent. Previous research has shown that applying distilled supervised fine-tuning (dSFT) on…”
Get full text
Journal Article -
11
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Published 22-06-2024“…Task automation has been greatly empowered by the recent advances in Large Language Models (LLMs) via Python code, where the tasks ranging from software…”
Get full text
Journal Article -
12
The Stack: 3 TB of permissively licensed source code
Published 20-11-2022“…Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)--not only for natural language processing but also for…”
Get full text
Journal Article -
13
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Published 30-09-2022“…Evaluation is a key part of machine learning (ML), yet there is a lack of support and tooling to enable its informed and systematic practice. We introduce…”
Get full text
Journal Article -
14
StarCoder 2 and The Stack v2: The Next Generation
Published 29-02-2024“…The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces…”
Get full text
Journal Article -
15
SantaCoder: don't reach for the stars
Published 09-01-2023“…The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes…”
Get full text
Journal Article -
16
StarCoder: may the source be with you
Published 09-05-2023“…The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces…”
Get full text
Journal Article -
17
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Published 07-03-2023“…As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The…”
Get full text
Journal Article -
18
A water-based conical blackbody concept for millimeter-wave remote sensing
Published in 2016 41st International Conference on Infrared, Millimeter, and Terahertz waves (IRMMW-THz) (01-09-2016)“…We propose a novel concept for water-based black-bodies to be used as calibration sources in microwave remote sensing instruments. In order to obtain a low…”
Get full text
Conference Proceeding