Search Results - "Von Werra, Leandro"

  • Showing 1 - 18 results of 18
Refine Results
  1. 1

    Radiometric Characterization of a Water-Based Conical Blackbody Calibration Target for Millimeter-Wave Remote Sensing by Jacob, Karl, Schroder, Arne, von Werra, Leandro, Reinhard, Florian, Raisin, Philippe, Murk, Axel

    “…In this paper, we present the design and radiometric characterization of a water-based conical blackbody calibration target to be applied as a precise…”
    Get full text
    Journal Article
  2. 2

    DESIGN AND PERFORMANCE OF TWO ORTHOGONAL EXTRACTION TIME-OF-FLIGHT SECONDARY ION MASS SPECTROMETERS FOR FOCUSED ION BEAM INSTRUMENTS by Alberts, Deborah, von Werra, Leandro, Oestlund, Fredrik, Rohner, Urs, Hohl, Markus, Michler, Johann, Whitby, James A.

    Published in Instrumentation science & technology (04-07-2014)
    “…The design and performance of two orthogonal extraction time-of-flight mass spectrometers are reported that were adapted to existing focused ion beam…”
    Get full text
    Journal Article
  3. 3

    Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations by Hägele, Alexander, Bakouch, Elie, Kosson, Atli, Allal, Loubna Ben, Von Werra, Leandro, Jaggi, Martin

    Published 28-05-2024
    “…Scale has become a main ingredient in obtaining strong machine learning models. As a result, understanding a model's scaling properties is key to effectively…”
    Get full text
    Journal Article
  4. 4

    The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale by Penedo, Guilherme, Kydlíček, Hynek, allal, Loubna Ben, Lozhkov, Anton, Mitchell, Margaret, Raffel, Colin, Von Werra, Leandro, Wolf, Thomas

    Published 25-06-2024
    “…The performance of a large language model (LLM) depends heavily on the quality and size of its pretraining dataset. However, the pretraining datasets for…”
    Get full text
    Journal Article
  5. 5

    Unsupervised Anomaly Detection for Seasonal Time Series by von Werra, Leandro, Tunstall, Lewis, Hofer, Simon

    “…We extend eBay's Atlas algorithm to automatically detect anomalies in unlabeled, seasonal time series data. Named MULDER, the algorithm involves deriving a…”
    Get full text
    Conference Proceeding
  6. 6

    Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models by Zhuo, Terry Yue, Zebaze, Armel, Suppattarachai, Nitchakarn, von Werra, Leandro, de Vries, Harm, Liu, Qian, Muennighoff, Niklas

    Published 01-01-2024
    “…The high cost of full-parameter fine-tuning (FFT) of Large Language Models (LLMs) has led to a series of parameter-efficient fine-tuning (PEFT) methods…”
    Get full text
    Journal Article
  7. 7

    SelfCodeAlign: Self-Alignment for Code Generation by Wei, Yuxiang, Cassano, Federico, Liu, Jiawei, Ding, Yifeng, Jain, Naman, Mueller, Zachary, de Vries, Harm, von Werra, Leandro, Guha, Arjun, Zhang, Lingming

    Published 31-10-2024
    “…Instruction tuning is a supervised fine-tuning approach that significantly improves the ability of large language models (LLMs) to follow human instructions…”
    Get full text
    Journal Article
  8. 8

    The BigCode Project Governance Card by BigCode collaboration, Hughes, Sean, de Vries, Harm, Robinson, Jennifer, Ferrandis, Carlos Muñoz, Allal, Loubna Ben, von Werra, Leandro, Ding, Jennifer, Paquet, Sebastien, Jernite, Yacine

    Published 06-12-2023
    “…This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support transparency by providing…”
    Get full text
    Journal Article
  9. 9

    OctoPack: Instruction Tuning Code Large Language Models by Muennighoff, Niklas, Liu, Qian, Zebaze, Armel, Zheng, Qinkai, Hui, Binyuan, Zhuo, Terry Yue, Singh, Swayam, Tang, Xiangru, von Werra, Leandro, Longpre, Shayne

    Published 14-08-2023
    “…Finetuning large language models (LLMs) on instructions leads to vast performance improvements on natural language tasks. We apply instruction tuning using…”
    Get full text
    Journal Article
  10. 10

    Zephyr: Direct Distillation of LM Alignment by Tunstall, Lewis, Beeching, Edward, Lambert, Nathan, Rajani, Nazneen, Rasul, Kashif, Belkada, Younes, Huang, Shengyi, von Werra, Leandro, Fourrier, Clémentine, Habib, Nathan, Sarrazin, Nathan, Sanseviero, Omar, Rush, Alexander M, Wolf, Thomas

    Published 25-10-2023
    “…We aim to produce a smaller language model that is aligned to user intent. Previous research has shown that applying distilled supervised fine-tuning (dSFT) on…”
    Get full text
    Journal Article
  11. 11
  12. 12

    The Stack: 3 TB of permissively licensed source code by Kocetkov, Denis, Li, Raymond, Allal, Loubna Ben, Li, Jia, Mou, Chenghao, Ferrandis, Carlos Muñoz, Jernite, Yacine, Mitchell, Margaret, Hughes, Sean, Wolf, Thomas, Bahdanau, Dzmitry, von Werra, Leandro, de Vries, Harm

    Published 20-11-2022
    “…Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)--not only for natural language processing but also for…”
    Get full text
    Journal Article
  13. 13
  14. 14
  15. 15
  16. 16
  17. 17
  18. 18

    A water-based conical blackbody concept for millimeter-wave remote sensing by Schroder, Arne, Murk, Axel, von Werra, Leandro, Reinhard, Florian, Raisin, Philippe, Jacob, Karl

    “…We propose a novel concept for water-based black-bodies to be used as calibration sources in microwave remote sensing instruments. In order to obtain a low…”
    Get full text
    Conference Proceeding