Search Results - "McCandlish, Sam"

Refine Results
  1. 1

    Scaling Laws for Transfer by Hernandez, Danny, Kaplan, Jared, Henighan, Tom, McCandlish, Sam

    Published 01-02-2021
    “…We study empirical scaling laws for transfer learning between distributions in an unsupervised, fine-tuning setting. When we train increasingly large neural…”
    Get full text
    Journal Article
  2. 2

    Towards Understanding Sycophancy in Language Models by Sharma, Mrinank, Tong, Meg, Korbak, Tomasz, Duvenaud, David, Askell, Amanda, Bowman, Samuel R, Cheng, Newton, Durmus, Esin, Hatfield-Dodds, Zac, Johnston, Scott R, Kravec, Shauna, Maxwell, Timothy, McCandlish, Sam, Ndousse, Kamal, Rausch, Oliver, Schiefer, Nicholas, Yan, Da, Zhang, Miranda, Perez, Ethan

    Published 20-10-2023
    “…Human feedback is commonly utilized to finetune AI assistants. But human feedback may also encourage model responses that match user beliefs over truthful…”
    Get full text
    Journal Article
  3. 3

    Studying Large Language Model Generalization with Influence Functions by Grosse, Roger, Bae, Juhan, Anil, Cem, Elhage, Nelson, Tamkin, Alex, Tajdini, Amirhossein, Steiner, Benoit, Li, Dustin, Durmus, Esin, Perez, Ethan, Hubinger, Evan, Lukošiūtė, Kamilė, Nguyen, Karina, Joseph, Nicholas, McCandlish, Sam, Kaplan, Jared, Bowman, Samuel R

    Published 07-08-2023
    “…When trying to gain better visibility into a machine learning model in order to understand and mitigate the associated risks, a potentially valuable source of…”
    Get full text
    Journal Article
  4. 4

    Towards Measuring the Representation of Subjective Global Opinions in Language Models by Durmus, Esin, Nguyen, Karina, Liao, Thomas I, Schiefer, Nicholas, Askell, Amanda, Bakhtin, Anton, Chen, Carol, Hatfield-Dodds, Zac, Hernandez, Danny, Joseph, Nicholas, Lovitt, Liane, McCandlish, Sam, Sikder, Orowa, Tamkin, Alex, Thamkul, Janel, Kaplan, Jared, Clark, Jack, Ganguli, Deep

    Published 28-06-2023
    “…Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to…”
    Get full text
    Journal Article
  5. 5

    An Empirical Model of Large-Batch Training by McCandlish, Sam, Kaplan, Jared, Amodei, Dario, Team, OpenAI Dota

    Published 14-12-2018
    “…In an increasing number of domains it has been demonstrated that deep learning models can be trained using relatively large batch sizes without sacrificing…”
    Get full text
    Journal Article
  6. 6

    Toy Models of Superposition by Elhage, Nelson, Hume, Tristan, Olsson, Catherine, Schiefer, Nicholas, Henighan, Tom, Kravec, Shauna, Hatfield-Dodds, Zac, Lasenby, Robert, Drain, Dawn, Chen, Carol, Grosse, Roger, McCandlish, Sam, Kaplan, Jared, Amodei, Dario, Wattenberg, Martin, Olah, Christopher

    Published 21-09-2022
    “…Neural networks often pack many unrelated concepts into a single neuron - a puzzling phenomenon known as 'polysemanticity' which makes interpretability much…”
    Get full text
    Journal Article
  7. 7
  8. 8
  9. 9
  10. 10

    Scaling Laws and Interpretability of Learning from Repeated Data by Hernandez, Danny, Brown, Tom, Conerly, Tom, DasSarma, Nova, Drain, Dawn, El-Showk, Sheer, Elhage, Nelson, Hatfield-Dodds, Zac, Henighan, Tom, Hume, Tristan, Johnston, Scott, Mann, Ben, Olah, Chris, Olsson, Catherine, Amodei, Dario, Joseph, Nicholas, Kaplan, Jared, McCandlish, Sam

    Published 20-05-2022
    “…Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher…”
    Get full text
    Journal Article
  11. 11
  12. 12
  13. 13
  14. 14
  15. 15
  16. 16
  17. 17
  18. 18

    A General Language Assistant as a Laboratory for Alignment by Askell, Amanda, Bai, Yuntao, Chen, Anna, Drain, Dawn, Ganguli, Deep, Henighan, Tom, Jones, Andy, Joseph, Nicholas, Mann, Ben, DasSarma, Nova, Elhage, Nelson, Hatfield-Dodds, Zac, Hernandez, Danny, Kernion, Jackson, Ndousse, Kamal, Olsson, Catherine, Amodei, Dario, Brown, Tom, Clark, Jack, McCandlish, Sam, Olah, Chris, Kaplan, Jared

    Published 01-12-2021
    “…Given the broad capabilities of large language models, it should be possible to work towards a general-purpose, text-based assistant that is aligned with human…”
    Get full text
    Journal Article
  19. 19
  20. 20

    Scaling Laws for Neural Language Models by Kaplan, Jared, McCandlish, Sam, Henighan, Tom, Brown, Tom B, Chess, Benjamin, Child, Rewon, Gray, Scott, Radford, Alec, Wu, Jeffrey, Amodei, Dario

    Published 22-01-2020
    “…We study empirical scaling laws for language model performance on the cross-entropy loss. The loss scales as a power-law with model size, dataset size, and the…”
    Get full text
    Journal Article