Search Results - "Olah, Chris"

  • Showing 1 - 9 results of 9
Refine Results
  1. 1

    Scaling Laws and Interpretability of Learning from Repeated Data by Hernandez, Danny, Brown, Tom, Conerly, Tom, DasSarma, Nova, Drain, Dawn, El-Showk, Sheer, Elhage, Nelson, Hatfield-Dodds, Zac, Henighan, Tom, Hume, Tristan, Johnston, Scott, Mann, Ben, Olah, Chris, Olsson, Catherine, Amodei, Dario, Joseph, Nicholas, Kaplan, Jared, McCandlish, Sam

    Published 20-05-2022
    “…Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher…”
    Get full text
    Journal Article
  2. 2
  3. 3
  4. 4
  5. 5
  6. 6
  7. 7

    A General Language Assistant as a Laboratory for Alignment by Askell, Amanda, Bai, Yuntao, Chen, Anna, Drain, Dawn, Ganguli, Deep, Henighan, Tom, Jones, Andy, Joseph, Nicholas, Mann, Ben, DasSarma, Nova, Elhage, Nelson, Hatfield-Dodds, Zac, Hernandez, Danny, Kernion, Jackson, Ndousse, Kamal, Olsson, Catherine, Amodei, Dario, Brown, Tom, Clark, Jack, McCandlish, Sam, Olah, Chris, Kaplan, Jared

    Published 01-12-2021
    “…Given the broad capabilities of large language models, it should be possible to work towards a general-purpose, text-based assistant that is aligned with human…”
    Get full text
    Journal Article
  8. 8

    Concrete Problems in AI Safety by Amodei, Dario, Olah, Chris, Steinhardt, Jacob, Christiano, Paul, Schulman, John, Mané, Dan

    Published 21-06-2016
    “…Rapid progress in machine learning and artificial intelligence (AI) has brought increasing attention to the potential impacts of AI technologies on society. In…”
    Get full text
    Journal Article
  9. 9