Search Results - "Pfau, Jacob"
-
1
Stress testing reveals gaps in clinic readiness of image-based diagnostic artificial intelligence models
Published in NPJ digital medicine (21-01-2021)“…Artificial intelligence models match or exceed dermatologists in melanoma image classification. Less is known about their robustness against real-world…”
Get full text
Journal Article -
2
Artificial Intelligence in Dermatology: A Primer
Published in Journal of investigative dermatology (01-08-2020)“…Artificial intelligence is becoming increasingly important in dermatology, with studies reporting accuracy matching or exceeding dermatologists for the…”
Get full text
Journal Article -
3
Artificial Intelligence in Teledermatology
Published in Current dermatology reports (15-09-2019)“…Purpose of Review This review summarizes current and prospective applications of artificial intelligence (AI) and smartphone technologies to automated…”
Get full text
Journal Article -
4
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
Published 24-04-2024“…Chain-of-thought responses from language models improve performance across most benchmarks. However, it remains unclear to what extent these performance gains…”
Get full text
Journal Article -
5
Steering Without Side Effects: Improving Post-Deployment Control of Language Models
Published 20-06-2024“…Language models (LMs) have been shown to behave unexpectedly post-deployment. For example, new jailbreaks continually arise, allowing model misuse, despite…”
Get full text
Journal Article -
6
Taking AI Welfare Seriously
Published 04-11-2024“…In this report, we argue that there is a realistic possibility that some AI systems will be conscious and/or robustly agentic in the near future. That means…”
Get full text
Journal Article -
7
Self-Consistency of Large Language Models under Ambiguity
Published 20-10-2023“…Large language models (LLMs) that do not give consistent answers across contexts are problematic when used for tasks with expectations of consistency, e.g.,…”
Get full text
Journal Article -
8
Goal Misgeneralization in Deep Reinforcement Learning
Published 28-05-2021“…We study goal misgeneralization, a type of out-of-distribution generalization failure in reinforcement learning (RL). Goal misgeneralization failures occur…”
Get full text
Journal Article -
9
Robust Semantic Interpretability: Revisiting Concept Activation Vectors
Published 06-04-2021“…Interpretability methods for image classification assess model trustworthiness by attempting to expose whether the model is systematically biased or attending…”
Get full text
Journal Article -
10
Global Saliency: Aggregating Saliency Maps to Assess Dataset Artefact Bias
Published 16-10-2019“…In high-stakes applications of machine learning models, interpretability methods provide guarantees that models are right for the right reasons. In medical…”
Get full text
Journal Article -
11
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Published 27-07-2023“…Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used…”
Get full text
Journal Article