Search Results - "Soldaini, Luca"
1
Learning to reformulate long queries for clinical decision support
Published in Journal of the Association for Information Science and Technology (01-11-2017) “…The large volume of biomedical literature poses a serious problem for medical professionals, who are often struggling to keep current with it. At the same…”
Journal Article
2
Enhancing web search in the medical domain via query clarification
Published in Information retrieval (Boston) (01-04-2016) “…The majority of Internet users search for medical information online; however, many do not have an adequate medical vocabulary. Users might have difficulties…”
Journal Article
3
Overcoming low-utility facets for complex answer retrieval
Published in Information retrieval (Boston) (01-08-2019) “…Many questions cannot be answered simply; their answers must include numerous nuanced details and context. Complex Answer Retrieval (CAR) is the retrieval of…”
Journal Article
4
The Knowledge and Language Gap in Medical Information Seeking
Published 01-01-2018 “…Interest in medical information retrieval has risen significantly in the last few years. The Internet has become a primary source for consumers looking for…”
Dissertation
5
One-Shot Labeling for Automatic Relevance Estimation
Published 11-07-2023 “…Dealing with unjudged documents ("holes") in relevance assessments is a perennial problem when evaluating search systems with offline experiments. Holes can…”
Journal Article
6
The Cascade Transformer: an Application for Efficient Answer Sentence Selection
Published 05-05-2020 “…Large transformer-based language models have been shown to be very effective in many classification tasks. However, their computational complexity prevents…”
Journal Article
7
RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models
Published 04-09-2024 “…Information retrieval methods often rely on a single embedding model trained on large, general-domain datasets like MSMARCO. While this approach can produce a…”
Journal Article
8
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders
Published 16-11-2023 “…Prevailing research practice today often relies on training dense retrievers on existing large datasets such as MSMARCO and then experimenting with ways to…”
Journal Article
9
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents
Published 24-05-2023 “…Many real-world applications (e.g., note taking, search) require extracting a sentence or paragraph from a document and showing that snippet to a human outside…”
Journal Article
10
Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
Published 08-08-2024 “…To ensure that math curriculum is grade-appropriate and aligns with critical skills or concepts in accordance with educational standards, pedagogical experts…”
Journal Article
11
Self-Directed Synthetic Dialogues and Revisions Technical Report
Published 25-07-2024 “…Synthetic data has become an important tool in the fine-tuning of language models to follow instructions and solve complex problems. Nevertheless, the majority…”
Journal Article
12
Modeling Context in Answer Sentence Selection Systems on a Latency Budget
Published 28-01-2021 “…Answer Sentence Selection (AS2) is an efficient approach for the design of open-domain Question Answering (QA) systems. In order to achieve low latency,…”
Journal Article
13
Overview of the TREC 2023 NeuCLIR Track
Published 11-04-2024 “…The principal goal of the TREC Neural Cross-Language Information Retrieval (NeuCLIR) track is to study the impact of neural approaches to cross-language…”
Journal Article
14
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions
Published 06-03-2024 “…Large language models (LLMs) adapted to follow user instructions are now widely deployed as conversational agents. In this work, we examine one increasingly…”
Journal Article
15
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection
Published 20-05-2022 “…An important task for designing QA systems is answer sentence selection (AS2): selecting the sentence containing (or constituting) the answer to a question…”
Journal Article
16
Paragraph-based Transformer Pre-training for Multi-Sentence Inference
Published 02-05-2022 “…Inference tasks such as answer sentence selection (AS2) or fact verification are typically solved by fine-tuning transformer-based models as individual…”
Journal Article
17
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters
Published 12-01-2024 “…Large language models' (LLMs) abilities are drawn from their pretraining data, and model development begins with data curation. However, decisions around what…”
Journal Article
18
Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems
Published 15-01-2022 “…Findings of the Association for Computational Linguistics: EMNLP 2022. Large transformer models can highly improve Answer Sentence Selection (AS2) tasks, but…”
Journal Article
19
The Surveillance AI Pipeline
Published 26-09-2023 “…A rapidly growing number of voices argue that AI research, and computer vision in particular, is powering mass surveillance. Yet the direct path from computer…”
Journal Article
20
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets
Published 15-09-2023 “…Using large language models (LMs) for query or document expansion can improve generalization in information retrieval. However, it is unknown whether these…”
Journal Article