Search Results - "Soldaini, Luca"

Refine Results
  1. 1

    Learning to reformulate long queries for clinical decision support by Soldaini, Luca, Yates, Andrew, Goharian, Nazli

    “…The large volume of biomedical literature poses a serious problem for medical professionals, who are often struggling to keep current with it. At the same…”
    Get full text
    Journal Article
  2. 2

    Enhancing web search in the medical domain via query clarification by Soldaini, Luca, Yates, Andrew, Yom-Tov, Elad, Frieder, Ophir, Goharian, Nazli

    Published in Information retrieval (Boston) (01-04-2016)
    “…The majority of Internet users search for medical information online; however, many do not have an adequate medical vocabulary. Users might have difficulties…”
    Get full text
    Journal Article
  3. 3

    Overcoming low-utility facets for complex answer retrieval by MacAvaney, Sean, Yates, Andrew, Cohan, Arman, Soldaini, Luca, Hui, Kai, Goharian, Nazli, Frieder, Ophir

    Published in Information retrieval (Boston) (01-08-2019)
    “…Many questions cannot be answered simply; their answers must include numerous nuanced details and context. Complex Answer Retrieval (CAR) is the retrieval of…”
    Get full text
    Journal Article
  4. 4

    The Knowledge and Language Gap in Medical Information Seeking by Soldaini, Luca

    Published 01-01-2018
    “…Interest in medical information retrieval has risen significantly in the last few years. The Internet has become a primary source for consumers looking for…”
    Get full text
    Dissertation
  5. 5

    One-Shot Labeling for Automatic Relevance Estimation by MacAvaney, Sean, Soldaini, Luca

    Published 11-07-2023
    “…Dealing with unjudged documents ("holes") in relevance assessments is a perennial problem when evaluating search systems with offline experiments. Holes can…”
    Get full text
    Journal Article
  6. 6

    The Cascade Transformer: an Application for Efficient Answer Sentence Selection by Soldaini, Luca, Moschitti, Alessandro

    Published 05-05-2020
    “…Large transformer-based language models have been shown to be very effective in many classification tasks. However, their computational complexity prevents…”
    Get full text
    Journal Article
  7. 7

    RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models by Lee, Hyunji, Soldaini, Luca, Cohan, Arman, Seo, Minjoon, Lo, Kyle

    Published 04-09-2024
    “…Information retrieval methods often rely on a single embedding model trained on large, general-domain datasets like MSMARCO. While this approach can produce a…”
    Get full text
    Journal Article
  8. 8

    Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders by Lee, Hyunji, Soldaini, Luca, Cohan, Arman, Seo, Minjoon, Lo, Kyle

    Published 16-11-2023
    “…Prevailing research practice today often relies on training dense retrievers on existing large datasets such as MSMARCO and then experimenting with ways to…”
    Get full text
    Journal Article
  9. 9

    A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents by Newman, Benjamin, Soldaini, Luca, Fok, Raymond, Cohan, Arman, Lo, Kyle

    Published 24-05-2023
    “…Many real-world applications (e.g., note taking, search) require extracting a sentence or paragraph from a document and showing that snippet to a human outside…”
    Get full text
    Journal Article
  10. 10

    Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula by Lucy, Li, August, Tal, Wang, Rose E, Soldaini, Luca, Allison, Courtney, Lo, Kyle

    Published 08-08-2024
    “…To ensure that math curriculum is grade-appropriate and aligns with critical skills or concepts in accordance with educational standards, pedagogical experts…”
    Get full text
    Journal Article
  11. 11

    Self-Directed Synthetic Dialogues and Revisions Technical Report by Lambert, Nathan, Schoelkopf, Hailey, Gokaslan, Aaron, Soldaini, Luca, Pyatkin, Valentina, Castricato, Louis

    Published 25-07-2024
    “…Synthetic data has become an important tool in the fine-tuning of language models to follow instructions and solve complex problems. Nevertheless, the majority…”
    Get full text
    Journal Article
  12. 12

    Modeling Context in Answer Sentence Selection Systems on a Latency Budget by Han, Rujun, Soldaini, Luca, Moschitti, Alessandro

    Published 28-01-2021
    “…Answer Sentence Selection (AS2) is an efficient approach for the design of open-domain Question Answering (QA) systems. In order to achieve low latency,…”
    Get full text
    Journal Article
  13. 13

    Overview of the TREC 2023 NeuCLIR Track by Lawrie, Dawn, MacAvaney, Sean, Mayfield, James, McNamee, Paul, Oard, Douglas W, Soldaini, Luca, Yang, Eugene

    Published 11-04-2024
    “…The principal goal of the TREC Neural Cross-Language Information Retrieval (NeuCLIR) track is to study the impact of neural approaches to cross-language…”
    Get full text
    Journal Article
  14. 14

    KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions by Xu, Fangyuan, Lo, Kyle, Soldaini, Luca, Kuehl, Bailey, Choi, Eunsol, Wadden, David

    Published 06-03-2024
    “…Large language models (LLMs) adapted to follow user instructions are now widely deployed as conversational agents. In this work, we examine one increasingly…”
    Get full text
    Journal Article
  15. 15

    Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection by Di Liello, Luca, Garg, Siddhant, Soldaini, Luca, Moschitti, Alessandro

    Published 20-05-2022
    “…An important task for designing QA systems is answer sentence selection (AS2): selecting the sentence containing (or constituting) the answer to a question…”
    Get full text
    Journal Article
  16. 16

    Paragraph-based Transformer Pre-training for Multi-Sentence Inference by Di Liello, Luca, Garg, Siddhant, Soldaini, Luca, Moschitti, Alessandro

    Published 02-05-2022
    “…Inference tasks such as answer sentence selection (AS2) or fact verification are typically solved by fine-tuning transformer-based models as individual…”
    Get full text
    Journal Article
  17. 17

    AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters by Lucy, Li, Gururangan, Suchin, Soldaini, Luca, Strubell, Emma, Bamman, David, Klein, Lauren F, Dodge, Jesse

    Published 12-01-2024
    “…Large language models' (LLMs) abilities are drawn from their pretraining data, and model development begins with data curation. However, decisions around what…”
    Get full text
    Journal Article
  18. 18

    Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems by Matsubara, Yoshitomo, Soldaini, Luca, Lind, Eric, Moschitti, Alessandro

    Published 15-01-2022
    “…Findings of the Association for Computational Linguistics: EMNLP 2022 Large transformer models can highly improve Answer Sentence Selection (AS2) tasks, but…”
    Get full text
    Journal Article
  19. 19

    The Surveillance AI Pipeline by Kalluri, Pratyusha Ria, Agnew, William, Cheng, Myra, Owens, Kentrell, Soldaini, Luca, Birhane, Abeba

    Published 26-09-2023
    “…A rapidly growing number of voices argue that AI research, and computer vision in particular, is powering mass surveillance. Yet the direct path from computer…”
    Get full text
    Journal Article
  20. 20

    When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets by Weller, Orion, Lo, Kyle, Wadden, David, Lawrie, Dawn, Van Durme, Benjamin, Cohan, Arman, Soldaini, Luca

    Published 15-09-2023
    “…Using large language models (LMs) for query or document expansion can improve generalization in information retrieval. However, it is unknown whether these…”
    Get full text
    Journal Article