Search Results - "Schweter, Stefan"

  • Showing 1 - 7 results of 7
Refine Results
  1. 1

    FLERT: Document-Level Features for Named Entity Recognition by Schweter, Stefan, Akbik, Alan

    Published 13-11-2020
    “…Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that…”
    Get full text
    Journal Article
  2. 2

    Towards Robust Named Entity Recognition for Historic German by Schweter, Stefan, Baiter, Johannes

    Published 18-06-2019
    “…Recent advances in language modeling using deep neural networks have shown that these models learn representations, that vary with the network depth from…”
    Get full text
    Journal Article
  3. 3

    German's Next Language Model by Chan, Branden, Schweter, Stefan, Möller, Timo

    Published 21-10-2020
    “…In this work we present the experiments which lead to the creation of our BERT and ELECTRA based German language models, GBERT and GELECTRA. By varying the…”
    Get full text
    Journal Article
  4. 4

    hmBERT: Historical Multilingual Language Models for Named Entity Recognition by Schweter, Stefan, März, Luisa, Schmid, Katharina, Çano, Erion

    Published 31-05-2022
    “…Compared to standard Named Entity Recognition (NER), identifying persons, locations, and organizations in historical texts constitutes a big challenge. To…”
    Get full text
    Journal Article
  5. 5

    Data Centric Domain Adaptation for Historical Text with OCR Errors by März, Luisa, Schweter, Stefan, Poerner, Nina, Roth, Benjamin, Schütze, Hinrich

    Published 02-07-2021
    “…We propose new methods for in-domain and cross-domain Named Entity Recognition (NER) on historical data for Dutch and French. For the cross-domain case, we…”
    Get full text
    Journal Article
  6. 6

    Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0 by De Toni, Francesco, Akiki, Christopher, de la Rosa, Javier, Fourrier, Clémentine, Manjavacas, Enrique, Schweter, Stefan, van Strien, Daniel

    Published 11-04-2022
    “…In this work, we explore whether the recently demonstrated zero-shot abilities of the T0 model extend to Named Entity Recognition for out-of-distribution…”
    Get full text
    Journal Article
  7. 7

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model by Scao, Teven Le, Fan, Angela, Gallé, Matthias, Webson, Albert, Wang, Thomas, Bekman, Stas, Laurençon, Hugo, Launay, Julien, Raffel, Colin, Simhi, Adi, Alfassy, Amit, Rogers, Anna, Leong, Colin, van Strien, Daniel, Ponferrada, Eduardo González, Levkovizh, Efrat, Benyamina, Hamza, Tran, Hieu, Yu, Ian, Johnson, Isaac, Bhattacharjee, Joydeep, Von Werra, Leandro, Dey, Manan, Jiang, Mike Tian-Jian, Jauhar, Mohammad A, Kassner, Nora, Pyysalo, Sampo, Pai, Suhas, Schick, Timo, Thrush, Tristan, Nikoulina, Vassilina, Laippala, Veronika, Heinzerling, Benjamin, Taşar, Davut Emre, Salesky, Elizabeth, Lee, Wilson Y, Szczechla, Eliza, Chhablani, Gunjan, Wang, Han, Rozen, Jos, Manica, Matteo, Nayak, Nihal, Teehan, Ryan, Albanie, Samuel, Shen, Sheng, Ben-David, Srulik, Kim, Taewoon, Neeraj, Trishala, Roberts, Adam, Tae, Jaesung, Phang, Jason, Press, Ofir, Ryabinin, Max, Peyrounette, Myriam, Patry, Nicolas, Cornette, Pierre, Dettmers, Tim, Ligozat, Anne-Laure, Névéol, Aurélie, Taktasheva, Ekaterina, Kalo, Jan-Christoph, Clive, Jordan, Kim, Najoung, Mirkin, Shachar, Pais, Shani, Pruksachatkun, Yada, Pestana, Amanda, Faranak, Amy, Santos, Ana, HajiHosseini, Azadeh, Ajibade, Benjamin, Saxena, Bharat, Nguyen, Duong A, Rezanejad, Habib, Bhattacharya, Indrani, Nejadgholi, Isar, McKenna, Michael, Burynok, Mykola, Rajani, Nazneen, Samuel, Olanrewaju, Kromann, Rasmus, Shubber, Sarmad, Viguier, Sylvain, Miranda-Escalada, Antonio, Singh, Ayush, Manjavacas, Enrique, Barth, Fabio, Bulchandani, Lokesh, Nezhurina, Marianna, Liu, Minna, Kang, Myungsun, Dahlberg, Nathan, Chandrasekhar, Ramya, Eisenberg, Renata, Canalli, Rodrigo, Schweter, Stefan, Laud, Tanmay, Kainuma, Tomoya, Venkatraman, Yash, Xu, Yingxin

    Published 09-11-2022
    “…Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these…”
    Get full text
    Journal Article