Search Results - "Suarez, Pedro Ortiz"

  • Showing 1 - 19 of 19 results
  1.

    Automatic extraction of materials and properties from superconductors scientific literature by Foppiano, Luca, Castro, Pedro Baptista, Ortiz Suarez, Pedro, Terashima, Kensei, Takano, Yoshihiko, Ishii, Masashi

    “…The automatic extraction of materials and related properties from the scientific literature is gaining attention in data-driven materials science (Materials…”
    Journal Article
  2.

    Semi-automatic staging area for high-quality structured data extraction from scientific literature by Foppiano, Luca, Mato, Tomoya, Terashima, Kensei, Ortiz Suarez, Pedro, Tou, Taku, Sakai, Chikako, Wang, Wei-Sheng, Amagasa, Toshiyuki, Takano, Yoshihiko, Ishii, Masashi

    “…We propose a semi-automatic staging area for efficiently building an accurate database of experimental physical properties of superconductors from literature,…”
    Journal Article
  4.

    Semi-automatic staging area for high-quality structured data extraction from scientific literature by Foppiano, Luca, Mato, Tomoya, Terashima, Kensei, Suarez, Pedro Ortiz, Tou, Taku, Sakai, Chikako, Wang, Wei-Sheng, Amagasa, Toshiyuki, Takano, Yoshihiko, Ishii, Masashi

    Published 16-11-2023
    “…We propose a semi-automatic staging area for efficiently building an accurate database of experimental physical properties of superconductors from literature,…”
    Journal Article
  5.

    Automatic extraction of materials and properties from superconductors scientific literature by Foppiano, Luca, de Castro, Pedro Baptista, Suarez, Pedro Ortiz, Terashima, Kensei, Takano, Yoshihiko, Ishii, Masashi

    Published 23-11-2022
    “…STAM:M, 2023, VOL. 3, NO. 1, 2153633 The automatic extraction of materials and related properties from the scientific literature is gaining attention in…”
    Journal Article
  7.

    Perplexed by Quality: A Perplexity-based Method for Adult and Harmful Content Detection in Multilingual Heterogeneous Web Data by Jansen, Tim, Tong, Yangling, Zevallos, Victoria, Suarez, Pedro Ortiz

    Published 20-12-2022
    “…As demand for large corpora increases with the size of current state-of-the-art language models, using web data as the main part of the pre-training corpus for…”
    Journal Article
  8.

    Molyé: A Corpus-based Approach to Language Contact in Colonial France by Dent, Rasul, Janès, Juliette, Clérice, Thibault, Suarez, Pedro Ortiz, Sagot, Benoît

    Published 08-08-2024
    “…Whether or not several Creole languages which developed during the early modern period can be considered genetic descendants of European languages has been the…”
    Journal Article
  9.

    Towards a Cleaner Document-Oriented Multilingual Crawled Corpus by Abadji, Julien, Suarez, Pedro Ortiz, Romary, Laurent, Sagot, Benoît

    Published 17-01-2022
    “…The need for raw large raw corpora has dramatically increased in recent years with the introduction of transfer learning and semi-supervised learning methods…”
    Journal Article
  10.

    mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus by Futeral, Matthieu, Zebaze, Armel, Suarez, Pedro Ortiz, Abadji, Julien, Lacroix, Rémi, Schmid, Cordelia, Bawden, Rachel, Sagot, Benoît

    Published 12-06-2024
    “…Multimodal Large Language Models (mLLMs) are trained on a large amount of text-image data. While most mLLMs are trained on caption-like data only, Alayrac et…”
    Journal Article
  11.

    A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages by Suárez, Pedro Javier Ortiz, Romary, Laurent, Sagot, Benoît

    Published 18-06-2020
    “…Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, July 2020, Online We use the multilingual OSCAR corpus, extracted from…”
    Journal Article
  12.

    From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French by Gabay, Simon, Suarez, Pedro Ortiz, Bartz, Alexandre, Chagué, Alix, Bawden, Rachel, Gambette, Philippe, Sagot, Benoît

    Published 18-02-2022
    “…Language models for historical states of language are becoming increasingly important to allow the optimal digitisation and analysis of old textual sources…”
    Journal Article
  14.

    CamemBERT: a Tasty French Language Model by Martin, Louis, Muller, Benjamin, Suárez, Pedro Javier Ortiz, Dupont, Yoann, Romary, Laurent, de la Clergerie, Éric Villemonte, Seddah, Djamé, Sagot, Benoît

    Published 21-05-2020
    “…Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, July 2020, Online Pretrained language models are now ubiquitous in…”
    Journal Article
  19.

    Establishing a New State-of-the-Art for French Named Entity Recognition by Suárez, Pedro Javier Ortiz, Dupont, Yoann, Muller, Benjamin, Romary, Laurent, Sagot, Benoît

    Published 27-05-2020
    “…LREC 2020 - 12th Language Resources and Evaluation Conference, May 2020, Marseille, France The French TreeBank developed at the University Paris 7 is the main…”
    Journal Article