Search Results - "van Strien, Daniel"
1
Datasheets for Digital Cultural Heritage Datasets
Published in Journal of open humanities data (30-10-2023)
“…Sparked by issues of quality and lack of proper documentation for datasets, the machine learning community has begun developing standardised processes for…”
Journal Article
2
Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 1)
Published in The programming historian (17-08-2022)
“…This is the first of a two-part lesson introducing deep learning based computer vision methods for humanities research. Using a dataset of historical newspaper…”
Journal Article
3
Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 2)
Published in The programming historian (17-08-2022)
“…This is the second of a two-part lesson introducing deep learning based computer vision methods for humanities research. This lesson digs deeper into the…”
Journal Article
4
An Introduction to Version Control Using GitHub Desktop
Published in The programming historian (17-06-2016)
“…In this lesson you will be introduced to the basics of version control, understand why it is useful and implement basic version control for a plain text…”
Journal Article
5
A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
Published in Journal of open humanities data (01-01-2022)
“…We present a new dataset for the task of toponym resolution in digitized historical newspapers in English. It consists of 343 annotated articles from…”
Journal Article
6
Library Carpentry: Software Skills Training for Library Professionals
Published in LIBER quarterly (2016)
“…Librarians play a crucial role in cultivating world-class research and in most disciplinary areas today world-class research relies on the use of software…”
Journal Article
7
Maps of a Nation? The Digitized Ordnance Survey for New Historical Research
Published in Journal of Victorian Culture : JVC (03-05-2021)
“…Although the Ordnance Survey has itself been the subject of historical research, scholars have not systematically used its maps as primary sources of…”
Journal Article
8
Introducción al control de versiones con GitHub Desktop
Published in The programming historian en español (07-04-2017)
“…In this lesson you will learn the basics of version control, understand why it is useful, and implement basic version control in a document of…”
Journal Article
9
Metadata Might Make Language Models Better
Published 18-11-2022
“…This paper discusses the benefits of including metadata when training language models on historical collections. Using 19th-century newspapers as a case study,…”
Journal Article
10
Library Carpentry
Published in LIBER quarterly (01-11-2016)
“…Librarians play a crucial role in cultivating world-class research and in most disciplinary areas today world-class research relies on the use of software…”
Journal Article
11
AI training resources for GLAM: a snapshot
Published 10-05-2022
“…We take a snapshot of current resources available for teaching and learning AI with a focus on the Galleries, Libraries, Archives and Museums (GLAM) community…”
Journal Article
12
Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0
Published 11-04-2022
“…In this work, we explore whether the recently demonstrated zero-shot abilities of the T0 model extend to Named Entity Recognition for out-of-distribution…”
Journal Article
13
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching
Published 17-09-2020
“…Recognizing toponyms and resolving them to their real-world referents is required for providing advanced semantic access to textual data. This process is often…”
Journal Article
14
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
Published 24-01-2022
“…In recent years, large-scale data collection efforts have prioritized the amount of data collected in order to improve the modeling capabilities of large…”
Journal Article
15
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Published 07-03-2023
“…As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The…”
Journal Article
16
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Published 09-11-2022
“…Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these…”
Journal Article