L inked O pen D ata technologies for publication of census microdata
Censuses are one of the most relevant types of statistical data, allowing analyses of the population in terms of demography, economy, sociology, and culture. For fine‐grained analysis, census agencies publish census microdata that consist of a sample of individual records of the census containing de...
Saved in:
Published in: | Journal of the American Society for Information Science and Technology Vol. 64; no. 9; pp. 1802 - 1814 |
---|---|
Main Authors: | , , , |
Format: | Journal Article |
Language: | English |
Published: |
01-09-2013
|
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Censuses are one of the most relevant types of statistical data, allowing analyses of the population in terms of demography, economy, sociology, and culture. For fine‐grained analysis, census agencies publish census microdata that consist of a sample of individual records of the census containing detailed anonymous individual information. Working with microdata from different censuses and doing comparative studies are currently difficult tasks due to the diversity of formats and granularities. In this article, we show that novel data processing techniques can be applied to make census microdata interoperable and easy to access and combine. In fact, we demonstrate how
L
inked
O
pen
D
ata principles, a set of techniques to publish and make connections of (semi‐)structured data on the web, can be fruitfully applied to census microdata. We present a step‐by‐step process to achieve this goal and we study, in theory and practice, two real case studies: the 2001 Spanish census and a general framework for
I
ntegrated
P
ublic
U
se
M
icrodata
S
eries (
IPUMS
‐I). |
---|---|
ISSN: | 1532-2882 1532-2890 |
DOI: | 10.1002/asi.22876 |