A Field Study on Linked and Open Data at Datahub.io |
We describe and conduct a study on datahub.io to explore to what, in practice, Linked and Open Data refers to. We focus on the use of formats, licenses, ages and popularity of the data. An in-depth analysis reveals information about availability, quantity, structure and vocabulary usage of the real-world RDFbased datasets contained. Results show that the most common formats are Microsoft Excel, CSV and RDF. High proportions of structured data is of tabular nature, independent from the format. The heuristics and evaluation methods developed here are released as open source and can be applied to other CKANbased repositories and RDF-based datasets, too.
Heuss T, Fengel J, Humm B, Harriehausen-Mühlbauer B, Atkinson S