Dr Anna Krystallli
R-RSE
@tomjwebb I see tons of spreadsheets that i don't understand anything (or the stduent), making it really hard to share.
— Erika Berenguer (@Erika_Berenguer) January 16, 2015
@tomjwebb @ScientificData "Document. Everything." Data without documentation has no value.
— Sven Kochmann (@indianalytics) January 16, 2015
@tomjwebb Annotate, annotate, annotate!
— CanJFishAquaticSci (@cjfas) January 16, 2015
Document all the metadata (including protocols).@tomjwebb
— Ward Appeltans (@WrdAppltns) January 16, 2015
You download a zip file of #OpenData. Apart from your data file(s), what else should it contain?
— Leigh Dodds (@ldodds) February 6, 2017
“Information that describes, explains, locates, or in some way makes it easier to find, access, and use a resource (in this case, data).”
Backbone of digital curation
Without it, a digital resource may be irretrievable, unidentifiable or unusable
By structuring & adhering to controlled vocabularies, data can be combined, accessed and searched!
Different communities develop different standards which define both the structure and content of metadata
General: Dublin Core Metadata Initiative Specification
NERC Data Centers: Check with individual data centers for their metadata specification.
Re3data.org: Registry of Research Data Repositories.
Most university libraries have assistants dedicated to Research Data Management:
@tomjwebb @ScientificData Talk to their librarian for data management strategies #datainfolit
— Yasmeen Shorish (@yasmeen_azadi) January 16, 2015
Make sure to record units!
methods
documentKeep a dynamic document used to plan, record and write up methods.
@tomjwebb record every detail about how/where/why it is collected
— Sal Keith (@Sal_Keith) January 16, 2015
Any additional information other users would need to combine your data with theirs? Record it
Teaching this course has always felt challenging in terms of practical exercises
Advising on domain specific Controlled Vocabularies & structure ❌
How can we practice creating metadata?
bringing together scientists, developers, and open data enthusiasts from academia, industry, government, and non-profits to get together for a few days and hack on various projects.
Luckily, a whole bunch of other awesome folks were also thinking about these topics and interested in working on them! 🤩
(in alphabetical order):
dataspice
Package
dataspice
makes it easier for researchers to create basic, lightweight and concise metadata files for their datasets.
csv
filesdataspice
tutorialThe goal of this section is to provide a practical exercise in creating metadata for an example field collected data product using package dataspice
.
dataspice
workflowhead to the dataspice
tutorial