Publishing Official Datasets Toby Green OECD Publishing 4th Bloomsbury Conference on e-Publishing and e-Publications 24th and 25th June, 2010
Publishing Official Data in cool ways since 1961
Climategate! “investigation reveals scientific concern about missing tree ring data”. The Guardian, January 2010
Would it have been lost had it been properly published and curated? Should we rely on authors to self-publish data?
Data is not second class stuff. It should be just as easy to: • peer review • publish • cite as research articles.
We simply need the existing scholarly publishing ‘toolkit’: • review mechanism • metadata • DOI identifiers • CrossRef
So, whereas for books we have this: Here’s one OECD prepared earlier...
For datasets we could have this:
But data is not the same as an article or book chapter. Sub-sets can be published.
Sub-sets: each has a unique identifier, with links to the ‘mother’ dataset
Data subset series Homepage (DOI: 1234.56/Series)
• Subset 1 Homepage (DOI: 1234.56/Subset#1), DOI link to: Main dataset
• Subset 2 Homepage (DOI: 1234.56/Subset#2), DOI link to: Main dataset
• Subset 3 Homepage (DOI: 1234.56/Subset#3), DOI link to: Main dataset
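The subset structure on this slide could be modelled roughly as follows. This is only a sketch: the class and field names (`Subset`, `Dataset`, `parent_doi`) are illustrative assumptions, not OECD's actual schema, and the DOIs are the placeholders shown on the slide.

```python
from dataclasses import dataclass, field


@dataclass
class Subset:
    doi: str          # the subset's own unique identifier
    title: str
    parent_doi: str   # link back to the 'mother' dataset


@dataclass
class Dataset:
    doi: str
    title: str
    subsets: list = field(default_factory=list)

    def add_subset(self, doi: str, title: str) -> Subset:
        # Each subset gets its own DOI and records its parent's DOI.
        subset = Subset(doi=doi, title=title, parent_doi=self.doi)
        self.subsets.append(subset)
        return subset


# Placeholder identifiers taken from the slide, not real DOIs.
series = Dataset(doi="1234.56/Series", title="Data subset series")
subset_1 = series.add_subset("1234.56/Subset#1", "Subset 1")
subset_2 = series.add_subset("1234.56/Subset#2", "Subset 2")
```

Storing the parent DOI on every subset means a reader who lands on a subset homepage can always be pointed back to the main dataset.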
The same data can have a different rendition or graphical interface
Datasets with multiple renditions: same identifier. Dataset ‘Homepage’: Rendition 1, Rendition 2, Rendition 3
Datasets can grow. Our current solution is to give them the same identifier and explain the growth in the metadata
Datasets can change. Our current solution is to give them a NEW identifier, explain the change in the metadata, and provide a link back to the original dataset.
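The two identifier rules (growth keeps the identifier, change gets a new one with a link back) can be sketched like this. The record fields `note` and `supersedes`, the function names, and the example DOIs are hypothetical and chosen only for illustration. They do not describe OECD's production system.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class DatasetRecord:
    doi: str
    note: str = ""                     # metadata explaining growth or change
    supersedes: Optional[str] = None   # DOI of the original dataset, if any


def record_growth(record: DatasetRecord, note: str) -> DatasetRecord:
    # Growth: same identifier, growth explained in the metadata.
    return replace(record, note=note)


def record_change(record: DatasetRecord, new_doi: str, note: str) -> DatasetRecord:
    # Change: NEW identifier, change explained, link back to the original.
    return DatasetRecord(doi=new_doi, note=note, supersedes=record.doi)


# Hypothetical DOIs for illustration only.
original = DatasetRecord(doi="1234.56/Dataset")
grown = record_growth(original, "Added observations for a further year")
changed = record_change(original, "1234.56/Dataset-2", "Revised methodology")
```

Under this sketch, existing citations to a dataset that has only grown still resolve to the same identifier, while a citation to a changed dataset continues to point at the version the author actually used.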
Read all about it! http://doi.org/abr
OECD’s “stuff machine” (2010) Jim Gray’s data ‘era’ (2008) Publications Processed data Data Presentations Data
Data publishing workflow at OECD
Statistician and Researcher Responsibility: Data producer (author)
Publisher Responsibility: Data Editor, Data Production Editor, Data Operations, Data Marketing & Support
Tasks: Selection, Quality Assurance, Metadata, Acronym killing, Packaging, DOI allocation, Technical checks, Hosting, Infrastructure, Promotion, Training, Support, Discovery optimisation
Functions: Certification, Registration, Stewardship, Awareness
End User and Librarian Feedback
I can end it here, or is there time for more? toby.green@oecd.org
http://statlinks.oecdcode.org/
Great visualisations tell stories Charles Minard's 1869 chart showing the losses in men, their movements, and the temperature of Napoleon's 1812 Russian campaign.
TOYS FOR BOYS?
OECD Toys
• OECD Factbook iPhone App http://itunes.apple.com/us/app/oecd-factbook-2010/id327348502?mt=8&uo=6
• OECD Regional Statistics eXplorer http://stats.oecd.org/OECDregionalstatistics/
• OECD Factblog https://community.oecd.org/community/factblog/blog/2010/05/11/tax-who-pays-what
• OECD graph generator http://viz.oecdcode.org/ts/20755104 table 1/latest
Pimp my data
• Facebook privacy (not any more): http://mattmckeon.com/facebook-privacy/
• Why can’t I get a cab outside the UN building in NY? http://www.nytimes.com/interactive/2010/04/02/nyregion/taxi-map.html
• Why my musician brother grows his own food http://www.informationisbeautiful.net/2010/how-much-do-music-artists-earn-online/
• How they spend your money www.wheredoesmymoneygo.org
PIMP KITS and SITES FOR SHARING DATA
http://statlinks.oecdcode.org/
Thank-you and er… toby.green@oecd.org