Metadata Matters Ian White September 5 2013 urbanmapping
Metadata Matters Ian White September 5, 2013 @urbanmapping 1
Achtung! No. SQL is no panacea Big Data isn’t about data Big Data isn’t new Big Data doesn’t present a Boolean quandary With power comes responsibility - AWS bills - Lady Gaga tweets - Innumeracy (correlation v causation) @urbanmapping 2
One Person’s Metadata is Another Person’s Data @urbanmapping 3
Big v Important Big Heterogeneous Raw Distributed Streaming/real time Search for meaning Time-sensitive Philosophical @urbanmapping Important Well-defined schema High value (not free) Test-driven Relational Historical Enterprise-focused 4
Data Exhaust @urbanmapping Analytics Probes Social Media Gov 2. 0 5
Platforms Commoditization of compute and storage @urbanmapping 6
A Brief History of Metadata Callimachus @urbanmapping Library of Alexandria, Egypt 7
A Brief History of Metadata “Pinakes” (lists) Title Category Author birthplace Father Word count Callimachus @urbanmapping 8
A Brief History of Metadata Leiden University, 1595 Johan van der Does @urbanmapping 9
A Brief History of Metadata Melville Dewey @urbanmapping 10
A Brief History of Metadata Card catalog room, Library of Congress c. 1920 @urbanmapping 11
A Brief History of Metadata Dewey Decimal System goes electronic in 1967 @urbanmapping 12
Out with the Old, in with the New Archiving card catalogs after digitization @urbanmapping 13
Why Can’t We Be Together? Metadata @urbanmapping Data 14
Exponential Growth in Data Volume of Data Generated Unprecedented rate of data creation, 1995 today Pinakes 300 BC @urbanmapping Catalog 1595 AD Taxonomy Database 1876 1970 15
Oh, How I’ve Missed You! The reunification of metadata with data @urbanmapping 16
Together At Last! @urbanmapping 17
GIS Remains Unevolved Melville Dewey @urbanmapping + = 18
Enter the Data Curator Part social scientist, part librarian, part statistician, part RDBMS wiz @urbanmapping 19
DIKW Model Data Fact, Signal, Symbol Information Structural v Functional Symbolic v Subjective Knowledge Processed Procedural Propositional @urbanmapping 20
Popularity Contest Metadata Big Data Science Curation @urbanmapping 21
c. 2013 @urbanmapping 22
One Person’s Metadata is Another’s Data @urbanmapping 23
- Slides: 23