Big Data Privacy Connected sources and available data
Big Data & Privacy Connected sources and available data
Information requires s p a c e • Data – measurements, records, facts – all require space • Storage concern memory • Where do we get data from? • Where might we get info about health stats? • Where might we get info about transportation? • Social media usage? • www. Opendata. socrata. com
How much can you remember? Bit – one binary digit Byte – 8 bits [storage for a single letter of a word] Kilo. Byte – 1024 bytes (binary not decimal) Megabyte – 1024 KB. One minute of sound recording. A novel. Giga. Byte – A symphony in high fidelity. One movie. Terabyte – All the books in a large library. ~ 1, 610 CDs
How much can you remember? Petabyte – 223, 100 DVDs (878 feet tall) or mile high CD stack Exabyte – A list of all words ever spoken by humankind Zettabyte – Recorded video for all TV and movies Yottabyte – Would fill DE and RI with a million datacenters
Moore’s Law ~ Computer Processors will double in complexity approximately every 2 years. Gordon Moore – 1965 Nintendo SNES Game Cube Wii
Challenges Given a tool that can provide access to large data sets, what might be some issues that arise? • What would it be like to try and locate a friend, by sight, in NYC? • What other challenges might we encounter? • Visualization, delay, multiple ‘stories’
‘Internet of Things’ • RFID Radio Frequency ID • RFID can communicate on a network Internet • We know every device on the internet requires its own IP address. • Currently we have an IPv 4 (internet protocol version 4) • Only allows a total of ~4 billion addresses • 32 bit address • New IPv 6 will allow 300 undecillion unique addresses • That’s a 3 followed by 38 zeroes
Let’s find sources of data With a partner (or 3), choose one of the sources on the sheet You may find one of your own Write your responses
- Slides: 8