Dura Cloud Data Integrity Monitoring in the Cloud
- Slides: 16
Dura. Cloud: Data Integrity Monitoring in the Cloud Digital Preservation Partners Meeting July 21, 2010 Andrew Woods awoods@duraspace. org
Overview • • • What is Dura. Cloud? Fixity service use case Basic flow Cost and performance Next steps
What is it? • Cloud-based service offered by the not for profit organization, Dura. Space • An open source, cloud storage/compute application – Focused on preservation support and – Data access for reuse and sharing • Cloud storage across multiple commercial & non-commercial providers • An open canvas for cloud-based services
Fixity use case • Dura. Cloud user has replicated content across one or more cloud stores • Need for periodic verification of bit integrity • Seeking balance between cost & trust
0: Content Topology
1: Data load
1 a: Replicate
1 b: MD 5 export
2: Determine MD 5 s* . . . running fixity service
3: Compare & Report
0: Trust vs. Cost Trust in. . . – Underlying storage providers – Dura. Cloud and opensource software – Requester of service (administrator)
1: Trust vs. Cost Three approaches: – Request stored value • [inexpensive & fast] – Stream out content & re-calculate • [compute intensive & slow] – Stream out content & re-calculate with salt • [user intensive, compute intensive & slow]
2: Determine MD 5 s* Options for providing expected MD 5 With initial listing After MD 5 calculation
2 a: MD 5 at non-primary Additional cost of processing content not local to compute
Next steps • Scalability – MD 5 calculation across Hadoop cluster • Multi-administration efficiency – On-demand compute at secondary provider • Event logging
Thank you Requesting comments & review https: //wiki. duraspace. org/display/duracloud/Fixity+Service http: //duracloud. org https: //wiki. duraspace. org/display/duracloud/Dura. Cloud https: //svn. duraspace. org/duracloud/trunk/
- Dura cloud
- Dura cloud
- Dura cloud
- Data-data monitoring lingkungan merupakan gambaran dari
- Entity integrity ensures correctness of the data in a table
- Data integrity constraints are used to:
- Microsoft access laboratory information management system
- Hadoop io
- Maintaining
- Kick off briefing
- Processing integrity controls
- Security level
- Data integrity means
- Fda data integrity presentation
- Laboratory ethics and data integrity
- Laboratory ethics and data integrity
- Hmda cra