Data Integrity the Unisa Library experience 14 16
Data Integrity: the Unisa Library experience 14 -16 November 2011, North-west University (Potchefstroom Campus) Modiehi Rammutloa IR Quality Reporting Unisa Library
What is the fuss about Data Integrity?
What is data integrity Data integrity implies that the data system, the process and the content of the data are reliable, consistent and accurate Sen-Yoni Musingo (2008) Data Integrity is essential in order for data to be considered credible Data quality is a perception or an assessment of data’s fitness to serve its intended purpose in a given context. www. searchdatamanagement. techtarget. com
Aspects of data integrity
Data integrity unpacked o Accuracy - Closeness of measurement to the expected value. Accuracy can be achieved if data is clean and precise – mechanisms to detect & correct (EDCS) – Business rules (eliminate duplication) – 24/7 approach in data maintenance? ? – Checks and balances – Default values (using 0 – no empty field)
Data integrity unpacked o Consistency - Data as it is at any given point How is that achieved? - Standardization (agreement on processes) - Automation of processes (Special membership) - Back up systems?
Data integrity unpacked Reliability - consistency of measurement. Same results repeatedly. Can your data be trusted? Can it be achieved? – Timely – Security – Completeness
Causes of bad data o o o o Lack of data clean up Migration of systems Walk over technology System generated (uploaded data from vendors) Access rights - malicious modifications Manual operations Lack of standardization
Benefits of high quality data o o o Easy retrievability of information resources Accessibility of the most relevant information Customer satisfaction Cost reduction (staff time saved) Image of the Institution (e. g. High quality catalogue).
Data Governance structures o Millennium Working Group o Data Stewards (External Departments) o Data Integrity Steering Committees (Management Level)
Types of data at Unisa Patron data Procurement Unisa Library Item & Bib data course reserve Financial data
Where do we get data from? o o o HR Oracle Student system Millennium OCLC 3 rd party information providers and publishers
Database - Application level Finance M y Un I s a University estates Student system – 1. Applications 2. Registration 3. Study Material 4. Assignments 5. Examinations 6. Graduations HR Hemis Library Uniflow - routing Academic exam XMO E-mail AD External databases
Data and Information management model Library systems ICT’s domain Business domain Data marts a c c e s s Student, finance, HR Examinations, assignments users Data correction flow - Data cleansing projects -Data integrity ID actions - Data correction initiatives - Report to DISC Staging area
Challenges o o o o o Importance of data integrity Lack of training and ignorance Commitment from data owners Data ownership (Branches - Patron) Access rights (re-deployment of staff in different sections) No real time feedback (24 hours) Data corrected on Millennium is overridden Commitment from external departments Silos – databases all over the show
What have we got in place? o Headings report o URL checker o Database of non-compliances - Inventory Team & Cataloguers o ED Data Integrity Management Forum o Data Stewards Forum
Into the future o o o Solid monitoring and evaluation processes Identity management (University initiative) Standardization of data Validity checking system Data Audit trails and controls Data quality into Manager’s IPMS
Thank you! rammumw@unisa. ac. za (012) 429 2242
- Slides: 18