One step at a time Laying the groundwork

  • Slides: 37
Download presentation
One step at a time: Laying the groundwork for Linked Data with URIs Sonoe

One step at a time: Laying the groundwork for Linked Data with URIs Sonoe Nakasone and Dawn Pearce North Carolina State University Libraries NCSU LIBRARIES

Shout outs NCSU LIBRARIES

Shout outs NCSU LIBRARIES

Co-Presenter Dawn Pearce IT Support Specialist, Information Technology Department Project team member and sirsi

Co-Presenter Dawn Pearce IT Support Specialist, Information Technology Department Project team member and sirsi technical lead pdpearce@ncsu. edu NCSU LIBRARIES

Teammates Lynn Whittenberger Member, monographs domain expert Lisa Madden Member, education lead NCSU LIBRARIES

Teammates Lynn Whittenberger Member, monographs domain expert Lisa Madden Member, education lead NCSU LIBRARIES Jennifer Solomon Project manager Kristen Wilson Member, serials domain expert

URI $0 http: //id. loc. gov/authorities/names/n 50024879 $0 https: //viaf. org/viaf/7386077/ $0 http: //id.

URI $0 http: //id. loc. gov/authorities/names/n 50024879 $0 https: //viaf. org/viaf/7386077/ $0 http: //id. loc. gov/authorities/subjects/sh 85009808 URI $0 http: //id. loc. gov/authorities/names/n 85819387 NCSU LIBRARIES URI

Linked Data Retrieved 1/2/2018: https: //www. w 3. org/Design. Issues/Linked. Data. html NCSU LIBRARIES

Linked Data Retrieved 1/2/2018: https: //www. w 3. org/Design. Issues/Linked. Data. html NCSU LIBRARIES

URI Uniform Resource Identifier Example of a URI: https: //www. wikidata. org/wiki/Q 270715 (Maya

URI Uniform Resource Identifier Example of a URI: https: //www. wikidata. org/wiki/Q 270715 (Maya Angelou visits YCP! 2/4/13. [Image] York College ISLGP. Retrieved from Flickr, https: //www. flickr. com/photos/65767546@N 08/8449738207/ NCSU LIBRARIES

PCC URI Task Group Some examples. . . Columbia University: uris added to small

PCC URI Task Group Some examples. . . Columbia University: uris added to small set of records Harvard University: adding uris in small pilots Cornell University: uris added for FAST headings Stanford University: added uris for LC NAF, VIAF, and ISNI to authority records through BSLW. Plans to use BSLW’s service to add LC NAF URIs to $0 in bibliographic data as well. George Washington University: added over 4 M uris; addition of $0 and $4 integrated into daily cataloging routine PCC Task Group on URIs in MARC Year 2 report to Po. Co, October 2017: https: //www. loc. gov/aba/pcc/bibframe/Task. Groups/PCC_URI_TG_20170415_Report. pdf NCSU LIBRARIES

Goals of our project Investigate two methods of inserting URIs into MARC (Marc. Edit

Goals of our project Investigate two methods of inserting URIs into MARC (Marc. Edit vs Sirsi. API) Educate department on URIs in relation to linked data Create test set of MARC records enhanced with URIs NCSU LIBRARIES

Goal 1: Investigation NCSU LIBRARIES

Goal 1: Investigation NCSU LIBRARIES

Method (Marc. Edit) The importance of identifiers in the new web environment and using

Method (Marc. Edit) The importance of identifiers in the new web environment and using the uniform resource identifier (URI) in subfield zero ($0): A small step that is actually a big step Shieh & Reese (2015) Marc. Edit MARCNext: Linked Records Tool Reese (2014) NCSU LIBRARIES

Method (SIRSI API) NCSU LIBRARIES

Method (SIRSI API) NCSU LIBRARIES

Personal name authority Main entry (100): Angelou, Maya uri: http: //id. loc. gov/authorities/names/n 50024879.

Personal name authority Main entry (100): Angelou, Maya uri: http: //id. loc. gov/authorities/names/n 50024879. html Added personal entry (700): Basquiat, Jean-Michel uri: http: //id. loc. gov/authorities/names/n 85819387. html Boyers, Sara Jane uri: http: //id. loc. gov/authorities/names/n 92117283. html NCSU LIBRARIES

MARC record NCSU LIBRARIES

MARC record NCSU LIBRARIES

Bibliographic requirements ❏ Authorized/validated bibliographic tag in the marc record (1 xx, 7 xx,

Bibliographic requirements ❏ Authorized/validated bibliographic tag in the marc record (1 xx, 7 xx, etc) ❏ Library of Congress Control Number (LCCN) in the 010 of the authority record ❏ Marc policy setup for storing URI (usually in $0) NCSU LIBRARIES

Toolbox ❏ Sirsi. Dynix Symphony APIs ❏ Custom Perl scripts ❏ Text editor/processing (sed/awk)

Toolbox ❏ Sirsi. Dynix Symphony APIs ❏ Custom Perl scripts ❏ Text editor/processing (sed/awk) NCSU LIBRARIES

Getting $ to make $ Perl script is used to dump authority controlled tags

Getting $ to make $ Perl script is used to dump authority controlled tags for processing, with authority key Bib key | Tag | Indicator | Subfields (separated by “”) Subfield “=” stores authority key (hidden link to authority record) Subfield “? ” indicates an unauthorized authority NCSU LIBRARIES

Getting $ to make $ Perl script is used to dump authority controlled tags

Getting $ to make $ Perl script is used to dump authority controlled tags for processing, with authority key Bib key | Tag | Indicator | Subfields (separated by “”) Subfield “=” stores authority key (hidden link to authority record) Subfield “? ” indicates an unauthorized authority NCSU LIBRARIES

Getting $ to make $ Perl script is used to dump authority controlled tags

Getting $ to make $ Perl script is used to dump authority controlled tags for processing, with authority key Bib key | Tag | Indicator | Subfields (separated by “”) Subfield “=” stores authority key (hidden link to authority record) Subfield “? ” indicates an unauthorized authority NCSU LIBRARIES

The authority on the matter Angelou, Maya (authkey: 49248) NCSU LIBRARIES

The authority on the matter Angelou, Maya (authkey: 49248) NCSU LIBRARIES

The authority on the matter (con’t) Basquiat, Jean-Michel (authkey: 356782) NCSU LIBRARIES

The authority on the matter (con’t) Basquiat, Jean-Michel (authkey: 356782) NCSU LIBRARIES

And. . . more $$ fields, y’all (con’t) Using perl, again, to get 010

And. . . more $$ fields, y’all (con’t) Using perl, again, to get 010 from Symphony auth record, with a little extra thrown in to build the URI string. NCSU LIBRARIES

And. . . more $$ fields, y’all (con’t) Using perl, again, to get 010

And. . . more $$ fields, y’all (con’t) Using perl, again, to get 010 from Symphony auth record, with a little extra thrown in to build the URI string. NCSU LIBRARIES

And. . . more $$ fields, y’all (con’t) Using perl, again, to get 010

And. . . more $$ fields, y’all (con’t) Using perl, again, to get 010 from Symphony auth record, with a little extra thrown in to build the URI string. NCSU LIBRARIES

And. . . more $$ fields, y’all (con’t) Merged auth and bib files, formatted

And. . . more $$ fields, y’all (con’t) Merged auth and bib files, formatted for load, including unauthorized tag. Merge file: NCSU LIBRARIES

And. . . more $$ fields, y’all (con’t) Merged auth and bib files, formatted

And. . . more $$ fields, y’all (con’t) Merged auth and bib files, formatted for load, including unauthorized tag. Merge file: NCSU LIBRARIES

And. . . more $$ fields, y’all (con’t) The bibliographic record update: ● Match

And. . . more $$ fields, y’all (con’t) The bibliographic record update: ● Match for update will be made with bib key (internal key in “. 1003. ”). ● Some matching for duplication does occur, but incoming must match existing tag in its entirety. ● Because no tags will match the existing tag (there’s new data!), ALL existing tags must be deleted and recreated from incoming source. ● This is why all unauthorized tags must be preserved and reloaded. NCSU LIBRARIES

Marc record: Result NCSU LIBRARIES

Marc record: Result NCSU LIBRARIES

Findings Marc. Edit Sirsi api ● ● Tags need to be deleted then recreated

Findings Marc. Edit Sirsi api ● ● Tags need to be deleted then recreated Cannot process unauthorized headings Not all authority records have LCCN Only one vocabulary source for URIs NCSU LIBRARIES ● ● ● 12 hours to process ~24, 000 records Simultaneous processing possibly faster Would take 5. 5 months to process 2. 6 million records in catalog

Outcomes Goal 3: Create a set of MARC records with URIs Generated new questions

Outcomes Goal 3: Create a set of MARC records with URIs Generated new questions Generated ideas for new projects NCSU LIBRARIES

Goal 2: Education NCSU LIBRARIES

Goal 2: Education NCSU LIBRARIES

Method NCSU LIBRARIES

Method NCSU LIBRARIES

Acquisitions & Discovery NCSU LIBRARIES

Acquisitions & Discovery NCSU LIBRARIES

Outcomes Well received by our department staff Better understanding of uris and linked data

Outcomes Well received by our department staff Better understanding of uris and linked data Generated interest in learning more about Marc. Edit NCSU LIBRARIES

Regrets Photo by Teodor Bjerrang. https: //unsplash. com/photos/-o. GSMD 7 D NCSU LIBRARIES

Regrets Photo by Teodor Bjerrang. https: //unsplash. com/photos/-o. GSMD 7 D NCSU LIBRARIES

Future NCSU LIBRARIES

Future NCSU LIBRARIES

Thank you! Dawn Pearce pdpearce@ncsu. edu Sonoe Nakasone stnakaso@ncsu. edu NCSU LIBRARIES

Thank you! Dawn Pearce pdpearce@ncsu. edu Sonoe Nakasone stnakaso@ncsu. edu NCSU LIBRARIES