Implementing the Genre and Medium of Performance Algorithm

- Slides: 21
Implementing the Genre and Medium of Performance Algorithm in a Local Catalog
Jeff Lyon (jeff_lyon@byu.edu) and Greg Reeve (greg_reeve@byu.edu), Brigham Young University
Process Roadmap
● Initial test run
● Assessment phase
● Local music cataloging procedures
● Record pull
● Pre-processing
● Run algorithm
● Post-processing
● Load changes
● Instrumentation search
Initial test run
● Extract all music MARC records
● Remove local fields
● Install the Music.Form.Genre tool
● Run the tool on the extracted records
● Verify output files
● Load into test ILS server
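The "Remove local fields" step can be sketched as follows. This is a minimal illustration, not the presenters' script: records are modeled as plain lists of (tag, value) pairs instead of parsed MARC, and treating 9XX tags as local is an assumption, since the slides do not say which fields were removed.

```python
def is_local_tag(tag):
    """Assumption: treat 9XX tags as local; adjust for your own catalog."""
    return tag.startswith("9")


def strip_local_fields(record, is_local=is_local_tag):
    """Return a copy of the record without the fields flagged as local."""
    return [(tag, value) for tag, value in record if not is_local(tag)]


# Illustrative record; the field values are made up for this example.
record = [
    ("245", "Sonata for viola and piano"),
    ("650", "Sonatas (Viola and piano)"),
    ("949", "local item data"),
]
cleaned = strip_local_fields(record)
print(cleaned)
```

Passing a different `is_local` predicate keeps the filtering rule separate from the loop, so a site-specific list of local tags can be swapped in without touching the function.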
Algorithm output files
Music.Form.Genre.20200204.212205.Nothing.Happened.mrc
Music.Form.Genre.20200204.212205.Nothing.Happened.txt
Music.Form.Genre.20200204.212205.Records.Changed.Message.mrc
Music.Form.Genre.20200204.212205.Records.Changed.Message.txt
Music.Form.Genre.20200204.212205.Records.Changed.OK.Delete.Reason.mrc
Music.Form.Genre.20200204.212205.Records.Changed.OK.Delete.Reason.txt
Music.Form.Genre.20200204.212205.Records.Changed.OK.mrc
Music.Form.Genre.20200204.212205.Records.Changed.OK.txt
Music.Form.Genre.20200204.212205.Report.txt
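When verifying a run, it can help to group the output files by category. The sketch below assumes the names follow the pattern prefix.YYYYMMDD.HHMMSS.Category.ext, which is inferred from the file list on this slide and is not documented tool behavior; the function names are hypothetical.

```python
import re

# Assumed pattern: <prefix>.<YYYYMMDD>.<HHMMSS>.<Category>.<mrc|txt>
NAME_RE = re.compile(
    r"^(?P<prefix>.+?)\.(?P<date>\d{8})\.(?P<time>\d{6})"
    r"\.(?P<category>.+)\.(?P<ext>mrc|txt)$"
)


def parse_output_name(name):
    """Split an output file name into prefix, date, time, category, and extension."""
    match = NAME_RE.match(name)
    if match is None:
        raise ValueError(f"unexpected file name: {name}")
    return match.groupdict()


def group_by_category(names):
    """Map each category to the list of extensions found for it."""
    groups = {}
    for name in names:
        info = parse_output_name(name)
        groups.setdefault(info["category"], []).append(info["ext"])
    return groups


names = [
    "Music.Form.Genre.20200204.212205.Records.Changed.OK.mrc",
    "Music.Form.Genre.20200204.212205.Records.Changed.OK.txt",
    "Music.Form.Genre.20200204.212205.Report.txt",
]
groups = group_by_category(names)
print(groups)
```

A category that has a `.txt` file but no `.mrc` file (or the reverse) is then easy to spot before loading anything into a test server.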
Evaluating initial run output files
● Records.Changed.OK: 212,544
● Records.Changed.Message: 76,126
● Records.Changed.OK.Delete.Reason: 5,395
● Nothing.Happened: 51,694
● Problems: 0
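To put these counts in proportion, the short calculation below sums them and prints each category's share. It assumes the five counts pair with the five categories in the order shown on the slide; the total comes to 345,759.

```python
# Counts from the slide, paired with categories in the order listed (assumption).
counts = {
    "Records.Changed.OK": 212_544,
    "Records.Changed.Message": 76_126,
    "Records.Changed.OK.Delete.Reason": 5_395,
    "Nothing.Happened": 51_694,
    "Problems": 0,
}

total = sum(counts.values())
shares = {name: n / total for name, n in counts.items()}

print(f"Total: {total:,}")
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```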
Assessment phase
● Manual record verification
● Heading analysis
  ○ 650 heading report/creating the spreadsheet
  ○ Actual changes and potential future cleanup
Assessment phase: Heading analysis
● Extract 650 and 655 headings
● Generate a unique list of headings with usage count
● Generate dummy MARC records for each heading
● Run dummy MARC records through the musicformgenre tool
● Programmatically analyze the output for each heading
● Generate a spreadsheet with each heading and usage count combined with basic checks about the heading output
● Review the top 200 headings and their output
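The first three steps above (extract headings, count unique headings, build one dummy record per heading) can be sketched as follows. This is an illustration under stated assumptions, not the code from the presenters' repository: records are modeled as lists of (tag, subfields) tuples instead of parsed MARC, and all function names are hypothetical.

```python
from collections import Counter


def count_headings(records, tags=("650", "655")):
    """Count how often each distinct heading (tag plus subfields) occurs."""
    counts = Counter()
    for record in records:
        for tag, subfields in record:
            if tag in tags:
                # Keep subfield codes in the key so dummy records preserve them.
                counts[(tag, tuple(subfields))] += 1
    return counts


def display_form(subfields):
    """Join subfield values with '--' for a spreadsheet-friendly label."""
    return "--".join(value for _code, value in subfields)


def dummy_record(tag, subfields):
    """Build a minimal one-field record for a single heading."""
    return [(tag, list(subfields))]


# Illustrative input only.
records = [
    [("650", [("a", "Songs with piano")])],
    [
        ("650", [("a", "Operas"), ("v", "Excerpts")]),
        ("650", [("a", "Songs with piano")]),
    ],
]

counts = count_headings(records)
for (tag, subfields), n in counts.most_common():
    print(tag, display_form(subfields), n)

dummies = [dummy_record(tag, subfields) for (tag, subfields) in counts]
```

Running each dummy record through the tool once, then joining the result back to `counts`, means a heading used tens of thousands of times only has to be analyzed a single time.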
Assessment phase: Heading analysis
● Songs with piano: 22,899
● Operas--Excerpts: 8,149
● Symphonies: 8,008
● Organ music: 6,608
● Orchestral music: 6,137
● Christmas music: 6,036
● Sacred music: 5,910
● Harp music: 5,249
● Jazz: 4,804
● Songs (High voice) with piano: 4,152
● Motion picture music--Excerpts: 4,119
● Sonatas (Piano): 3,843
● Long-playing records: 80,896
● 78 rpm records: 32,062
● Compact discs: 27,831
● 45 rpm records: 6,932
Assessment phase: Heading analysis
Required cleanup
● Flip
● Lost demographic
● Misc
  ○ Chamber music
  ○ Nothing produced
  ○ Canons, fugues, etc.
Potential future cleanup
● No 382
● No genre
● No useful genre
● Data errors
Assessment phase: Stats
Local music cataloging rule set
Pre-run steps
● Extract music MARC records
● Pre-processing on extracted records
● Problematic metadata
  ○ Viola and music|v. Scores and parts.
  ○ Crashed the program
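One way to keep problematic metadata from crashing a run is to set suspect records aside during pre-processing. The sketch below is a generic illustration, not the presenters' method: the slides do not say what about the heading caused the crash, so the fragment list, the record shape, and the function names are all assumptions.

```python
def partition_records(records, is_problematic):
    """Split records into (safe, held) according to a caller-supplied check."""
    safe, held = [], []
    for record in records:
        (held if is_problematic(record) else safe).append(record)
    return safe, held


def contains_fragment(fragments):
    """Build a check that flags a record if any field value contains a fragment."""
    def check(record):
        return any(frag in value for _tag, value in record for frag in fragments)
    return check


# Hypothetical fragment list; fill in whatever strings are known to cause trouble.
bad_fragments = ["|v. Scores and parts."]

records = [
    [("650", "Symphonies")],
    [("650", "Viola and music|v. Scores and parts.")],
]
safe, held = partition_records(records, contains_fragment(bad_fragments))
print(len(safe), "safe;", len(held), "held for manual review")
```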
Run algorithm
Post-processing & load changes
● Add 583 tag
  ○ Identify modified records
  ○ Track the category of change made
● Remediation based on our heading analysis
● Coordinate with Library IT to load records
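Tagging modified records with a 583 (Action Note) field might look like the sketch below. The slides do not show which subfields or wording were used, so the choices here ($a for the action, $c for the date, $x for the change category) and the note text are assumptions, and records are again modeled as plain lists instead of parsed MARC.

```python
def add_action_note(record, category, date):
    """Return the record with a 583 field appended, unless it is already present."""
    note = (
        "583",
        [
            ("a", "Genre/medium algorithm applied"),  # hypothetical wording
            ("c", date),
            ("x", category),
        ],
    )
    if note in record:
        return record
    return record + [note]


record = [("650", [("a", "Symphonies")])]
tagged = add_action_note(record, "Records.Changed.OK", "20200204")
tagged_again = add_action_note(tagged, "Records.Changed.OK", "20200204")
print(tagged[-1])
```

Storing the output category in the note makes it possible to pull, for example, only the records from the "Records.Changed.Message" file for later review.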
Code repository
https://tinyurl.com/byugenre
Instrumentation search
Lessons learned & future steps
Implementing the Genre and Medium of Performance Algorithm in a Local Catalog
https://tinyurl.com/byugenre
Jeff Lyon (jeff_lyon@byu.edu) and Greg Reeve (greg_reeve@byu.edu), Brigham Young University