English language subject headings in the catalog of the National Library of Israel
Elhanan Adler
elhanana@savion.huji.ac.il
Background
• Subject retrieval at the NLI has traditionally been based on a classified catalog, using a system that mixes Dewey Classification, Scholem Classification (an expansion of Dewey for Judaica), UDC and descriptive text (including many synonyms)
• This system is unique to the NLI, and much effort was expended on its upkeep
• Most Israeli libraries use alphabetic subject headings (either LCSH or Hebrew)
• The (closed) stacks of the NLI are arranged by inventory number; the reading rooms have their own shelving systems
Examples of the current system
001.091.767.1(08)  Knowledge (History, description, critical appraisal of intellectual activity in general) - Muslim, Islamic, Arab countries - Collections, congresses
616(01)(09)(P)  Human pathology; Clinical medicine; Diseases; Illnesses Works of historic value; Sources
296.52  Halakha and poseqim before Moshe ben Maimon (Maimonides), Geonic literature and responsa, tshuvot ha-geonim
296.562.22(42)  Shabbat - Halakha - Eruv hatserot (eruv of courtyards), eruv tehumin (eruv of boundaries) - England, Great Britain
E 114(020.8)  Yerushalayim, Jerusalem - History and description - Audiovisual treatment including video cassettes
Example of the classified catalog
Not all entries had supplementary text
The major problems
• The system is not known or understood outside the Library
• Today, most users of the NLI catalog are remote users
• Other libraries cannot use this data in their cataloging
• The NLI cannot make use of subject data from other libraries
• High maintenance cost
The logical conclusion
• The NLI should switch to a subject retrieval system understandable to all users, particularly remote users
• A widely used system which does not require local development should be used
• The system chosen should allow transfer of subject data to and from the NLI
Why LCSH?
• Most widely used system in Israeli academic libraries
• Most widely used system in academic libraries in the English-speaking world
• Scholars worldwide are familiar with the system
• Use of LCSH will enable the NLI to receive and provide usable subject data
Why not Hebrew?
• Creation of a Hebrew subject thesaurus for a large research library (even based on existing thesauri) is a major, expensive professional task and would take years
• It is unlikely other Israeli academic libraries would adopt it (another unique system!)
• It would not serve non-Hebrew speaking scholars (the Library has a universal mission)
• But there is still a need for Hebrew subject access – particularly for the non-academic segment of the NLI’s local user community
Isn’t LCSH passé?
• Calhoun report (2006): “Abandon the attempt to do comprehensive subject analysis manually with LCSH in favor of subject keywords; urge LC to dismantle LCSH”
• But LC has decided to continue LCSH with some simplifications: “Following the Calhoun report, LC did an analysis and determined we would continue with precoordinated subject strings (and LCSH for the foreseeable future) complemented by social tagging when that becomes feasible in our firewall environment.” (communication from Barbara Tillett, Feb. 2010)
• When change does come, presumably there will be LCSH-based conversion tools
• The NLI can lead Israeli libraries into future systems only if it starts out together with them
How do we get from here to there?
• Matching NLI records against the LC MARC database (to 2006) held by the MALMAD Consortium [after 2006 MALMAD ceased purchasing LC data and joined OCLC]
• Matching NLI records against Israel ULI Union List records from LCSH-using libraries
• Creating conversion tables from NLI classification to LCSH
• 80+% of incoming data from the major card conversion project (outsourced to Backstage Library Works) contains LCSH
• Not uniform but…
• “Quick and dirty” with ongoing cleanup procedures
Example of conversion table
$$a296(07) = 650 $$aJudaism$$xStudy and teaching
$$a296.01 = 650 $$aJudaism$$xBibliography
$$a296.01(020.8) = 650 $$aJudaism$$xBibliography @@ 655 $$aAudio-visual
$$a296.01(04) = 650 $$aJudaism$$xBibliography
$$a296.01(05) = 650 $$aJudaism$$xBibliography$$vPeriodicals
$$a296.01(06) = 650 $$aJudaism$$xBibliography$$xSocieties, etc.
$$a296.01(07) = 650 $$aJudaism$$xBibliography$$xStudy and teaching
$$a296.01(08) = 650 $$aJudaism$$xBibliography
$$a296.01(09) = 650 $$aJudaism$$xBibliography$$xHistory
$$a296.012[100] = 650 $$aJewish philosophy$$vBibliography @@ 650 $$aJewish philosophers$$vBibliography
$$a296.012[400] = 650 $$aJewish linguists$$vBibliography
$$a296.012[530] = 650 $$aJewish scientists$$vBibliography @@ 650 $$aPhysicists$$vBibliography
$$a296.012[574] = 650 $$aJewish scientists$$vBibliography @@ 650 $$aBiologists$$vBibliography
$$a296.012[800] = 650 $$aJewish authors$$vBibliography
$$a296.012[933.09] = 650 $$aJewish historians$$vBibliography
$$a296.014 = 650 $$aAnonyms and pseudonyms, Hebrew @@ 650 $$aAnonyms and pseudonyms, Jewish
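A minimal sketch of how a table like this could be read and applied, not the NLI's actual program. It assumes one rule per line, with the classification string on the left of " = " and one or more MARC headings on the right separated by "@@". The function names `parse_rules` and `headings_for` are hypothetical.

```python
# Hypothetical sketch: the rule syntax is inferred from the slide,
# not taken from the NLI's conversion programs.
RULES_TEXT = """\
$$a296(07) = 650 $$aJudaism$$xStudy and teaching
$$a296.01(020.8) = 650 $$aJudaism$$xBibliography @@ 655 $$aAudio-visual
$$a296.012[530] = 650 $$aJewish scientists$$vBibliography @@ 650 $$aPhysicists$$vBibliography
"""


def parse_rules(text):
    """Map each classification string to a list of (tag, subfields) pairs."""
    rules = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        left, right = line.split(" = ", 1)
        left = left.strip()
        if left.startswith("$$a"):
            left = left[3:]  # drop the subfield code, keep the class number
        headings = []
        for part in right.split("@@"):  # "@@" separates multiple headings
            tag, subfields = part.strip().split(" ", 1)
            headings.append((tag, subfields.strip()))
        rules[left.strip()] = headings
    return rules


def headings_for(class_number, rules):
    """Return the headings for a class number, or an empty list if unmapped."""
    return rules.get(class_number, [])


rules = parse_rules(RULES_TEXT)
for tag, subfields in headings_for("296.012[530]", rules):
    print(tag, subfields)
```

An exact-match lookup keeps the behaviour predictable: a classification string with no rule simply yields no headings, which leaves the record for a later pass.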
Progress in 1.5 years
• Jan. 2010 – 50,000 NLI records with LCSH
• June 2010 – 500,000 NLI records with LCSH (over 50% of relevant records)
• June 2011 – 1,127,000 NLI records with LCSH (over 90% of relevant records)
Authority control
• NLI purchased the retrospective LCSH authority file (370,000 records) from LC
• Set up as a separate ALEPH authority file controlling the NLI’s subject headings
• General policy – make minimal changes and only in the NLI’s core collection areas (Judaica & Israel)
• Add xrefs from more common (in Israel) term
• Prefer Israeli standard term
• Add specific terms (kosher and/or ‘kosher-style’)
• Quantity of records being processed precluded going through SACO, etc.
Special considerations/challenges/problems
Personal and corporate subjects
• NLI already has 600 & 610 subjects – but in all 4 scripts, and usually without subdivision
• English/romanized 600s and 610s are added when found, but this is not a major goal of the current project
• Inconsistent romanization (LC vs. Academy of the Hebrew Language)
• Future linking of personal and corporate names in various scripts (a separate project) will improve cross-alphabet searching
Uniform title subjects
• Follow standard Israeli usage: Uniform Headings in Judaica authority file
• e.g. Talmud Bavli and Talmud Yerushalmi rather than Talmud and Talmud Yerushalmi
http://aleph3.libnet.ac.il/F/?func=file&file_name=find-b&local_base=uhj10
Theological issues - Bible
• Bible used for Jewish Bible (Tanakh)
• Bible. O.T. changed to Bible.
• O.T. Apocrypha changed to Apocrypha
• Bible. N.T. changed to stand-alone New Testament
• Requires parallel series of subjects: … in the New Testament; … $xNew Testament teaching, etc.
Other theological issues
• B.C./A.D. or B.C.E./C.E.?
• Personal subjects: Jesus, Mary, Muhammad, saints, etc.
Political issues: Israel/Palestine
• LC policy: Israel = 1948/1967 borders
• Pre-1948 = Palestine
• Post-1948 area of British mandate (Israel + West Bank) = Palestine
• Not to be used for post-Oslo Palestinian Authority
• “680 __ |i Here are entered works on the region on the eastern coast of the Mediterranean Sea that in ancient times was called the Land of Canaan, later the kingdoms of Israel and Judah, and in modern times comprises the entire state of Israel, as well as the various disputed territories. Works on a specific jurisdiction or territory within this region are entered
However
Existing Israeli solutions
• Follow LC
• Israel for all periods and all borders (just as Persia = Iran)
• Israel (State) vs. Israel
• Israel and Eretz Israel (NLI choice)
• Problems distinguishing between periods and geographic boundaries: Antiquities; Geography, geology, etc.; Overlapping periods (20th century)
Other geo-political issues
• West Bank or “Judea and Samaria”
• Jerusalem (Israel) -- Israel -- Jerusalem
• Nablus or Shechem (and other multiple place name issues)
Historical subjects
• Israel -- History -- War of Independence, 1948-1949 (not: Israel-Arab War, 1948-1949)
• Six Day War, 1967 (not: Israel-Arab War, 1967)
• Yom Kippur War, 1973 (not: Israel-Arab War, 1973)
• Operation Peace for Galilee, 1982-1985 (not: Lebanon -- History -- Israeli intervention, 1982-1985)
Uniquely Israeli topics
• Immigrant absorption
• Pre-army yeshivot
• Yeshivot hesder
• Bagrut examinations (Matriculation examinations)
• Aliyah/Yeridah (much more than Emigration and immigration), also:
  Aliyah, 1st (1882-1903)
  Aliyah, 2nd (1904-1914)
  Aliyah, 3rd (1919-1923)
  Aliyah, 4th (1924-1929)
  Aliyah, 5th (1929-1939)
  Aliyah Bet (1933-1948)
Extension/expansion of LCSH terms
• Judaism – [Moroccan, Tunisian, etc.] rite
• [subject] (Jewish law)
• Chronological subdivisions under Hasidism, Cabala, Jewish philosophy, etc.
• More groups of Hasidim
Red = LCSH, Black = NLI
Alexander Hasidim, Amshinov Hasidim, Belz Hasidim, Biala Hasidim, Bobov Hasidim, Boyan Hasidim, Bratslav Hasidim, Buhusi Hasidim,
Chernobyl Hasidim, Chortkov Hasidim, Grodzhisk Hasidim, Gur Hasidim, Husiatyn Hasidim, Izbica Hasidim, Kaliv Hasidim, Karlin Hasidim,
Komarno Hasidim, Kozhnitz Hasidim, Kretshniff Hasidim, Kuzmir Hasidim, Lelov Hasidim, Modzitz Hasidim, Mogielnica Hasidim, Munkacs Hasidim,
Nadvorna Hasidim, Piaseczno Hasidim, Premishlan Hasidim, Prshischa Hasidim, Rachmastrivka Hasidim, Radomsk Hasidim, Ruzhin Hasidim, Sadigura Hasidim,
Satmar Hasidim, Sighet Hasidim, Skulen Hasidim, Skvira Hasidim, Slonim Hasidim, Sochaczew Hasidim, Spinka Hasidim, Stefanesti Hasidim,
Toldot Aharon Hasidim (LCSH = Guardian-of-the-Faithful Hasidim), Tolna Hasidim, Tosh Hasidim, Vizhnitz Hasidim, Zanz Hasidim, Zidichov Hasidim, Zvolin Hasidim
Problem of subjects which require Jewish + general subject
• Jewish … education
• Jewish scientists + Physicists, etc.
Standard 667 (internal) notes
• Changed to standard Israeli form
• Cross references added by NLI
• Heading created by NLI
• Heading changed by NLI
Use of Genre/Form headings
• Primarily in conversion from Dewey literature numbers
• 655_7 with $2 local
• Example: Polish drama, Polish essays, Polish fiction, Polish letters, Polish poetry, Polish prose literature, Polish wit and humor
• Also: Audio-visual, Israeli themes in literature, Jewish themes in literature
Ongoing cleanup
• Create initial files of valid main terms ($a) and subfields (vxyz) from the LC LCSH file
• Add terms & subfields as validated
• Run test against all 630/651 2nd indicator 0/4 (existing records and new records being added)
• Report errors & automatically fix many
• Group of programs and data files
LCSH options
 1 - Retest LCSH records
 2 - Run nnl01_fix_lcsh on lcsh_retest.dat creating rep.dat
 3 - Edit 082_to_lcsh.dat
 4 - Edit lcsh_subfielda.dat
 5 - Edit 082_to_lcsh_subdivisions.dat
 6 - Run nnl01_lcsh_subfield_clash_retest
 9 - Run nnl01_match_082_to_lcsh
11 - Edit nnl01_fix_lcsh.dat
12 - Edit vxyz_subfields.dat
15 - Edit nnl01_lcsh_test.dat
16 - Edit israel_eretz_israel.dat
17 - Edit nnl01_lcsh_test_ok.dat
19 - Run nnl01_drop_lcsh on safety creating rep.dat
22 - Run nnl01_reselect_no_lcsh
23 - Drop app.dat records from nnl01_no_lcsh.dat
30 - Retrieve lcsh.toload from ram1 and process to app.dat
35 - Make ram1:lcsh.650 from scratch file safety
40 - match nnl01_no_lcsh.dat to nnl6xx_bib_fields.dat
41 - match nnl01_no_lcsh.dat to nnl6xx_bib_fields2.dat
50 - retest 650 0/650 4 subfield a against NNL15 (for lcsh_fix_a routine)
51 - Edit lcsh_fix_a.dat
52 - Run lcsh_fix_a
99 - Pretest datafile ram2 before adding to 082_to_lcsh.dat
Correct a to b
$$aBible.$$pApocrypha. = $$aApocrypha.
$$aBible.$$pN.T.$$ = $$aNew Testament$$
$$aBible.$$pN.T.$$p = $$aNew Testament.$$p
$$aBible.$$pO.T.$$ = $$aBible$$
$$aBible.$$pO.T.$$p = $$aBible.$$p
$$aYehuda weShomeron$$ = $$aWest Bank$$
$$xAddresses, essays, lectures = ~
$$zYehuda weShomeron = $$zWest Bank

Correct $$aIsrael/$$aEretz Israel based on string in subfield
E $$xAntiquities
…
E $$y18
E $$y190
…
E $$yTalmudic
E $$yWorld War
I $$y195
I $$y196
I $$y197
…
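The two kinds of rule on this slide can be sketched roughly as follows. This is an illustration under stated assumptions, not the NLI's `nnl01_fix_lcsh` code. It assumes subfields are written without spaces (`$$aBible.$$pO.T.`), that "~" means "delete the subfield", and that "E"/"I" stand for Eretz Israel/Israel. Rules ending in a bare "$$" are left out because their exact matching semantics are not stated. All names below are hypothetical.

```python
# Hypothetical reconstruction of the two rule types shown on the slide.
REPLACEMENTS = [
    ("$$aBible.$$pApocrypha.", "$$aApocrypha."),
    ("$$aBible.$$pN.T.$$p", "$$aNew Testament.$$p"),
    ("$$aBible.$$pO.T.$$p", "$$aBible.$$p"),
    ("$$zYehuda weShomeron", "$$zWest Bank"),
    ("$$xAddresses, essays, lectures", ""),  # "~" read as "delete" (assumption)
]

# Marker string found anywhere in the field -> heading to use in $$a.
ERA_MARKERS = [
    ("$$xAntiquities", "Eretz Israel"),
    ("$$y18", "Eretz Israel"),
    ("$$y190", "Eretz Israel"),
    ("$$yTalmudic", "Eretz Israel"),
    ("$$yWorld War", "Eretz Israel"),
    ("$$y195", "Israel"),
    ("$$y196", "Israel"),
    ("$$y197", "Israel"),
]


def apply_replacements(field):
    """Apply each 'a = b' string correction in order."""
    for old, new in REPLACEMENTS:
        field = field.replace(old, new)
    return field


def choose_israel_heading(field):
    """Set $$aIsrael or $$aEretz Israel according to the first matching marker."""
    for marker, heading in ERA_MARKERS:
        if marker not in field:
            continue
        for current in ("$$aEretz Israel", "$$aIsrael"):
            rest = field[len(current):]
            # Only rewrite when $$a is exactly Israel / Eretz Israel.
            if field.startswith(current) and (rest == "" or rest.startswith("$$")):
                return "$$a" + heading + rest
        return field
    return field


print(apply_replacements("$$aBible.$$pO.T.$$pGenesis"))
print(choose_israel_heading("$$aIsrael$$xAntiquities"))
```

Keeping the rules in plain ordered lists mirrors the editable `.dat` files listed in the options menu: a cataloger can add a marker or a correction without touching the matching logic.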
Example – test of subfields:
000385128 Unknown subfield: $$v Bibligraphy at:
  000385128 650 4 L $$aJewish periodicals$$xIndexes$$vBibligraphy.
000273103 Unknown subfield: $$v Bibligrpahy at:
  000273103 650 0 L $$aCommunism$$xPeriodicals$$vBibligrpahy.
002808570 Unknown subfield: $$v Bibliographie at:
  002808570 650 0 L $$aJuifs$$xHistoire$$vBibliographie$$vCatalogs.
002617234 Unknown subfield: $$v Biografias at:
  002617234 650 4 L $$aEconomistas chilenos$$vBiografias.
002916818 Unknown subfield: $$v Biyografi at:
  002916818 650 4 L $$aIslam$$vBiyografi.
002997635 Unknown subfield: $$v Biyografi at:
  002997635 651 4 L $$aTurkey$$vBiyografi.
002204402 Unknown subfield: $$v Bubliography at:
  002204402 650 0 L $$aGeology$$zSoviet Union$$vBubliography.
002858437 Unknown subfield: $$v Chrestomathies and readers at:
  002858437 650 4 L $$aRussian language$$vChrestomathies and readers.
002975569 Unknown subfield: $$v Chrestomathies and readers at:
  002975569 650 4 L $$aGreek language$$vChrestomathies and readers.
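A report like this could be produced by checking each subfield value against a list of validated terms. The sketch below is illustrative only. The valid-term list is a made-up sample, the message format is simplified, and `unknown_subfields` is a hypothetical name rather than one of the NLI's programs.

```python
import re

# Hypothetical sample of validated $$v terms; the real list would come
# from the LC LCSH file plus locally validated additions.
VALID_TERMS = {"v": {"Bibliography", "Biography", "Catalogs", "Periodicals"}}

# A subfield is "$$" + one-character code + text up to the next "$".
SUBFIELD_RE = re.compile(r"\$\$([a-z0-9])([^$]*)")


def unknown_subfields(record_id, field, valid_terms):
    """Return one report line per subfield value missing from the valid list."""
    reports = []
    for code, value in SUBFIELD_RE.findall(field):
        allowed = valid_terms.get(code)
        if allowed is None:
            continue  # no list for this subfield code, so it is not checked
        term = value.strip().rstrip(".")  # ignore the field's final period
        if term not in {entry.rstrip(".") for entry in allowed}:
            reports.append(f"{record_id} Unknown subfield: $${code} {term}")
    return reports


for line in unknown_subfields(
    "000385128", "$$aJewish periodicals$$xIndexes$$vBibligraphy.", VALID_TERMS
):
    print(line)
```

Misspellings and non-English forms both fail the same lookup, which matches the mix of errors in the sample report; deciding which failures can be fixed automatically is a separate step.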
Ongoing & future plans
• Subjects are being sent to WorldCat as 650_4
• Continued mass loading from conversion projects
• Conversion of subject headings from NLI special collections (manuscripts, music, Scholem, others) to LCSH
• Migrate RAMBI to LCSH
• Begin work on Hebrew language search to LCSH headings
• Ultimately provide single search interface (Hebrew and English) to all NLI bibliographic data
• Introduction of LC classification for open shelf collections
תודה רבה / Thank you