Managing a text-critical database of J. S. Bach's ‘Well-Tempered Clavier II’ with XML and relational database
Tsutomu Fujinami, Yo Tomita
Queen’s University Belfast; Japan Advanced Institute of Science and Technology
12/15/2021
Copyright (c) 2002 Yo Tomita and Tsutomu Fujinami. All rights reserved.

Introduction
- We are still unable to establish the final text of WTC II as intended by J. S. Bach.
- A text-critical database was compiled to ascertain both the origin and authorship of individual variants.
- The database contains about 5.5 MB of character-based information.

Three Questions
- [Data analysis] How can we measure and establish the relationship of sources?
- [Visualization] How can we describe the relationship of sources?
- [Data sharing] How can we share both the data and the results of analysis with other scholars?
Data and knowledge sharing via the Internet!

The Text-Critical Database
- The data was originally stored in a spreadsheet program called WINGZ.
Example: Prelude No. 1 in C major (BWV 870/1)

The items in the Text-Critical Database
- S/N is the serial number given to each examination point.
- Bar indicates the measure(s) in which the elements are examined.
- V, bt/pos stand for Voice, Beat and Position, respectively.
- Element specifies the target of enquiry.
- Spec. Loc gives a graphic representation of the information under examination.
- Classified suggests text-critical significance.

The table structure to store the data of variants in the relational database
The left two columns specify a particular manuscript. The third column corresponds to the S/N. The fourth column stores the values of the attribute.

Ms. ID  Mss         SN  Content
5       P 804       1   Prælude.
6       Scheibn. 4  1   Prælude.
7       P 1089      1   Prelude composeè par J. S. Bach
8       P 561       1   Prelude J. S. Bach.
9       Mem-Pre. 8  1   II). Praeludium con Fuga. ex. C dur di | Bach.
10      N. 10490    1   Præludio con Fuga.
13      Add. 35021  1   Prælude et Fuge /i {par} J. S. Bach; ovr on ''di''
15      Go. S. 312  1   Præludium 1. di J. S. Bach
17      P 210       1   Præludium. 1.
72      DD 70       1   Preludio di Bach

The table structure to store the meta-data of attributes in the relational database
The leftmost column lists the serial numbers. The other columns store the same properties as found in the source.

SN  Bar  Vt. Pos  Element  Spec. Loc  Classified
1   Title  a, d
2   1  t-s  a, d
3   1  R. H. Clef
4   1 -3  T(B)  tie(pitch)  ¬ _ | ± [c]  M-e
5   1 -3  B  voice  ¬ _ | ± [C]  M-a, e

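The two tables can be linked through the serial number. The following is a minimal sketch using Python's built-in sqlite3 module, not the project's actual schema: the table names (attributes, variants) and column names (ms_id, mss, sn, content, and so on) are assumptions chosen for illustration, and only a few rows from the slides are loaded.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE attributes (
    sn         INTEGER PRIMARY KEY,  -- serial number of the examination point
    bar        TEXT,
    element    TEXT,
    classified TEXT
);
CREATE TABLE variants (
    ms_id   INTEGER,                 -- numeric manuscript ID
    mss     TEXT,                    -- manuscript siglum
    sn      INTEGER REFERENCES attributes(sn),
    content TEXT                     -- reading found in that manuscript
);
""")

# One attribute row and three variant rows taken from the slides.
conn.execute("INSERT INTO attributes VALUES (1, NULL, 'Title', 'a, d')")
conn.executemany(
    "INSERT INTO variants VALUES (?, ?, ?, ?)",
    [
        (5, "P 804", 1, "Prælude."),
        (8, "P 561", 1, "Prelude J. S. Bach."),
        (72, "DD 70", 1, "Preludio di Bach"),
    ],
)

def readings_for(element):
    """Return (siglum, reading) pairs for every examination point of one element."""
    rows = conn.execute(
        """SELECT v.mss, v.content
           FROM variants AS v JOIN attributes AS a ON a.sn = v.sn
           WHERE a.element = ?
           ORDER BY v.ms_id""",
        (element,),
    )
    return rows.fetchall()

title_readings = readings_for("Title")
```

Keeping the meta-data in its own table means that each examination point is described once, however many manuscripts carry a reading for it.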
London autograph
- We selected the 'London autograph' as our reference.
- We created each movement of WTC II in Coda Finale.

Converting Finale files into XML
- We converted Finale files into XML because
  - XML files are easier to read than Finale files, and
  - XML files are portable.

Beginning of Prelude No. 1 in XML:

    <?xml version="1.0"?>
    <!DOCTYPE wtc SYSTEM "london.dtd" []>
    <wtc file="L.new/pr 01.ETF">
      <soprano>
        <measure Number="1" barline="normal" beats="4" chg_sys="true" dvibeat="1024" notekey="c">
          <beat Number="1">
            <entry ID="2" Type="rest" duration="256" note="semiquaver" pos="1" stem_dir="up" timepoint="0" timespan="256"/>
            <entry ID="3" Type="note" duration="256" note="semiquaver" pos="2" stem_dir="down" timepoint="256" timespan="256">
              <note.Record pitch="c 5" staff="upper"/>
            </entry>
            <entry ID="4" Type="note" duration="256" note="semiquaver" pos="3" stem_dir="down" timepoint="512" timespan="256">
              <note.Record pitch="d 5" staff="upper"/>
            </entry>
          </beat>

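To illustrate why the XML form is convenient to process, here is a minimal sketch that reads such a file with Python's standard xml.etree.ElementTree module. It is not the authors' tooling. The sample document is a shortened, closed copy of the fragment so that it parses on its own, and the element name note.Record is an assumption about how the slide's tag should be read.

```python
import xml.etree.ElementTree as ET

# A closed, shortened copy of the slide's fragment (assumed tag name: note.Record).
SAMPLE = """<wtc file="sample">
  <soprano>
    <measure Number="1" beats="4" notekey="c">
      <beat Number="1">
        <entry ID="2" Type="rest" duration="256" note="semiquaver" pos="1"/>
        <entry ID="3" Type="note" duration="256" note="semiquaver" pos="2">
          <note.Record pitch="c 5" staff="upper"/>
        </entry>
        <entry ID="4" Type="note" duration="256" note="semiquaver" pos="3">
          <note.Record pitch="d 5" staff="upper"/>
        </entry>
      </beat>
    </measure>
  </soprano>
</wtc>"""

def list_entries(xml_text):
    """Yield (voice, measure, beat, entry id, type, pitch or None) for each entry."""
    root = ET.fromstring(xml_text)
    for voice in root:  # one child element per voice, e.g. <soprano>
        for measure in voice.iter("measure"):
            for beat in measure.iter("beat"):
                for entry in beat.iter("entry"):
                    # Rests have no note record, so the pitch stays None.
                    record = next(
                        (child for child in entry if child.tag == "note.Record"),
                        None,
                    )
                    pitch = record.get("pitch") if record is not None else None
                    yield (
                        voice.tag,
                        int(measure.get("Number")),
                        int(beat.get("Number")),
                        int(entry.get("ID")),
                        entry.get("Type"),
                        pitch,
                    )

entries = list(list_entries(SAMPLE))
```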
Converting XML files into a relational database
- We further converted the XML files into a relational database for efficiency.

measure info.
voice    number  barline  beats  chg_sys  dvibeat  notekey
soprano  1       normal   4      true     1024     c

voice    measure  beat  record  id  type  duration  note        pos
soprano  1        1     false   2   rest  256       semiquaver  1
soprano  1        1     true    3   note  256       semiquaver  2
soprano  1        1     true    4   note  256       semiquaver  3
soprano  1        1     true    5   note  256       semiquaver  4

note info.
id  pitch  staff  tie  accidental  staff cross
3   c 5    upper  NULL
4   d 5    upper  NULL
5   e 5    upper  NULL

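Once the entries are in relational form, a single join can retrieve the notes of a measure together with their pitches. The sketch below uses Python's sqlite3 module with a subset of the columns shown on the slide; the table names entry and note_info are assumptions, and the project's actual database system is not stated in the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table names; the columns are a subset of those on the slide.
conn.executescript("""
CREATE TABLE entry (
    voice TEXT, measure INTEGER, beat INTEGER,
    id INTEGER PRIMARY KEY, type TEXT, duration INTEGER, note TEXT, pos INTEGER
);
CREATE TABLE note_info (
    id INTEGER REFERENCES entry(id), pitch TEXT, staff TEXT
);
""")
conn.executemany(
    "INSERT INTO entry VALUES (?, ?, ?, ?, ?, ?, ?, ?)",
    [
        ("soprano", 1, 1, 2, "rest", 256, "semiquaver", 1),
        ("soprano", 1, 1, 3, "note", 256, "semiquaver", 2),
        ("soprano", 1, 1, 4, "note", 256, "semiquaver", 3),
        ("soprano", 1, 1, 5, "note", 256, "semiquaver", 4),
    ],
)
conn.executemany(
    "INSERT INTO note_info VALUES (?, ?, ?)",
    [(3, "c 5", "upper"), (4, "d 5", "upper"), (5, "e 5", "upper")],
)

def measure_contents(voice, measure):
    """Return (pos, type, pitch) for every entry of one measure, in score order."""
    rows = conn.execute(
        """SELECT e.pos, e.type, n.pitch
           FROM entry AS e LEFT JOIN note_info AS n ON n.id = e.id
           WHERE e.voice = ? AND e.measure = ?
           ORDER BY e.beat, e.pos""",
        (voice, measure),
    )
    return rows.fetchall()

first_measure = measure_contents("soprano", 1)
```

The LEFT JOIN keeps rests in the result, with an empty pitch, because they have no row in the note table.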
WTC II Annotation Server
- It allows the user to view the content of both the text-critical database and the London autograph database.
- It is implemented as a web application using Servlet technology.
- Only the selected portion of the data can be viewed.
- The two databases are not yet fully integrated.

The search interface to the Text-Critical Database
- The user can specify the title, bar, voice, and/or the types of variants.
(Demonstration)

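A search form with optional fields of this kind is commonly translated into a parameterised query. The sketch below shows the idea in Python with sqlite3. It is not the servlet code of the Annotation Server, and the table name examination_points, its columns, and the sample rows are hypothetical.

```python
import sqlite3

def build_query(title=None, bar=None, voice=None, classified=None):
    """Build a parameterised SELECT from whichever search fields were filled in."""
    clauses, params = [], []
    # Column names come from this fixed list, never from user input.
    for column, value in (("title", title), ("bar", bar),
                          ("voice", voice), ("classified", classified)):
        if value is not None:
            clauses.append(f"{column} = ?")
            params.append(value)
    sql = "SELECT sn, element FROM examination_points"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return sql + " ORDER BY sn", params

# Hypothetical table and rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE examination_points "
    "(sn INTEGER, title TEXT, bar TEXT, voice TEXT, element TEXT, classified TEXT)"
)
conn.executemany(
    "INSERT INTO examination_points VALUES (?, ?, ?, ?, ?, ?)",
    [
        (1, "Prelude 1", None, None, "Title", "a, d"),
        (4, "Prelude 1", "1-3", "T(B)", "tie(pitch)", "M-e"),
        (5, "Prelude 1", "1-3", "B", "voice", "M-a, e"),
    ],
)

sql, params = build_query(bar="1-3", voice="B")
results = conn.execute(sql, params).fetchall()
```

Passing the values as parameters, instead of pasting them into the SQL string, keeps the query safe when the search terms come from a web form.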
A search result of the Text-Critical Database

The search interface to the London autograph Database

A search result of the London autograph Database

Future Developments: Visualization of Manuscript Data
- The tables are not very easy to read.
- We are trying to generate the score from the data of the London autograph using Scalable Vector Graphics (SVG) and the Bach font.
Figure: an example of SVG graphics depicting the beginning of Prelude No. 1.

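As a rough illustration of generating a score image from note data, the sketch below draws a five-line staff and plain note heads as SVG text. It makes several assumptions that are not in the slides: pitches are written like 'c 5' with the octave number changing at c, the staff uses a treble clef whose bottom line is e 4, and simple ellipses stand in for the Bach font glyphs.

```python
STEPS = "cdefgab"

def diatonic_step(pitch):
    """Convert a pitch such as 'c 5' into a diatonic step count (c 0 = 0)."""
    letter, octave = pitch.split()
    return int(octave) * 7 + STEPS.index(letter)

def notes_to_svg(pitches, line_gap=10, x_gap=30):
    """Return an SVG string with a five-line staff and one note head per pitch."""
    bottom_line_step = diatonic_step("e 4")  # assumed treble clef
    bottom_y = 80
    width = x_gap * (len(pitches) + 1)
    parts = [f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="120">']
    for i in range(5):
        y = bottom_y - i * line_gap
        parts.append(f'<line x1="0" y1="{y}" x2="{width}" y2="{y}" stroke="black"/>')
    for n, pitch in enumerate(pitches, start=1):
        # Each diatonic step moves the note head up by half a line gap.
        y = bottom_y - (diatonic_step(pitch) - bottom_line_step) * line_gap / 2
        parts.append(f'<ellipse cx="{n * x_gap}" cy="{y}" rx="6" ry="4"/>')
    parts.append("</svg>")
    return "\n".join(parts)

svg = notes_to_svg(["c 5", "d 5", "e 5"])
```

Stems, beams, rests and accidentals are left out; the point is only that the position of each symbol can be computed directly from the stored pitch.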
Tools for statistical and logical analysis
- Statistical tool: for investigating the inner construction of the music, e.g. how often a particular tonal structure appears across all forty-eight movements.
- Logical analysis tool: for analyzing syntactic information such as a scribe’s use of accidentals.
- The London autograph database is converted into Prolog.

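Both kinds of tool can start from the same note records. The sketch below, written in Python, turns a few records into Prolog facts and counts how often each note name occurs. The predicate note/4 and its argument order are hypothetical, since the slides do not show the actual Prolog representation.

```python
from collections import Counter

# (entry id, pitch, staff) records as listed in the note table.
ROWS = [(3, "c 5", "upper"), (4, "d 5", "upper"), (5, "e 5", "upper")]

def to_prolog_fact(entry_id, pitch, staff):
    """Render one record as a Prolog fact for the hypothetical predicate note/4."""
    letter, octave = pitch.split()
    return f"note({entry_id}, {letter}, {octave}, {staff})."

facts = [to_prolog_fact(*row) for row in ROWS]

# A very simple statistic: how often each note name occurs.
note_name_counts = Counter(pitch.split()[0] for _, pitch, _ in ROWS)
```

With facts of this shape, a Prolog query such as note(Id, c, _, upper) would enumerate every c on the upper staff.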
Conclusion
- Analysis of the vast quantity of text-critical data was made manageable by converting the data into XML and organizing it in relational databases.
- Much work is still needed to identify what domain knowledge the computers need in order to perform certain analytical tasks.
- We believe that the fundamental framework for this project has now been established.