Encoding the Document Encoding the Process Elena Pierazzo

Encoding the Document, Encoding the Process
Elena Pierazzo
Centre for Computing in the Humanities, King’s College London
elena. [email protected] ac. uk

Are Modern Manuscripts the Same as Medieval Manuscripts?
  • They have different functions: private/note-taking vs. “publication”, public circulation (most of the time)
  • They look different: unplanned vs. planned writing
  • Modern manuscripts contain a mixture of text and other material whose phenomenology cannot always be traced back to that of medieval manuscripts (or so we thought)
  • They can be organised in sequences of drafts
  • Does TEI provide enough support for modern manuscripts?

The beginning
  • The idea was first raised at the first meeting of the TEI MS SIG: Nancy, 2003
  • The issue was raised again, with high priority, on the MS SIG agenda in 2007
  • P5 official release
  • Working group within the MS SIG in November 2008 (London, TEI MM)

The Working Group
  • Fotis Jannidis (chair)
  • Elena Pierazzo
  • Malte Rehbein
  • + Lou Burnard (Paolo D’Iorio)
  • & Gregor Middell, Moritz Wissenbach

The Workshop
  • Sponsored by ALLC, ACH, TEI, Moore Institute
  • Paris, ENS – ITEM, 14-15 May 2009
  • Experts on Genetic Editions, not necessarily on Digital Editions
  • Invited experts: Anne Bohnenkamp, John Bryant, Aurèle Crasson, Jean-Daniel Fekete, Daniel Ferrer, Hans Walter Gabler, Axel Gellhaus, Almuth Grésillon, Claus Huitfeldt, Dirk van Hulle, Jean-Louis Lebrave, Wolfgang Lukas, Kenneth M. Price, Kathryn Sutherland

Road map
  • Officially presented at the TEI Members’ Meeting in Ann Arbor (Michigan), 2009
  • Beta test open until the end of January 2010
  • February/March 2010: revision
  • April 2010: submission to the TEI Council as a formal proposal
  • Accepted in principle
  • November 2010: workshop and testing at the TEI Members’ Meeting in Zadar

Documents or Texts?

Documents and Texts
  • Yin and Yang, Body and Soul
  • In the editorial process it is impossible not to consider them both…

Genesis of what?
  • Documents? (tears, cuts, patches, stitches…)
  • Texts? (corrections, revisions, additions, deletions, transpositions…)
  • Both, of course!

Three parts
  • Document encoding
  • Transcription enhancement
  • Genetic encoding

TEI provides a way to encode TEXTS, but sometimes
  • The process of making the document is at least as important as the text that it contains
  • The text can be determined by the document
  • The text is graphically presented
  • There is no text: non-linear texts are non-texts (Ferrer)

Where is the text?

Are Modern Manuscripts the Same as Medieval Manuscripts?
Not really…

<ge:document>
  • A sibling of <text> and <facsimile>
  • Represents the physical structure of a document, in terms of written <surface>s and <zone>s of writing
  • A new <ge:line> element is needed to represent topographic lines
  • A new <ge:patch> element is needed to represent fragmentary surfaces

<text>
  <body>
    <div type="diary entry">
      <dateline>
        <date value="2009-04-01">1 April 2009</date>
      </dateline>
      <p><lb/>Fed Birds in the park today.
        <lb/>Might write an article about
        <lb/>the Thick-billed Warbler.</p>
    </div>
    <div type="note" rend="rotated">
      <p><lb/>Samaria is a Greek
        <lb/>brand of water that
        <lb/>comes from the natural
        <lb/>springs of Stilos, in
        <lb/>Crete</p>
    </div>
  </body>
</text>
<ge:document>
  <surface ulx="0" uly="0" lrx="200" lry="300">
    <zone ulx="10" uly="43" lrx="185" lry="84" rotate="0">
      <zone>
        <ge:line rend="right">1 April 2009</ge:line>
      </zone>
      <ge:line>Fed Birds in the park today.</ge:line>
      <ge:line>Might write an article about</ge:line>
      <ge:line>the Thick-billed Warbler.</ge:line>
    </zone>
    <zone ulx="9" uly="20" lrx="70" lry="60" rotate="90">
      <ge:line>Samaria is a Greek</ge:line>
      <ge:line>brand of water that</ge:line>
      <ge:line>comes from the natural</ge:line>
      <ge:line>springs of Stilos, in</ge:line>
      <ge:line>Crete</ge:line>
    </zone>
  </surface>
</ge:document>

<ge:document>
  <surface ulx="0" uly="0" lrx="200" lry="300">
    <zone stage="#stage1" seq="0" ulx="10" uly="43" lrx="185" lry="84">
      <zone>
        <milestone unit="date" spanTo="#endDate"/>1 April 2009
        <anchor xml:id="endDate"/>
      </zone>
      <milestone unit="p" spanTo="#p2"/>
      <ge:line>Fed Birds in the park today.</ge:line>
      <ge:line>Might write an article about</ge:line>
      <ge:line>the Thick-billed Warbler.</ge:line>
    </zone>
    <zone stage="#stage2" ulx="9" uly="20" lrx="70" lry="60" rotate="90">
      <milestone unit="p" xml:id="p2" spanTo="#end"/>
      <ge:line>Samaria is a Greek</ge:line>
      <ge:line>brand of water that</ge:line>
      <ge:line>comes from the natural</ge:line>
      <ge:line>springs of Stilos, in</ge:line>
      <ge:line>Crete</ge:line>
      <anchor xml:id="end"/>
    </zone>
  </surface>
</ge:document>
Can we combine the two views? Sort of…
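One way to see how the milestone/anchor pairs let a documentary transcription carry textual units is to resolve each spanTo pointer and collect the text it covers. The following is only a minimal sketch in Python: the namespace URI bound to the ge prefix is a placeholder, and the helper names (events, spans) are hypothetical, not part of the proposal.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URI for the "ge" prefix (an assumption, not the real one).
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

SAMPLE = """<ge:document xmlns:ge="http://example.org/ns/genetic">
  <surface>
    <zone>
      <zone><milestone unit="date" spanTo="#endDate"/>1 April 2009<anchor xml:id="endDate"/></zone>
      <milestone unit="p" spanTo="#p2"/>
      <ge:line>Fed Birds in the park today.</ge:line>
      <ge:line>Might write an article about</ge:line>
      <ge:line>the Thick-billed Warbler.</ge:line>
    </zone>
    <zone rotate="90">
      <milestone unit="p" xml:id="p2" spanTo="#end"/>
      <ge:line>Samaria is a Greek</ge:line>
      <ge:line>brand of water that</ge:line>
      <ge:line>comes from the natural</ge:line>
      <ge:line>springs of Stilos, in</ge:line>
      <ge:line>Crete</ge:line>
      <anchor xml:id="end"/>
    </zone>
  </surface>
</ge:document>"""


def events(elem):
    """Flatten the tree into document order: element starts and text chunks."""
    yield ("start", elem)
    if elem.text:
        yield ("text", elem.text)
    for child in elem:
        yield from events(child)
        if child.tail:
            yield ("text", child.tail)


def spans(root):
    """Return (unit, text) for every milestone that carries a spanTo pointer."""
    stream = list(events(root))
    result = []
    for i, (kind, item) in enumerate(stream):
        if kind == "start" and item.tag == "milestone" and item.get("spanTo"):
            target = item.get("spanTo").lstrip("#")
            parts = []
            # Collect text until the element whose xml:id is the span's end point.
            for kind2, item2 in stream[i + 1:]:
                if kind2 == "start" and item2.get(XML_ID) == target:
                    break
                if kind2 == "text":
                    parts.append(item2)
            result.append((item.get("unit"), " ".join(" ".join(parts).split())))
    return result


for unit, text in spans(ET.fromstring(SAMPLE)):
    print(unit, "|", text)
```

The span is cut at whichever element carries the target xml:id, so a paragraph can end either at an anchor or at the milestone that opens the next unit, as in the slide.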

The content of <zone>
  • <ge:line>
  • <table>
  • Text!

Patches
<surface>
  <zone>
    <ge:line>Poem</ge:line>
    <ge:line>As in Visions of — at</ge:line>
    <ge:line>night —</ge:line>
    <ge:line>All sorts of fancies running through</ge:line>
    <ge:line>the head</ge:line>
  </zone>
  <ge:patch type="newsprint" binder="glue" height="40" width="90">
    Spring has just set in here, and the weather.. a steamer
    <zone>
      <ge:metaMark function="sequence">2</ge:metaMark>
    </zone>
  </ge:patch>
  <ge:patch type="newsprint" binder="glue" height="35" width="90">
    "The shores on either side of the Sound are... The In
    <zone>
      <ge:metaMark function="sequence">3</ge:metaMark>

Exercise 1: Jane Austen
  • Go to www.janeausten.ac.uk
  • Select Manuscripts
  • Select Volume the First – Text
  • From the dropdown menu select page 1

New Transcriptional Features
  • <ge:metaMark>: markup already present in the document
  • <ge:rewrite>: clarification or fixation
  • <ge:used>: marked as used elsewhere
  • <ge:undo>: a cancelled revision
  • <ge:redo>
  • <ge:transpose> & <ge:transposeGrp>
  • <ge:mod>: generic element for modifications

Metamark
<del>
  <ge:metaMark function="flag" targets="#s1">lege</ge:metaMark>
  <s xml:id="s1">Ock en schullen de bruwere des hilgen dages nicht over
    <lb/>setten noch uppe den stillen fridach bruwen.</s>
  <add><s>Noch nymande
    <lb/>over setten, se en sehin denne erst, dat uppe den bonen
    <lb/>neyn stro noch, huw noch flaß ligghe, by pine eyner halven
    <lb/>roden, deme bruwere so wol alse dem bruwheren to murende.</s></add>
</del>

Clarification
<ge:line>... Sku<ge:rewrite cause="unclear">l</ge:rewrite>dren</ge:line>

Fixation
<ge:rewrite cause="fix" hand="#ja2" stage="#s1">Now, if we could get <del stage="1">but</del> a young Heiress</ge:rewrite>

Used
<surface>
  <ge:used rend="cross" spanTo="#X2"/>
  <zone>
    <ge:line rend="underline">The Poet</ge:line>
    <ge:line><del rend="strikethrough">I think</del> His sight is the</ge:line>
    <ge:line>sight of the ? and</ge:line>
    <ge:line>has sent the instinct of the</ge:line>
    <ge:line>? dog</ge:line>
  </zone>
  <zone>
    <ge:line>I think <ge:rewrite>ten</ge:rewrite> million</ge:line>
    <!-- ... -->
    <ge:line>well; those <subst>
      <del rend="strikethrough">supple fingered gods</del>
      <add>journeymen divine.</add>
    </subst></ge:line>
    <anchor xml:id="X2"/>
  </zone>
</surface>

Undo (and instant correction)
<ge:line>che e’ <hi rend="underline"><ge:undo spanTo="#x2"/>si <anchor xml:id="x2"/> rechi a’</hi>
  <del rend="overstrike" instant="true">dotti</del>
  <hi rend="underline">denti</hi> l’un d’essi cibi</ge:line>

Transposition
<ge:line><seg xml:id="ib01">bör</seg><ge:metaMark rend="underline" function="transposition" targets="#ib01" place="above">2.</ge:metaMark> og <seg xml:id="ib02">hör</seg><ge:metaMark rend="underline" function="transposition" targets="#ib02" place="above">1.</ge:metaMark></ge:line>
<ge:transposeGrp>
  <ge:transpose>
    <ptr target="#ib02"/>
    <ptr target="#ib01"/>
  </ge:transpose>
</ge:transposeGrp>
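The transposition encoding separates what is on the page (the segments and their numbering marks) from the intended order (the sequence of pointers). A small Python sketch can show how a processor might derive the transposed reading from the pointer order. It is an illustration only: the namespace URI and the function name transposed_reading are assumptions, and the rendering attributes are omitted from the sample for brevity.

```python
import xml.etree.ElementTree as ET

GE = "http://example.org/ns/genetic"  # placeholder namespace URI (assumption)
NS = {"ge": GE}
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

SAMPLE = """<root xmlns:ge="http://example.org/ns/genetic">
  <ge:line><seg xml:id="ib01">bör</seg><ge:metaMark function="transposition" targets="#ib01" place="above">2.</ge:metaMark> og <seg xml:id="ib02">hör</seg><ge:metaMark function="transposition" targets="#ib02" place="above">1.</ge:metaMark></ge:line>
  <ge:transposeGrp>
    <ge:transpose>
      <ptr target="#ib02"/>
      <ptr target="#ib01"/>
    </ge:transpose>
  </ge:transposeGrp>
</root>"""


def transposed_reading(root):
    """Return the text of the line with the transposition carried out."""
    line = root.find("ge:line", NS)
    transpose = root.find("ge:transposeGrp/ge:transpose", NS)
    # Intended order of the segments, as listed by the pointers.
    new_order = [p.get("target").lstrip("#") for p in transpose.findall("ptr")]
    by_id = {s.get(XML_ID): s for s in line.iter("seg")}
    # The positions to fill are the involved segments in document order.
    slots = [s.get(XML_ID) for s in line.iter("seg") if s.get(XML_ID) in new_order]
    replacement = {slot: by_id[new_id].text for slot, new_id in zip(slots, new_order)}

    parts = [line.text or ""]
    for child in line:
        if child.tag == "seg":
            parts.append(replacement.get(child.get(XML_ID), child.text or ""))
        elif child.tag != f"{{{GE}}}metaMark":
            parts.append("".join(child.itertext()))
        # Numbering marks are dropped, but the text following them is kept.
        parts.append(child.tail or "")
    return " ".join("".join(parts).split())


print(transposed_reading(ET.fromstring(SAMPLE)))
```

Keeping the metamarks in the transcription and the pointers in a separate group means the documentary reading ("bör og hör" with its marks) and the intended reading can both be produced from the same encoding.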

A generic modification
Semantic
<ge:line>I should <subst><del>soon</del><add>shortly</add></subst> do</ge:line>
Documentary
<ge:line>I should <ge:mod rend="strikethrough">soon</ge:mod> <ge:mod rend="interlinear">shortly</ge:mod> do</ge:line>

Support on existing markup
Alternative readings
<zone>
  <ge:line>Alone <seg type="alternative" xml:id="alt1">before</seg>
    <add place="above" type="alternative" xml:id="alt2">beside</add> his native river —</ge:line>
  <alt targets="#alt1 #alt2" mode="excl" weights="0 1"/>
</zone>
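As a rough illustration of what the <alt> element makes possible, the sketch below picks the alternative with the highest weight and drops the others when producing a reading text. The namespace URI, the function name preferred_reading and the slightly abridged sample line are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

NS = {"ge": "http://example.org/ns/genetic"}  # placeholder namespace URI (assumption)
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

SAMPLE = """<zone xmlns:ge="http://example.org/ns/genetic">
  <ge:line>Alone <seg type="alternative" xml:id="alt1">before</seg>
    <add place="above" type="alternative" xml:id="alt2">beside</add> his native river</ge:line>
  <alt targets="#alt1 #alt2" mode="excl" weights="0 1"/>
</zone>"""


def preferred_reading(zone):
    """Keep the mutually exclusive alternative with the highest weight."""
    alt = zone.find("alt")
    ids = [t.lstrip("#") for t in alt.get("targets").split()]
    weights = [float(w) for w in alt.get("weights").split()]
    chosen = max(zip(weights, ids))[1]
    rejected = set(ids) - {chosen}

    line = zone.find("ge:line", NS)
    parts = [line.text or ""]
    for child in line:
        if child.get(XML_ID) not in rejected:
            parts.append("".join(child.itertext()))
        parts.append(child.tail or "")
    return " ".join("".join(parts).split())


print(preferred_reading(ET.fromstring(SAMPLE)))
```

With the weights from the slide ("0 1"), the interlinear addition wins over the reading on the line.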

Exercise 2: The Blake Archive
  • Go to www.blakearchive.org
  • Enter the archive
  • Go to Text Search and search for Devil
  • Choose the first item, Songs of Innocence and of Experience
  • Then the first item again
  • Then object 39
  • Open the image as well

Genesis within the same document
Genesis across documents
The passage from correction within a document to another document can happen at any time, and it is not a one-way process

Within the same Doc: Stages and Revision Campaigns
<profileDesc>
  <creation>
    <ge:stageNotes ordered="true">
      <ge:stageNote xml:id="ST1">First stage, written in ink by a scribe</ge:stageNote>
      <ge:stageNote xml:id="ST2">Second stage, written in Goethe's hand using pencil</ge:stageNote>
      <ge:stageNote xml:id="ST3">Fixation of the revised passages and further revisions by Goethe using ink</ge:stageNote>
      <ge:stageNote xml:id="ST4">Addition of another stanza in a different hand, probably at a later stage</ge:stageNote>
    </ge:stageNotes>
  </creation>
</profileDesc>
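Because the stage notes are declared as ordered, a processor can turn them into a sequence and ask whether a feature tagged with a given stage already exists at some point in the genesis. A minimal Python sketch, assuming a placeholder namespace URI and the hypothetical helper names stage_order and reached_by:

```python
import xml.etree.ElementTree as ET

NS = {"ge": "http://example.org/ns/genetic"}  # placeholder namespace URI (assumption)
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

SAMPLE = """<profileDesc xmlns:ge="http://example.org/ns/genetic">
  <creation>
    <ge:stageNotes ordered="true">
      <ge:stageNote xml:id="ST1">First stage, written in ink by a scribe</ge:stageNote>
      <ge:stageNote xml:id="ST2">Second stage, written in Goethe's hand using pencil</ge:stageNote>
      <ge:stageNote xml:id="ST3">Fixation of the revised passages and further revisions by Goethe using ink</ge:stageNote>
      <ge:stageNote xml:id="ST4">Addition of another stanza in a different hand, probably at a later stage</ge:stageNote>
    </ge:stageNotes>
  </creation>
</profileDesc>"""


def stage_order(root):
    """Return the stage identifiers in their declared chronological order."""
    notes = root.find(".//ge:stageNotes", NS)
    if notes.get("ordered") != "true":
        raise ValueError("stage notes are not declared as ordered")
    return [note.get(XML_ID) for note in notes.findall("ge:stageNote", NS)]


def reached_by(stage_ref, at_stage, order):
    """True if a feature tagged with stage_ref already exists at at_stage."""
    return order.index(stage_ref.lstrip("#")) <= order.index(at_stage.lstrip("#"))


order = stage_order(ET.fromstring(SAMPLE))
print(order)
print(reached_by("#ST2", "#ST3", order))
```

A renderer could use such a check to show the state of a page at one stage by hiding every element whose stage attribute points to a later one.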

Transposition (with stages)
<ge:line><seg xml:id="ib01">bör</seg><ge:metaMark rend="underline" function="transposition" targets="#ib01" place="above" stage="#ST2">2.</ge:metaMark> og <seg xml:id="ib02">hör</seg><ge:metaMark rend="underline" function="transposition" targets="#ib02" place="above" stage="#ST2">1.</ge:metaMark></ge:line>
<ge:transposeGrp>
  <ge:transpose>
    <ptr target="#ib02"/>
    <ptr target="#ib01"/>
  </ge:transpose>
</ge:transposeGrp>

Genesis across docs
<profileDesc>
  <creation>
    <date notAfter="1816-07-18"/>
    <ge:stageNotes ordered="true">
      <ge:stageNote xml:id="mod1" when="1816-07-16">The first draft of <title>Persuasion</title> is completed by the <date>July 16 1816</date> written after the word <q>Finis</q> at <ref target="#pers30">page 30</ref>.</ge:stageNote>
      <ge:stageNote xml:id="mod2" notBefore="1816-07-16">After the <date>16th of July</date> Austen starts revision of the two final chapters, by rewriting the end adding a new block (<ref target="#transp1">pages 32-35</ref>) to be inserted at <ref target="#insertionp1">page 19</ref>. This stage is documented by the deletion of the date (<date>July 16 1816</date>) at <ref target="#pers30">page 30</ref>, and the addition of more text and of a new date (<date>July 18. 1816</date>) at <ref target="#pers31">page 31</ref></ge:stageNote>
      <ge:stageNote notBefore="1816-07-18" xml:id="mod3">Before publication, after <date>July 18th, 1816</date> chapters 10-11 were broken into three chapters, 10, 11, 12, as

Let’s start by grouping!
<teiCorpus>
  <teiHeader>
    <!-- information about the dossier -->
  </teiHeader>
  <xi:include href="document1.xml"/>
  <xi:include href="document2.xml"/>
  <xi:include href="document3.xml"/>
</teiCorpus>

Then let’s organise our group
<graph type="directed">
  <node xml:id="A" value="http://edition.net/witness/A">
    <label>A</label>
  </node>
  <node xml:id="B" value="http://edition.net/witness/B">
    <label>B</label>
  </node>
  <!-- more nodes -->
  <arc xml:id="AC" from="#A" to="#C" value="http://edition.net/genetic/analysis#ac"/>
  <arc xml:id="BC" from="#B" to="#C" value="http://edition.net/genetic/analysis#bc"/>
  <!-- more arcs -->
</graph>
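A directed graph of witnesses can be read as a partial chronology of the dossier: every arc says that one document precedes another. The sketch below derives one compatible ordering with Python's standard graphlib module. The function name genetic_order is hypothetical, and node C, which the slide leaves under "more nodes", is written out here only so that the sample is self-contained.

```python
import xml.etree.ElementTree as ET
from graphlib import TopologicalSorter

XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Node C is spelled out for the example; the slide only implies it.
SAMPLE = """<graph type="directed">
  <node xml:id="A" value="http://edition.net/witness/A"><label>A</label></node>
  <node xml:id="B" value="http://edition.net/witness/B"><label>B</label></node>
  <node xml:id="C" value="http://edition.net/witness/C"><label>C</label></node>
  <arc xml:id="AC" from="#A" to="#C" value="http://edition.net/genetic/analysis#ac"/>
  <arc xml:id="BC" from="#B" to="#C" value="http://edition.net/genetic/analysis#bc"/>
</graph>"""


def genetic_order(graph_el):
    """Return the witnesses in an order compatible with all arcs."""
    # Map every witness to the set of witnesses that must come before it.
    predecessors = {node.get(XML_ID): set() for node in graph_el.findall("node")}
    for arc in graph_el.findall("arc"):
        source = arc.get("from").lstrip("#")
        target = arc.get("to").lstrip("#")
        predecessors.setdefault(source, set())
        predecessors.setdefault(target, set()).add(source)
    # Raises graphlib.CycleError if the arcs contradict each other.
    return list(TopologicalSorter(predecessors).static_order())


print(genetic_order(ET.fromstring(SAMPLE)))
```

The relative position of A and B is left open by the arcs, so only the position of C after both of them is guaranteed.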

Genesis of documents
http://www.janeausten.ac.uk/facsimile/blvolsecond/6566.html
<ge:document>
  <surface ulx="0" uly="0" lrx="200" lry="300"><zone>...</zone></surface>
  <damageSpan spanTo="#p3"/>
  <gap extent="1" unit="folio">
    <desc>Stub of a missing folio</desc>
  </gap>
  <surface ulx="0" uly="0" lrx="200" lry="300" xml:id="p3">
    <zone>...</zone>
  </surface>
</ge:document>

<ge:document>
  <damageSpan spanTo="#P3" stage="#stage2"/>
  <gap extent="1" unit="folio" stage="#stage2">
    <desc>Stub of a missing folio</desc>
  </gap>
  <surface corresp="folio.xml#p1" stage="#stage1"/>
  <surface corresp="folio.xml#p2" stage="#stage1"/>
  <surface ulx="0" uly="0" lrx="200" lry="300" xml:id="P3">
    <zone>...</zone>
  </surface>
</ge:document>

Exercise 3: Whitman archive
  • Go to www.whitmanarchive.org
  • Select Manuscripts > Poetry Manuscripts
  • Select the ninth item: America Old World Bards

For more info
http://www.tei-c.org/SIG/Manuscripts/genetic.html
