Extracting verb valency frames with Noo J Kreimir
Extracting verb valency frames with Noo. J Krešimir Šojat, Kristina Vučković*, Marko Tadić ksojat@ffzg. hr, kvuckovi@ffzg. hr, marko. tadic@ffzg. hr Faculty of Humanities and Social Sciences University of Zagreb Department of Linguistics *Department of Information Sciences Ivana Lucica 3, Zagreb, Croatia Noo. J 2009 Tozeur 2009 -06 -09 1/12
The Plan § Our agenda? § Full description of consumation verb valency frames (Frame. Net by Fillmore, Atkins, Ruppenhofer et al, etc. ) § given core arguments § searching for peripheral elements § time, place, manner, company (PP+I), instrument (NP+I), cause… § How? § using core verb valency frames description § checking the verb’s environment § -4 and +4 sets of word phrases § Noo. J 2009 Tozeur 2009 -06 -09 Why? § to prepare data for Croatian Word. Net § to improve grammars for syntactic verb environment recognition 2/12
Overview § Croatian consumation verb valency § main characteristics § Lexicon § data description § Syntactic grammar § detecting verb’s environment § Checking the data § exctracting frames Noo. J 2009 Tozeur 2009 -06 -09 3/12
Consumation verb valency lexicon § adding semantic information to lexicon § semantic field = cons § consumer § cons 1 (Nominative) § consumed Ja jedem. Jedem. (I am eating. ) Ona se najela gljiva. (She has stuffed § cons 2 (Genitive) herself with mushrooms). § cons 4 (Accusative) Ja jedem ribu. (I’m eating fish. ) § cons 7 (Instrumental) Oni se hrane kukuruzom. (They are feeding on corn. ) § core arguments = cons 1 | cons 12 | cons 14 | cons 17 § jesti, V+FLX=JESTI+Aspect=inf+Prelaz=pov +cons 1+cons 14 Noo. J 2009 Tozeur 2009 -06 -09 4/12
Grammars Noo. J 2009 Tozeur 2009 -06 -09 6/12
Grammars Noo. J 2009 Tozeur 2009 -06 -09 7/12
Results Kao i većina drugih, ta obitelj nikad ne jede u Branimirovoj već hranu nosi kući. Like many others, that family never eats in Branimirova street but carries their food home. -4 -3 -2 i većina ta obitelj drugih -1 nikad ne jede <C> <NP+Nom> <R> om> Noo. J 2009 Tozeur 2009 -06 -09 0 1 2 3 u već hranu Branimirovoj <VP+cons 1> <PP+L> 4 nosi <C> <NP+Acc> <VP> 8/12
Results 2 : problems -4 -3 § A: Ona se tako hrani poradi svoga siromaštva što ga ne smije otkriti kćeri. -2 feeds -1 herself 0 in such a manner 1 4 § She due 2 to her 3 powerty that she must not disclose to her daughter. ona se tako hrani poradi svoga siromaštva što ga ne smije otkriti <NP+Nom> <VP> <R> <VP+cons 1> <PP+G> <PRO> <NP+Acc> <VP> § B: Prije početka susreta jeli su kroasane i voće i pili voćne sokove. § Before the beginning of they 3 ate croassans 4 and -4 -3 -2 -1 0 1 meeting 2 fruit and drank fruit juices. Noo. J 2009 Tozeur 2009 -06 -09 prije početka susreta jeli su kroasane i voće i pili voćne sokove <PP+G> <VP+cons 14> <NP+Acc> <C> <VP+cons 14> <VP> 9/12
Possible solutions 1 § A: § <VP+cons 1><PP+G><PRO+question><WF> § => § <VP+cons 1> <ADV+cause <PP+G <Att> > > § A: <PP+G> - ADV+cause § B: <PP+G> - ADV+time (S+vr) § <PP+G><VP+cons 14>… § => § <ADV+time <PP+G> > <VP+cons 14> Noo. J 2009 Tozeur 2009 -06 -09 10/12
Possible solutions 2 § A: Ona se tako hrani poradi svoga siromaštva što ga ne smije otkriti kćeri. -4 -2 -1 1 2 siromaštva 3 ona-3 tako 0 poradi svoga što 4 ga ne smije otkriti kćeri. <AD ona se tako hrani poradi što ga ne V svoga smije +ma siromaštva otkriti <NP+ nne <NP+Nom> <VP> <R> <VP+cons 1> <PP+G> <PRO> <NP+Acc> <VP> <ADV+cause> r> CONSUMER> § B: Prije početka susreta jeli su kroasane i voće i pili voćne sokove. -4 -3 -2 -1 prije početka 0 1 2 kroasane i voće 3 4 prije početka susreta jeli su kroasane i voće pili voćne sokove <PP+G> <ADV+time> <NP+CONSUME <VP+cons 14> <NP+Acc> <C> <VP+cons 14> <VP> susreta Noo. J 2009 Tozeur 2009 -06 -09 D> i 11/12
Future work § building local grammars for recognizing 1. syntactic verb valency frames § morphosyntactic description of phrases 2. semantic verb valency frames § core + peripheral frame elements 3. check if described frames can be copied into other semantic fields Noo. J 2009 Tozeur 2009 -06 -09 12/12
- Slides: 11