SCONLI3 Corpus Based study of Relative clauses in
SCONLI-3 Corpus Based study of Relative clauses in Hindi & Telugu Transfer grammar rules for relative clauses Viswanatha Naidu Y IIIT-Hyderabad 20 -02 -09 vnaidu@research. iiit. ac. in
OUTLINE Introduction Hindi Relative clause overview Telugu Relative clause overview Transfer grammar & Rules Examples Conclusion 20 -02 -2009 SCONLI-3, India 2
Relative clause overview Restricts or qualify the meaning of the noun in NP Languages like English explicitly indicate with relative pronouns Relative pronouns occur in the initial or part of PP or NP 20 -02 -2009 SCONLI-3, India 3
Types of Relative clause There are two types of relative clauses 1. Restrictive clause 2. Non-restrictive clause 20 -02 -2009 SCONLI-3, India 4
Restrictive Non-Restrictive § Helps the hearer/ reader to identify the referent of the noun phrase. § Rel. pronoun can be dropped • Serves to give hearer an added piece of information identified entity • Provides additional info. • Use of Proper nouns How does he/she know the distinction? Very often the distinction expressed intonationally, but also orthographically with punctuation marker. 20 -02 -2009 SCONLI-3, India 5
Restrictive clause The young linguist whom I saw in the conference lives in Hyderabad. Non-Restrictive clause Chomsky, who arrived for the conference lives in Cambridge, Massachusetts. 20 -02 -2009 SCONLI-3, India 6
The accessibility hierarchy defined (comrie, 1977) as Subject Direct object Non-direct object Possessive § Easy to relativize subjects than it is to relativize any of the other positions 20 -02 -2009 SCONLI-3, India 7
Hindi also has the Restrictive – Non-Restrictive Distinction. Restrictive clause E. g. jo billi mere gara Rel. cat. 3. sg my mem house. nom in hai vaha c. Uhe se Dart. A hai is that. Rat Abl afraid is ‘The cat in my house is afraid of Rat. ’ Non-restrictive clause Chandrababu Naidu, jo Andhra Pradesh k. A mukhya mantr. I REL th. A Ajkal Was. sg GEN chief. M minister. M yah. AM hai. now a days here is Chandrababu Naidu who was the CM of AP is here now a days. 20 -02 -2009 SCONLI-3, India 8
Hindi Relative clauses overview Two types of relative clauses 1. Correlative clause 2. Participle clause 20 -02 -2009 SCONLI-3, India 9
Correlative clause | • Have an explicit relative markers. § These markers will be preceded by the nouns § Appear initial position of the clauses 20 -02 -2009 Participle clause • • 1. 2. • SCONLI-3, India No explicit markers Two types of participle relatives, Present Participle Past Participle Modify head noun as relative clause do. 10
Correlative clauses In correlatives any NP position can be relativized. Like Subjects, objects, Indirect, Oblique including Instruments, locatives etc. Instrumental Relativization jis c. Aku se maine murg. I ko k. At. A vaha bahut tej th. A REL knife. M with I. nom hen. M. Accu cut that very sharp was. M The knife with which I cut the hen was very sharp 20 -02 -2009 SCONLI-3, India 11
Present participle • • 1. 2. • • & All verbs yield present participle forms, which have two functions. Adjectival Adverbial Inflect for Person, Number, Gender. Appear in the form (verb +-t. A +hu. A). 20 -02 -2009 Past participle • Only a restricted set of verbs yield past participle forms, • which indicate achievement. • which have two functions. 1. Adjectival 2. Adverbial • Inflect for Person, Number, Gender. • Appear in the form (verb + -A + hu. A). SCONLI-3, India 12
Hindi does not allow to relativize the Instrument, locative, etc. It allows only subjects and objects. Subject Relativization dau. DA hu. A la. Dk. A ac. Anak ruk gay. A running was boy suddenly stop. Came. ‘The boy who was running suddenly came to stop’ Instrumental Relativization *mere dw. Ar. A kel. A k. At. A hu. A c. Ak. U le j. Ao I by Banana cut did. knife. M take away Take away the knife with which I cut the Banana Locative Relativization *mere dw. Ar. A bait. I hu. I kurc. I bahut m. Ang. I hai. I by sit. F did. Chsir very The chair in which I sat is costly. 20 -02 -2009 costly. SCONLI-3, India is 13
Telugu Relative clause overview 1. 2. • • Telugu also has two types of relative clauses Correlative clauses Participle clauses But, correlatives are not normal in Telugu (Bh. Krishnamurthy) used in more formal speech. My experience of working in corpus gave the same experience as I hardly found the correlatives eppudu Akalaite appudu tin. Ali. whenever. Hungry that+time eat. More naturally it can be expressed using participles 20 -02 -2009 SCONLI-3, India 14
Participle Relative clauses v. Participles form by means of non-finite construction v Do not inflect person, number, gender E. g. ADutunna abb. Ayi b. Awunn. Adu. play. v. adj. conti boy. Nom good. 3. sg. . M The boy, who is playing is good. Accessibility hierarchy of Participle Relativization in Telugu is more frequent, Higher, (oblique forms can also be relativized). 20 -02 -2009 SCONLI-3, India 15
MT ARCHITECTURE v. POS TAGGAER v. CHUNKER v. MORPH ANALYZER v. TOKENIZER v. PARSER v. NER v. WSD v. Etc. ANALYSIS v. TRANSFER GRAMMAR 1. Lexical grammar 2. Structural grammar v. TRANSILITERATION TRANSFER v. Default features v. TL specific features v. Word generator v. Etc. GENERATOR 20 -02 -2009 SCONLI-3, India 16
Transfer grammar One of the approaches to Machine Translation Captures ‘structural differences’ between SL & TL Aim to develop the MT system 20 -02 -2009 SCONLI-3, India 17
ORGANIZATION OF TRANSFER GRAMMAR input §Removing specific features of grammar in SL §Bridging the gap between SL & TL Lexical & Structural Transfer Rules v. Nouns v. Verbs v. Adjectives v. Adverbs v. Prepositions v. Conjunctions v. Miscellaneous Final adjustment by the generator according to the TL features 20 -02 -2009 SCONLI-3, India output 18
Relative clause rules jo Amdh. I kal Ay. I th. I Rel. storm. F come. Pef. F. sg Past. F. sg ninna yesterday vaccina vah bahut nuks. An kar gay. I that much damage. M do go. Perf. F. sg tuf. Anu c. Al. A nas. Tam cesindi. Yesterday. Nom come. past. verbal. adj storm. Nom much damage did. Non-Mascu. 3. sg ‘The storm that raged yesterday did a great deal of damage’ ज आध कल आय थ व बहत नकस न ననన వచచన త ఫ న నషటమ చస ద 20 -02 -2009 SCONLI-3, India कर गय చ ల 19
व Null vo Null बहत చ ల bahut c. Al. A नकस न నషటమ nuks. An nas. Tam చస ద kar_gay. I cesindi कर गय Till now mapping between main clause-main clause there are no problems except the correlative marker Let’s see the sub-ordinate clause mapping 20 -02 -2009 SCONLI-3, India 20
ज आध Null jo ననన कल త ఫ న आय -थ వచచ ద Andh. I Null ninna kal tuf. Anu Ay. I-th. I vaccindi Here we can clearly see, there is a gap between SL and TL in three aspects. 1. No relative marker in TL 2. SL sub-ordinate +Tense into Non-finite form 3. Word order change 20 -02 -2009 SCONLI-3, India 21
RULES 1. Change of word order according to TL 2. Deleting the relative makers (jo & vo) 3. Converting the finite into non-finite 20 -02 -2009 SCONLI-3, India 22
jo Change of word order according to TL Null Andh. I ninna kal vaccina Ay. I_th. I tuf. Anu vo Null bahut c. Al. A nuks. An nas. Tam kargay. I cesindi 20 -02 -2009 SCONLI-3, India 24
jo Null kal ninna Ay. I_th. I vaccina Andh. I tuf. Anu vo Null bahut c. Al. A nuks. An nas. Tam kargay. I cesindi 20 -02 -2009 SCONLI-3, India 25
kal ninna Ay. I_th. I vaccina Andh. I tuf. Anu bahut c. Al. A nuks. An nas. Tam kargay. I cesindi 20 -02 -2009 SCONLI-3, India 27
kal ninna Aya+(Past. V. adj) vaccina Andh. I tuf. Anu bahut c. Al. A nuks. An nas. Tam kargay. I cesindi 20 -02 -2009 SCONLI-3, India 29
REQUIRED RESOURCES ØLARGE AMNOUNT OF CORPUS, REPRESENTS ALL DOMAINS ØBROAD COVERAGE MORPH ANALYZER, GENERATOR >BROAD COVERAGE E-BILINGUAL LEXICONS ØLARGE AMOUNT OF POS & PARSED TREE BANK CORPUS ØLARGE AMOUNT OF PARALLEL CORPORA ØApart from, there is an high necessity of well trained LINGUIST for modeling the language (s), and a Computer Scientist for implementing the model (s). 20 -02 -2009 SCONLI-3, India 31
Thank you for patience 20 -02 -2009 SCONLI-3, India 32
- Slides: 32