Some Speech Basics Phonetic Transcription Contextdependent variation and
Some Speech Basics Phonetic Transcription, Context-dependent variation, and Intonation v Jennifer J. Venditti v Postdoctoral Research Associate v Columbia Computer Science v 12 September 2002
1. Phonetic Transcription
Spelling vs. Sounds v same spelling = different sounds o comb, tomb, bomb oo blood, food, good c court, center, cheese s reason, surreal, shy v same sound = different spellings [i] sea, see, scene, receive, thief [s] cereal, same, miss [u] true, few, choose, lieu, do [ay] prime, buy, rhyme, lie v combination of letters = single sound ch child, beach th that, bathe oo good, foot gh laugh v single letter = combination of sounds x exit, Texas u use, music v ‘silent’ letters k knife, know p psycho, pterodactyl e moose, bone gh through
Figures 4. 1 and 4. 2: Jurafsky & Martin (2000), pages 94 -95.
On-line pronunciation dictionaries phoneset derived from: number of wordforms English variety LDC PRONLEX ARPAbet 90, 694 American CMUdict ARPAbet 100, 000 American CELEX IPA 160, 595 British Source: Jurafsky & Martin (2000), page 121.
Places of articulation dental labial alveolar post-alveolar/palatal velar uvular pharyngeal laryngeal/glottal http: //www. chass. utoronto. ca/~danhall/phonetics/sammy. html
Vocal fold vibration [UCLA Phonetics Lab demo]
Articulatory parameters for English consonants (in ARPAbet) MANNER OF ARTICULATION PLACE OF ARTICULATION bilabial labiodental stop p inter- alveolar palatal velar dental b fric. f v th dh t d s z affric. g q sh zh h ch jh nasal m n approx w l/r flap k glottal ng y dx VOICING: voiceless voiced
American English vowel space HIGH iy uw eh ae uh ow ey FRONT ux oy ax ah ay aw ix ih ao aa LOW BACK
[iy] vs. [uw] (From a lecture given by Rochelle Newman)
[ae] vs. [aa] (From a lecture given by Rochelle Newman)
Acoustic landmarks [p][ix][t] [ih][sh] [ax][n][p] [ae] [t][s] [iy][n] [s] [ae] [l][iy] “Patricia and Patsy and Sally” [p] [ix] [t] [ih]
Articulators in action (Sample from the Queen’s University / ATR Labs X-ray Film Database) “Why did Ken set the soggy net on top of his deck? ”
Exercise (1) 1. Write your name in: (a) IPA. (b) ARPAbet (if possible). 2. Choose one of the following triplets and transcribe each word in both IPA and ARPAbet. ® cone, tomb, bottom ® blood, fool, hook ® court, race, cheese ® reason, surreal, cash ® thing, these, other ® laugh, through, ghoul
Figures 4. 1 and 4. 2: Jurafsky & Martin (2000), pages 94 -95.
IPA consonants (Distributed by the International Phonetics Association. )
IPA vowels (Distributed by the International Phonetics Association. )
2. Context-dependent phonetic variation
Context-dependent variation v What we would consider a single ‘sound’ can be pronounced differently depending on the phonetic context. For example, the phoneme /t/: Figure 4. 8: Jurafsky & Martin (2000), page 104.
Another regular alternation v I can ask v I can see v I can bake v I can play v I can go v I can carry [ay k ae [ay k ae n ae s k] n s iy] m b ey k] m p l ey] ng g ow] ng k ae r iy] n m / __ [+labial stop] n ng / __ [+velar stop] (inopportune [n], insatiable [n], impervious [m], immortal [m], incoherent [ng], ingratitude [ng])
English plurals hiccup [p] sock [k] habit [t] spoof [f] hearth [th] hiccups socks habits spoofs hearths beach [ch] dish [sh] judge [jh] race [s] axe [s] raise [z] beaches dishes judges races axes raises flood [d] scab [b] frog [g] comb [m] grave [v] lathe [dh] fool [l] sewer [r] pies [ay] curfew [uw] sofa [ax] floods scabs frogs combs graves lathes fools sewers pies curfews sofas
Phonological rules for Engl. plurals v Assume that the lexical form of plural is /z/. v Insertion: ix / [+sibilant] ^__ z # v Devoicing: z s / [-voice] ^__ # bus+PL /b ah s +z/ insertion: b ah s +ix z devoicing: -[b ah s ix z] cape+PL /k ey p +z/ -k ey p s [k ey p s] hen+PL /h eh n +z/ --[h eh n z] /b ah s +z/ devoicing: b ah s s insertion: -*[b ah s s] /k ey p +z/ k ey p s -[k ey p s] /h eh n +z/ --[h eh n z]
3. Intonation
Intonation makes the difference A: I’d like to fly to Davenport, Iowa on TWA. B: TWA doesn’t fly there. . . B 1: They fly to Des Moines. B 2: They fly to Des Moines. A: What types of foods are a good source of vitamins? B 1: Legumes are a good source of vitamins. B 2: Legumes are a good source of vitamins. A 1: I met Mary and Elena’s mother at the mall yesterday. A 2: I met Mary and Elena’s mother at the mall yesterday.
Intonation is about. . . v Pitch v Melody, or “tune” v Alignment v Prominence and focus v Chunking, or “phrasing” v. . . and more. . .
Vocal fold vibration Physical: Fundamental frequency (F 0) rate of vibration of the vocal folds fundamental freq. Perceptual: Pitch [UCLA Phonetics Lab demo] perceived pitch
Pitch range v Differences can be due to physical size, gender, social identity, excitement level, linguistic, etc. . . [from Prosody on the Web tutorial on pitch]
English Pitch Accents v Certain words in the speech stream can be made structurally and perceptually prominent by the use of pitch accents. * works for Lucent. * Lenora v Pitch accents are local pitch movements (e. g. rising, falling) or pitch maxima/minima that accompany these metrically strong syllables. v The intonational “tune” is the melody that is created by sequences of pitch accents over an utterance.
Intonational tunes: What do they mean? v Lenora works for Lucent. [Tell me something about the world. . . ] v Lenora works for Lucent. [. . . Really? I wasn’t aware of that. ] v Lenora works for Lucent. [. . . I hope she doesn’t have stock options. ] v Lenora works for Lucent. [I’ve told you a million times. . . ] * * * * [See works by Bolinger, Ladd, Hirschberg. . . ]
Alignment differences cue “assertion” vs. “suggestion” A: I’d like to fly to Davenport, Iowa on TWA. B: TWA doesn’t fly there. . . they fly to Des Moines
Alignment with different words * Legumes are a good source of * vitamins. “broad focus” A: What types of foods are a good source of vitamins? B: * LEGUMES are a good source of vitamins. “narrow focus” # Legumes are a good source of VITAMINS.
Placement of focal accent LEGUMES are a good source of vitamins The rise-fall tune (= “I assert this”) shifts locations.
Placement of focal accent Legumes are a GOOD source of vitamins The rise-fall tune (= “I assert this”) shifts locations.
Placement of focal accent legumes are a good source of VITAMINS The rise-fall tune (= “I assert this”) shifts locations.
Chunking, or “phrasing” A 1: I met Mary and Elena’s mother at the mall yesterday. A 2: I met Mary and Elena’s mother at the mall yesterday.
Phrasing can disambiguate Mary & Elena’s mother mall I met Mary and Elena’s mother at the mall yesterday One intonation phrase with relatively flat overall pitch range.
Phrasing can disambiguate Elena’s mother Mary mall I met Mary and Elena’s mother at the mall yesterday Separate phrases, with expanded pitch movements.
Lists of numbers, nouns twenty. eight. five ninety. four. three seventy. three. seven forty. seventy. seven coffee cake and cream chocolate ice cream and cake fish fingers and bottles cheese sandwiches and milk cream buns and chocolate [from Prosody on the Web tutorial on chunking]
Exercise (2) 1. Sketch out an F 0 contour of Does Manitowoc have a bowling alley? as uttered in the following two contexts: (a) “I know Green Bay has a bowling alley, but. . . ” (b) “I know Manitowoc has a theater, but. . . ” 2. Complete the sentence: When Madonna sings the song. . . Describe the prosodic phrasing of your utterance. 3. How can phrasing help disambiguate the utterance: that’s right at the traffic light
- Slides: 39