Speech and Language Processing Chapter 1 of SLP

  • Slides: 35
Download presentation
Speech and Language Processing Chapter 1 of SLP

Speech and Language Processing Chapter 1 of SLP

Natural Language Processing • We’re going to study what goes into getting computers to

Natural Language Processing • We’re going to study what goes into getting computers to perform useful and interesting tasks involving human languages. • We are also concerned with the insights that such computational work gives us into human processing of language. 1/1/2022 Speech and Language Processing - Jurafsky and Martin 2

Why Should You Care? 1. An enormous amount of knowledge is now available in

Why Should You Care? 1. An enormous amount of knowledge is now available in machine readable form as natural language text 2. Conversational agents are becoming an important form of human-computer communication 3. Much of human-human communication is now mediated by computers 1/1/2022 Speech and Language Processing - Jurafsky and Martin 3

Commercial World • Lot’s of exciting stuff going on, e. g. Powerset 1/1/2022 Speech

Commercial World • Lot’s of exciting stuff going on, e. g. Powerset 1/1/2022 Speech and Language Processing - Jurafsky and Martin 4

Google Translate 1/1/2022 Speech and Language Processing - Jurafsky and Martin 5

Google Translate 1/1/2022 Speech and Language Processing - Jurafsky and Martin 5

Google Translate 1/1/2022 Speech and Language Processing - Jurafsky and Martin 6

Google Translate 1/1/2022 Speech and Language Processing - Jurafsky and Martin 6

Web Q/A 1/1/2022 Speech and Language Processing - Jurafsky and Martin 7

Web Q/A 1/1/2022 Speech and Language Processing - Jurafsky and Martin 7

Deep Q/A to Jeopardy Winner 1/1/2022 Speech and Language Processing - Jurafsky and Martin

Deep Q/A to Jeopardy Winner 1/1/2022 Speech and Language Processing - Jurafsky and Martin 8

Weblog Analytics • Data-mining of Weblogs, discussion forums, message boards, user groups, and other

Weblog Analytics • Data-mining of Weblogs, discussion forums, message boards, user groups, and other forms of user generated media w Product marketing information w Political opinion tracking w Social network analysis w Buzz analysis (what’s hot, what topics are people talking about right now). 1/1/2022 Speech and Language Processing - Jurafsky and Martin 9

Major Topics (from Textbook) 1. 2. 3. 4. Words Syntax Semantics Pragmatics 1/1/2022 5.

Major Topics (from Textbook) 1. 2. 3. 4. Words Syntax Semantics Pragmatics 1/1/2022 5. Applications exploiting each Speech and Language Processing - Jurafsky and Martin 10

Applications • First, what makes an application a language processing application (as opposed to

Applications • First, what makes an application a language processing application (as opposed to any other piece of software)? w An application that requires the use of knowledge about human languages § Example: Is Unix wc (word count) an example of a language processing application? 1/1/2022 Speech and Language Processing - Jurafsky and Martin 11

Big Applications • • 1/1/2022 Question answering Conversational agents Summarization Machine translation Speech and

Big Applications • • 1/1/2022 Question answering Conversational agents Summarization Machine translation Speech and Language Processing - Jurafsky and Martin 12

Conversational Agents 1/1/2022 Speech and Language Processing - Jurafsky and Martin 13

Conversational Agents 1/1/2022 Speech and Language Processing - Jurafsky and Martin 13

Big Applications • These kinds of applications require a tremendous amount of knowledge of

Big Applications • These kinds of applications require a tremendous amount of knowledge of language. • Consider the following interaction with HAL the computer from 2001: A Space Odyssey 1/1/2022 Speech and Language Processing - Jurafsky and Martin 14

HAL from 2001 • Dave: Open the pod bay doors, Hal. • HAL: I’m

HAL from 2001 • Dave: Open the pod bay doors, Hal. • HAL: I’m sorry Dave, I’m afraid I can’t do that. 1/1/2022 Speech and Language Processing - Jurafsky and Martin 15

What’s needed? • Speech recognition and synthesis • Knowledge of the English words involved

What’s needed? • Speech recognition and synthesis • Knowledge of the English words involved w What they mean • How groups of words clump w What the clumps mean 1/1/2022 Speech and Language Processing - Jurafsky and Martin 16

What’s needed? • Dialog w It is polite to respond, even if you’re planning

What’s needed? • Dialog w It is polite to respond, even if you’re planning to kill someone. w It is polite to pretend to want to be cooperative (I’m afraid, I can’t…) 1/1/2022 Speech and Language Processing - Jurafsky and Martin 17

Caveat NLP has an AI aspect to it. w We’re often dealing with ill-defined

Caveat NLP has an AI aspect to it. w We’re often dealing with ill-defined problems w We don’t often come up with exact solutions/algorithms w We can’t let either of those facts get in the way of making progress 1/1/2022 Speech and Language Processing - Jurafsky and Martin 18

Course Material • We’ll be intermingling discussions of: w Linguistic topics § E. g.

Course Material • We’ll be intermingling discussions of: w Linguistic topics § E. g. Morphology, syntax, discourse structure w Formal systems § E. g. Regular languages, context-free grammars w Applications § E. g. Machine translation, information extraction 1/1/2022 Speech and Language Processing - Jurafsky and Martin 19

Topics: Linguistics • • • 1/1/2022 Word-level processing Syntactic processing Lexical and compositional semantics

Topics: Linguistics • • • 1/1/2022 Word-level processing Syntactic processing Lexical and compositional semantics Discourse processing Dialogue structure Speech and Language Processing - Jurafsky and Martin 20

Topics: Techniques • Finite-state methods • Context-free methods • Augmented grammars w Unification w

Topics: Techniques • Finite-state methods • Context-free methods • Augmented grammars w Unification w Lambda calculus • Probability models • Supervised machine learning methods • First order logic 1/1/2022 Speech and Language Processing - Jurafsky and Martin 21

Quotes • It must be recognized that the notion “probability of a sentence” is

Quotes • It must be recognized that the notion “probability of a sentence” is an entirely useless one, under any known interpretation of this term. • Noam Chomsky, 1969 1/1/2022 • Whenever I fire a linguist our system performance improves. • Frederick Jelinek, 1988 Speech and Language Processing - Jurafsky and Martin 22

Topics: Applications • • • 1/1/2022 Small w Spelling correction w Hyphenation Medium w

Topics: Applications • • • 1/1/2022 Small w Spelling correction w Hyphenation Medium w Word-sense disambiguation w Named entity recognition w Information retrieval Large w Question answering w Conversational agents w Machine translation • Stand-alone • Enabling applications • Funding/Business plans Speech and Language Processing - Jurafsky and Martin 23

Categories of Knowledge • • • 1/1/2022 Phonology Morphology Syntax Semantics Pragmatics Discourse Each

Categories of Knowledge • • • 1/1/2022 Phonology Morphology Syntax Semantics Pragmatics Discourse Each kind of knowledge has associated with it an encapsulated set of processes that make use of it. Interfaces are defined that allow the various levels to communicate. This usually leads to a pipeline architecture. Speech and Language Processing - Jurafsky and Martin 24

Ambiguity • Computational linguists are obsessed with ambiguity • Ambiguity is a fundamental problem

Ambiguity • Computational linguists are obsessed with ambiguity • Ambiguity is a fundamental problem of computational linguistics • Resolving ambiguity is a crucial goal 1/1/2022 Speech and Language Processing - Jurafsky and Martin 25

Ambiguity • Find at least 2 meanings of these headlines: w Drunk Gets Nine

Ambiguity • Find at least 2 meanings of these headlines: w Drunk Gets Nine Months In Violin Case w Farmer Bill Dies In House w Iraqi Head Seeks Arms w Enraged Cow Injures Farmer With Ax w Stud Tires Out w Eye Drops Off Shelf w Teacher Strikes Idle Kids w Squad Helps Dog Bite Victim 1/1/2022 Speech and Language Processing - Jurafsky and Martin 26

Ambiguity is Pervasive • Phonetics! w w w w w 1/1/2022 I mate or

Ambiguity is Pervasive • Phonetics! w w w w w 1/1/2022 I mate or duck I’m eight or duck Eye maid; her duck Aye mate, her duck I maid her duck I’m aid her duck I mate her duck I’m ate or duck I mate or duck Speech and Language Processing - Jurafsky and Martin 27

Dealing with Ambiguity Four possible approaches: • 1. Tightly coupled interaction among processing levels;

Dealing with Ambiguity Four possible approaches: • 1. Tightly coupled interaction among processing levels; knowledge from other levels can help decide among choices at ambiguous levels. 2. Pipeline processing that ignores ambiguity as it occurs and hopes that other levels can eliminate incorrect structures. 1/1/2022 Speech and Language Processing - Jurafsky and Martin 28

Dealing with Ambiguity 3. Probabilistic approaches based on making the most likely choices 4.

Dealing with Ambiguity 3. Probabilistic approaches based on making the most likely choices 4. Don’t do anything, maybe it won’t matter 1/1/2022 Speech and Language Processing - Jurafsky and Martin 29

Models and Algorithms • By models we mean the formalisms that are used to

Models and Algorithms • By models we mean the formalisms that are used to capture the various kinds of linguistic knowledge we need. • Algorithms are then used to manipulate the knowledge representations needed to tackle the task at hand. 1/1/2022 Speech and Language Processing - Jurafsky and Martin 30

Models • • 1/1/2022 State machines Rule-based approaches Logical formalisms Probabilistic models Speech and

Models • • 1/1/2022 State machines Rule-based approaches Logical formalisms Probabilistic models Speech and Language Processing - Jurafsky and Martin 31

Algorithms • Many of the algorithms that we’ll study will turn out to be

Algorithms • Many of the algorithms that we’ll study will turn out to be transducers; algorithms that take one kind of structure as input and output another. • Unfortunately, ambiguity makes this process difficult. This leads us to employ algorithms that are designed to handle ambiguity of various kinds 1/1/2022 Speech and Language Processing - Jurafsky and Martin 32

Paradigms • In particular. . w State-space search § To manage the problem of

Paradigms • In particular. . w State-space search § To manage the problem of making choices during processing when we lack the information needed to make the right choice w Dynamic programming § To avoid having to redo work during the course of a state-space search • CKY, Earley, Minimum Edit Distance, Viterbi, Baum-Welch w Classifiers § Machine learning based classifiers that are trained to make decisions based on features extracted from the local context 1/1/2022 Speech and Language Processing - Jurafsky and Martin 33

State Space Search • States represent pairings of partially processed inputs with partially constructed

State Space Search • States represent pairings of partially processed inputs with partially constructed representations. • Goals are inputs paired with completed representations that satisfy some criteria. • As with most interesting problems the spaces are normally too large to exhaustively explore. w We need heuristics to guide the search w Criteria to trim the space 1/1/2022 Speech and Language Processing - Jurafsky and Martin 34

Dynamic Programming • Don’t do the same work over and over. • Avoid this

Dynamic Programming • Don’t do the same work over and over. • Avoid this by building and making use of solutions to sub-problems that must be invariant across all parts of the space. 1/1/2022 Speech and Language Processing - Jurafsky and Martin 35