Language and Speech Processing at Johns Hopkins University

  • Slides: 31
Download presentation
Language and Speech Processing at Johns Hopkins University March 5, 2010 Center for Language

Language and Speech Processing at Johns Hopkins University March 5, 2010 Center for Language and Speech Processing The Johns Hopkins University

The JHU Center for Language and Speech Processing CLSP was established in 1992 with

The JHU Center for Language and Speech Processing CLSP was established in 1992 with outside support to promote research and education in the science and technology of speech and language. Electrical and Computer Engineering Computer Science CLSP Biomedical Engineering Cognitive Science Applied Math & Statistics Human Language Tech. Center of Excellence

Speech and Language Faculty at JHU (does not list senior research staff, postdocs, students,

Speech and Language Faculty at JHU (does not list senior research staff, postdocs, students, …) n Electrical & Computer Eng n n n n n Chris Callison-Burch Jason Eisner David Yarowsky Applied Math & Statistics n Carey Priebe Cognitive Science / Psychology n n n Computer Science n n Andreas Andreou Mounya Elhilali Hynek Hermansky Frederick Jelinek (Director) Damianos Karakos Sanjeev Khudanpur n Biomedical Engineering n n Eric Young Applied Physics Laboratory n n n Justin Halberda Geraldine Legendre Kyle Rawlins Paul Smolensky (Asst Dir) Colin Wilson James Mayfield Christine Piatko HLT Center of Excellence n n Kenneth Church Mark Dredze` n n Aren Jansen Ben Van Durmé

CLSP Vision Statement n n Understand how human language is used to communicate ideas/thoughts/information.

CLSP Vision Statement n n Understand how human language is used to communicate ideas/thoughts/information. Develop technology for machine analysis, translation, and transformation of multilingual speech and text.

CLSP Mission Statement 1. Research n n n 2. Education n 3. Advance state

CLSP Mission Statement 1. Research n n n 2. Education n 3. Advance state of the art in our interdisciplinary field Focus on developing key algorithms and statistical models Focus on strategic languages, including low-resource languages Attract the best students and train them to be leaders Offer full spectrum of courses Conduct annual international summer school at JHU Outreach n n Be responsive to government and industry problems Serve as a “hub” for the HLT community Organize international summer research workshop at JHU Welcome short- and long-term visitors

Research: Primary Areas n Speech Recognition n n Speech Applications n n n Acoustic

Research: Primary Areas n Speech Recognition n n Speech Applications n n n Acoustic processing Acoustic-phonetic modeling Pronunciation modeling Language modeling Keyword spotting Spoken term detection Speaker verification Language identification Speech Science n n Auditory physiology Neuromorphic signal processing n Natural Language Processing n n n Machine Translation n Low-resource languages Arabic and Chinese Knowledge-Base Population n Morphological analysis Syntactic analysis (parsing) Information extraction Co-reference resolution Automatic content extraction Inference and learning Machine Learning n n n Small-sample learning Structured prediction Minimally supervised learning

Sponsored Research in Speech & Language in WSE is ≈ $2. 5 M/year P

Sponsored Research in Speech & Language in WSE is ≈ $2. 5 M/year P Investigator Project Title (Granting Agency) Period Amount Jelinek Investigation of Meaning Representation in Language Understanding (NSF) 10/05 -01/10 $2. 5 M Jelinek Cross Cutting Research Workshops in Intelligent Information Systems (NSF) 09/07 -08/11 $830 K Eisner Finite-State Machine Learning on Strings and Sequences (NSF) 02/04 -01/10 $500 K Khudanpur Rosetta: An Analyst Co-pilot (DARPA/IBM) 10/05 -04/11 $3. 4 M Eisner Learned Dynamic Prioritization (NSF) 09/10 -08/14 $1. 2 M Hager Gesture Induction for Manipulative and Interactive Tasks (NSF) 02/06 -01/10 $490 K Smolensky* Unifying the Science of Language (NSF: IGERT) 05/06 -04/11 $3. 0* M Vice Provost* Human Language Technology Center of Excellence (MPO) 01/07 -01/17 $50* M Andreou Energy Efficient Organic Semiconductor Circuits (DOE) 05/07 -04/10 $660 K Yarowsky Multi-Level Modeling of Language and Translation (NSF) 06/07 -06/10 $400 K Karakos Novel Approaches to Unsupervised Classification via ISPDTs (NSF) 09/07 -08/10 $300 K Callison-Burch DARPA Computer Science Study Group (DARPA) 02/08 -02/09 $93 K Jelinek Research Workshops in Intelligent Information Systems (Google) 06/08 -05/10 $270 K Khudanpur Self-Supervised Discriminative Training of Statistical Language Models (NSF) 09/08 -08/09 $137 K Khudanpur Self-Training for ASR in Low Resource Languages (BBN) 09/09 -08/10 $101 K

CLSP Mission Statement 1. Research n n n 2. Education n 3. Advance state

CLSP Mission Statement 1. Research n n n 2. Education n 3. Advance state of the art in our interdisciplinary field Focus on developing key algorithms and statistical models Focus on strategic languages, including low-resource languages Attract the best students and train them to be leaders Offer full spectrum of courses Conduct annual international summer school at JHU Outreach n n Be responsive to government and industry problems Serve as a “hub” for the HLT community Organize international summer research workshop at JHU Welcome short- and long-term visitors

Education: Interdisciplinary Environment n Who and where n n Coursework n n Interdisciplinary core

Education: Interdisciplinary Environment n Who and where n n Coursework n n Interdisciplinary core curriculum (extends dept. requirements) Variety of other relevant courses (growing list, new plans) International 2 -week summer school Research n n n Ph. D, MSE, and BS students from multiple depts. Shared interdisciplinary offices in CSEB Shared technical perspective and computing infrastructure Students do research from the start Students work with faculty from multiple departments and HLTCOE Other learning n n n Distinguished outside speaker every week Student speaker and town meeting every week Reading groups and conference travel

Sample Courses for an MSE in Human Language Technology Course Number Course Title Instructor

Sample Courses for an MSE in Human Language Technology Course Number Course Title Instructor CS 600. 465 Natural Language Processing Eisner CS 600. 466 Information Retrieval and Web Agents Yarowsky CS 600. 425 Declarative Methods Eisner AMS 550. 732 Pattern Recognition Priebe COG 050. 320 Introduction to the Syntax of Natural Language Legendre COG 050. 325 Sound Structure in Natural Language Burzio COG 050. 825 Optimality Theory Smolensky ECE 520. 445 Introduction to Speech and Audio Processing Elhilali ECE 520. 447 Introduction to Information Theory and Coding Jelinek ECE 520. 651 Random Signal Analysis Khudanpur ECE 520. 666 Information Extraction from Speech and Text Jelinek ECE 520. 674 Information Theoretic Methods in Statistics Khudanpur ECE 520. 682 Computational Systems Neuroscience Elhilali ECE 520. 735 Sensory Information Processing Andreou

Education: Track Record n n n WSE has the 2 nd largest university group

Education: Track Record n n n WSE has the 2 nd largest university group in the U. S. working on Human Language Technology 38 Ph. Ds awarded, many more MSEs CLSP Ph. Ds presently hold research/faculty positions at n n n n Carnegie Mellon University U. of Massachusetts, Amherst Swarthmore College Michigan State University Hong Kong Polytechnic Univ. Bogazici University (Turkey) U. of Karlsruhe (Germany) Saarland University (Germany) n CLSP Ph. Ds presently hold senior technical/research positions at n n n Apptek BBN Convergys e-Scription Fair Isaac Google (several) Microsoft (several) MITRE IBM (several) NSA (several) Nuance (several) SRI International

CLSP Mission Statement 1. Research n n n 2. Education n 3. Advance state

CLSP Mission Statement 1. Research n n n 2. Education n 3. Advance state of the art in our interdisciplinary field Focus on developing key algorithms and statistical models Focus on strategic languages, including low-resource languages Attract the best students and train them to be leaders Offer full spectrum of courses Conduct annual international summer school at JHU Outreach n n Serve as a “hub” for the HLT community Be responsive to government and industry problems Organize international summer research workshop at JHU Welcome short- and long-term visitors

JHU Summer Workshops in HLT: Integrating Research and Education n Organized by JHU on

JHU Summer Workshops in HLT: Integrating Research and Education n Organized by JHU on behalf of the Human Language Technology field n n n Mixed teams of senior and student researchers n n 3 teams per summer (since 1995) n selected & refined from 25 proposals by “interactive peer review” each team comes to JHU for 8 weeks of intense collaborative research Team ≈ 3 academics, 1 industry, 1 govt, 2 -3 grad students, 2 undergrads 30+ participants 8 weeks 15 years More than 160 star students trained in HLT research (1998— 2007) Outcomes n n n Numerous research breakthroughs New, long-term collaborations, tangible knowledge transfer Diverse expertise, research infrastructure, data resources

A Few of Many Workshop Accomplishments n A small sample of research results and

A Few of Many Workshop Accomplishments n A small sample of research results and their wider impact n Statistical Machine Translation (1999) n n MEAD Multilingual Multi-document Summarization (2001) n n Improved ASR technology for conversational Arabic Moses Machine Translation Repository (2006) n n Major breakthrough in speaker recognition technology Factored Language Models (2002) n n 100 s of worldwide users, active developers in the community Super. SID: High-level information for Speaker-ID (2002) n n GIZA++ is extensively used to build SMT systems even today The de facto standard in statistical machine translation More than 100 refereed publications n Detailed technical reports also available on CLSP web-site 15

Human Language Technology Center of Excellence at JHU • Long-term research mission: Automatically analyze

Human Language Technology Center of Excellence at JHU • Long-term research mission: Automatically analyze a wide range of speech, text, and document images in multiple languages. • Founded with government support in 2007 • Has brought many new researchers and research challenges into the CLSP community • Aggressively hiring the top new Ph. D. s nationally

Human Language Technology Center of Excellence at JHU Whiting School of Engineering JHU Provost

Human Language Technology Center of Excellence at JHU Whiting School of Engineering JHU Provost Administrative Staff Security Staff Center for Language and Speech Processing Executive Director Sponsor RD Leadership Sponsor Technical Board Director of Researchers Sponsor Researchers

Prof. Andreas G. Andreou: Sensory Information Processing in Natural and Synthetic Systems Research •

Prof. Andreas G. Andreou: Sensory Information Processing in Natural and Synthetic Systems Research • Principles of sensory information processing in biology. • Sensory communication. • Algorithms and processor architecture design for energy efficient acoustic, speech and vision processing. • Physics of sensing and computation. Applications Algorithms for robust ASR • Robust acoustic feature representation and dimensionality reduction • Algorithms and architecture optimization for Chip Multi Processors (CMP) in Exascale systems Multimodal scene analysis • Active and passive processing for scene analysis (visual & auditory) • Acoustic and EM micro-Doppler imaging Bio-inspired systems • Energy efficient microsystems for processing what and where in natural environments.

Prof. Chris Callison-Burch: Statistical Machine Translation Research • • • Statistical machine translation Syntactic

Prof. Chris Callison-Burch: Statistical Machine Translation Research • • • Statistical machine translation Syntactic translation models Low resource languages Data-driven paraphrasing Evaluation measures, creation of shared data resources

Prof. Kenneth Church: Human Language Technology (HLT) at Scale Applications Research • • Speech

Prof. Kenneth Church: Human Language Technology (HLT) at Scale Applications Research • • Speech Processing at Scale Language Processing at Scale Web Search at Scale Mining Speech/Language with Zero Linguistic Resources • • • Web search Cloud computing Language modeling Text analysis Spelling correction Word-sense disambiguation Terminology Translation Lexicography Compression Speech recognition and synthesis OCR

Prof. Mark Dredze: Applications of Machine Learning to Real-World Text Processing Applications Research •

Prof. Mark Dredze: Applications of Machine Learning to Real-World Text Processing Applications Research • Adaptation of machine learning algorithms between text domains • Large scale information processing and learning • Intelligent user interfaces for information management Domain adaptation • Extending NLP models to new datasets Cross-domain learning • Applying NLP techniques to languages with few resources Knowledge base population • Building large high precision knowledge bases from text Intelligent email • Improved email clients by aiding the user with artificial intelligence

Prof. Jason Eisner: Algorithms and Models for Language Processing Research • Novel algorithms for

Prof. Jason Eisner: Algorithms and Models for Language Processing Research • Novel algorithms for NLP • Bayesian statistical models of linguistic structure • Machine learning (structured prediction, novel training objectives) • Declarative formalisms for grammars and algorithms Applications Parsing sentence structure • Faster and more accurate algorithms • Unsupervised or cross-lingual learning Machine translation • Model syntax, structure, word order • Combinatorial methods for translation and for training models Morphology / phonology • Word spelling and pronunciation • Variant word forms (conjugation, transliteration, misspelling, …) Information integration • Truth maintenance • Deductive databases • Reasoning from facts in text

Prof. Mounya Elhilali: Reverse Engineering the Neurobiology of Speech and Audio Processing Applications Research

Prof. Mounya Elhilali: Reverse Engineering the Neurobiology of Speech and Audio Processing Applications Research Goals • Information representation and computational strategies employed by the brain • Sound perception in distorted or complex acoustic environments • Speech intelligibility in noise and distortions • Auditory scene analysis and speaker segregation • Speech enhancement • Hearing prostheses • Adaptive audio systems • Robotics and autonomous systems • Object tracking in sensor networks • Communication channels • Microphone Design

Prof. Hynek Hermansky: Robust Acoustic Speech Processing Applications Technology • Proprietary techniques based on

Prof. Hynek Hermansky: Robust Acoustic Speech Processing Applications Technology • Proprietary techniques based on temporal cues in the signal and on artificial neural net postprocessing • Emulations of auditory processing in biology Speech recognition • what has been said? Speaker identification • who is speaking? Speaker verification • is the talker the one claimed to be? Language identification • which language is being used? Speech and audio coding • how to store/transmit the signal efficiently? Enhancement of degraded speech • how to make noise or reverberated speech easier listening to?

Prof. Aren Jansen: Knowledge-based Approaches to Speech Processing Applications Research • • Pursuit of

Prof. Aren Jansen: Knowledge-based Approaches to Speech Processing Applications Research • • Pursuit of more invariant representations of speech Unsupervised/semi-supervised learning of speech units Sparse representations and models Computational models of human speech perception Noise-Robust Speech Recognition • Invariance and efficiency through sparsity Low-Resource Speech Recognition • What can be done with little or no transcribed training data? Spoken Term Detection and Discovery • “Google” for speech documents • Query-by-example vs. text queries Large-Scale Speech Processing • Scaling speech technology to massive problem sizes

Prof. Frederick Jelinek: Statistical Speech Recognition and Machine Translation Research Interests • Statistical grammar

Prof. Frederick Jelinek: Statistical Speech Recognition and Machine Translation Research Interests • Statistical grammar and parsing • Signals and systems • ASR treatment of out-ofvocabulary words and phrases • Machine translation Statistical aspects of Automatic Speech Recognition (ASR) Language Modeling • Predicting next word given the past Reconstruction of ASR output • Create a grammatical sentence preserving the speaker’s intended meaning Rescoring of ASR output alternatives Search algorithms for ASR and Machine Translation

Prof. Damianos Karakos: Statistical Aspects of Speech and Language Applications Technology • Data fusion

Prof. Damianos Karakos: Statistical Aspects of Speech and Language Applications Technology • Data fusion and dimensionality reduction for improved inference in text classification. • Novel language modeling techniques for speech recognition. Speech recognition • Adaptation to the speech topic • Error corrective techniques Machine translation • System combination • Language modeling Document categorization • Automatic clustering into meaningful categories • Detection of topics of interest

Prof. Sanjeev Khudanpur: Statistical Modeling for Information Processing Applications Basic Research • • Stochastic

Prof. Sanjeev Khudanpur: Statistical Modeling for Information Processing Applications Basic Research • • Stochastic Modeling of Signals and Systems Parameter Estimation Model Structure Estimation Information Theory and Statistics Automatic speech recognition • Domain and genre adaptation • Pronunciation variability modeling Machine translation (text & speech) • Output language word ordering • Context dependent translation Multimedia search and retrieval • Searching large speech archives • Content-based image/video search Robotic minimally invasive surgery • Automated skill assessment • Automated surgical training

Prof. Benjamin Van Durme: Computational Semantics and Large-Scale Text Processing Applications Research • Application

Prof. Benjamin Van Durme: Computational Semantics and Large-Scale Text Processing Applications Research • Application of theoretical semantics to problems in language technology • Streaming algorithms for efficient processing of large text collections Knowledge Acquisition • Enable “everyday” reasoning • Formal interpretation of generic sentences (e. g. , dictionary definitions) “Deep” Information Extraction • Infer implicit relations • Semantic language modeling • Recognize higher order modification of factoids Organizing Social Media • Dynamic clustering of authors, documents, feeds

Prof. David Yarowsky: Minimally Supervised Learning for Low-Resource Languages Applications Basic Research • Cross-language

Prof. David Yarowsky: Minimally Supervised Learning for Low-Resource Languages Applications Basic Research • Cross-language information projection • Cross-domain knowledge transfer • Co-training • Active learning and human computation • Creative bootstrapping from multiple knowledge sources Machine Translation • Translation discovery without aligned bilingual text • Exploiting language universals and language family relationships Natural Language Processing • Word sense disambiguation • Inflectional and derivational morphology Information Extraction • Biographic fact extraction • Characterizing communicants • Informal genres

Linguistics and Human Language Processing Prof. Paul Smolensky: Architecture of Universal Grammar Prof. Colin

Linguistics and Human Language Processing Prof. Paul Smolensky: Architecture of Universal Grammar Prof. Colin Wilson: Theoretical, Experimental, & Computational Phonology Prof. Geraldine Legendre: Syntax, Morphology, Acquisition Prof. Kyle Rawlins: Formal & Computational Semantics Prof. Justin Halberda: Word Learning in Children + a new professor … Human Sentence Processing

Lots of Great Ph. D. Students: The Next Big Things!

Lots of Great Ph. D. Students: The Next Big Things!