Corpora and Teaching Language for Special Purposes Terms

  • Slides: 28
Download presentation
Corpora and Teaching Language for Special Purposes

Corpora and Teaching Language for Special Purposes

Terms • język specjalistyczny • sublanguage, scientific (technical) language, special subject language, special language,

Terms • język specjalistyczny • sublanguage, scientific (technical) language, special subject language, special language, language for special purposes (LSP) • Fachsprache • lange de spécialité

Definition Język specjalistyczny to “szczególna postać języka ogólnonarodowego, przystosowana do możliwie precyzyjnego opisu określonej

Definition Język specjalistyczny to “szczególna postać języka ogólnonarodowego, przystosowana do możliwie precyzyjnego opisu określonej gałęzi wiedzy lub techniki. Różni się od języka ponaddialektalnego przede wszystkim słownictwem fachowym, niejednokrotnie zawierającym wiele internacjonalizmów, oraz składnią, jak również częstotliwością użycia określonych form gramatycznych (np. passivum w niemieckim języku technicznym) (Szulc 1984: 104 w Roskowski 2005: 174)

Characteristic features • Określony zakres tematyczny ograniczony do danej dziedziny • Ograniczenia struktury leksykalnej,

Characteristic features • Określony zakres tematyczny ograniczony do danej dziedziny • Ograniczenia struktury leksykalnej, składniowej i semantycznej • Niestandardowe zasady gramatyczne • Wysoka częstotliwość występowania niektórych konstrukcji • Specyficzne cechy struktury tekstowej • Wykorzystanie specjalnych symboli (Lehrberger 1886, 1982: 102 w Roszkowski 2005: 176)

 • Different levels of specialisation – English for academic purposes vs. English for

• Different levels of specialisation – English for academic purposes vs. English for chemistry – English for medical purposes vs. English for urology

Teaching LSP • Lack of textbooks or other teaching materials • Teacher’s lack of

Teaching LSP • Lack of textbooks or other teaching materials • Teacher’s lack of expertise

How can corpora help? • Syllabus design • Production of teaching materials -authentic examples

How can corpora help? • Syllabus design • Production of teaching materials -authentic examples – Text types – Vocabulary and grammar in context

Types of corpora helpful in teaching LSP • subcorpora (sections of general corpora) e.

Types of corpora helpful in teaching LSP • subcorpora (sections of general corpora) e. g. section J of Brown or LOB • special corpora (Air Trafic Control Corpus) • monolingual corpora • bilingual corpora – comparable – parallel

Example 1 Academic language

Example 1 Academic language

Verb frequencies

Verb frequencies

Academic Word List • developed at the School of Linguistics and Applied Language Studies

Academic Word List • developed at the School of Linguistics and Applied Language Studies at Victoria University of Wellington, New Zealand • purpose: to be used by teachers and students working alone as part of a preparation for tertiary level study • Coxhead, Averil (2000) A New Academic Word List. TESOL Quarterly, 34(2): 213 -238 • http: //www. victoria. ac. nz/lals/resources/acade micwordlist/

Academic corpus • a written corpus of academic English • approximately 3, 500, 000

Academic corpus • a written corpus of academic English • approximately 3, 500, 000 running words • divided into four faculty sections: Arts, Commerce, Law and Science • Each of these faculty sections contained approximately 875, 000 running words. • Each faculty section was divided into seven subject areas of approximately 125, 000 running words.

Selection criteria • The AWL contains 570 word families • Range. The AWL families

Selection criteria • The AWL contains 570 word families • Range. The AWL families had to occur in the Arts, Commerce, Law and Science faculty sections of the Academic Corpus (see below for details on the Academic Corpus). The word families also had to occur in over half of the 28 subject areas of the Academic Corpus. Just over 94% of the words in the AWL occur in 20 or more subject areas. This principle ensures that the words in the AWL are useful for all learners, no matter what their area of study or what combination of subjects they take at tertiary level.

Selection criteria • Frequency. The AWL families had to occur over 100 times in

Selection criteria • Frequency. The AWL families had to occur over 100 times in the 3, 500, 000 word Academic Corpus in order to be considered for inclusion in the list. This principle ensures that the words will be met a reasonable number of times in academic texts. • Uniformity of frequency. The AWL families had to occur a minimum of 10 times in each faculty of the Academic Corpus to be considered for inclusion in the list. This principle ensures that the vocabulary is useful for all learners.

Words excluded from the AWL • Words occurring in the first 2, 000 words

Words excluded from the AWL • Words occurring in the first 2, 000 words of English. The AWL assumes knowledge of West's General Service List (GSL) (1953) as the basic vocabulary any learner should have before starting to learn academic vocabulary. • Narrow range words. Words which occurred in fewer than 4 faculty sections of the Academic Corpus or which occurred in fewer than 15 of the 28 subject areas of the Academic Corpus were excluded because they had narrow range. Technical or specialist words often have narrow range and were excluded on this basis

Words excluded from the AWL • Proper nouns. The names of places, people, countries,

Words excluded from the AWL • Proper nouns. The names of places, people, countries, for example, New Zealand, Jim Bolger and Wellington were excluded from the list. • Latin forms. Some of the most common Latin forms in the Academic Corpus were et al, etc, ie, and ibid.

Example 2 Język umów

Example 2 Język umów

(Roszkowski 2005: 184)

(Roszkowski 2005: 184)

Patterns of word frequencies Frequency

Patterns of word frequencies Frequency

Example 3 English for Biology

Example 3 English for Biology

Corpus of English for Biology • source: Flowerdew 2001 • purpose: course design –

Corpus of English for Biology • source: Flowerdew 2001 • purpose: course design – the selection and grading of items for the syllabus – the authentic contextualization of theses items in teaching materials • teaching situation: – science students at the English-medium Sultan Quaboos University, Oman; foundation course – science course and English course • corpus: transcripts of 25 hours of lectures in biology + supporting reading material • size 104, 483 tokens

Example (Flowerdew 2001: 79)

Example (Flowerdew 2001: 79)

Example 4 My project English for Urology

Example 4 My project English for Urology

Design criteria

Design criteria

Analysis • • wordlist keyword list of clusters collocations

Analysis • • wordlist keyword list of clusters collocations

Terminology extraction • wordlists • clusters • keywords http: //www. ling. pl/

Terminology extraction • wordlists • clusters • keywords http: //www. ling. pl/

Examples of comparable corpora • Multext – http: //www. lpl. univ-aix. fr/projects/multext/ • Multext-East

Examples of comparable corpora • Multext – http: //www. lpl. univ-aix. fr/projects/multext/ • Multext-East – http: //nl. ijs. si/ME/