Developments in corpusbased translation studies A bibliometric approach
Developments in corpusbased translation studies A bibliometric approach Gernot Hebenstreit University of Graz
Starting point This paper aims at giving an overview of the development of corpus based methodologies in research on translation and interpreting. Analyzing bibliographic data on relevant publications this study tries to give answers to the following questions: Which branches of translation/interpreting studies have most interest in “introducing” or “incorporating” corpora? Of what kind are the research questions that are to be answered based on information provided by corpora? Is there a relation between the adherence to a school of thought and the interest in corpora? What’s the relation of quantitative vs. qualitative research? Which methodological questions are being raised? Are there preferred design patterns for building corpora? What role does tagging play? Of what kind are the desiderata voiced in terms of corpus technology? Developments in coprus-based translation studies
Bibliometrics? • Measuring academic activity – Publications as indicator of scientific activity – Only published knowledge is scientific knowledge – Perception of achievements is documented in citations – Derek J. de Solla Price, Eugene Garfield (Institute of Scientific Information) • Mainly used for assessing academic activity Developments in coprus-based translation studies
Perspectives of bibliometrical analysis • On Producers – Authors, teams, institutions, fields, nations • Scholars as writers choosing occasions, topics, methods, arguments, style • As linkers choosing documents to cite or to refer to • As submitters choosing journals, ways of publication • Collaborators choosing institutions, projects, co-authors Developments in coprus-based translation studies
Perspectives of bibliometrical analysis • Artefacts – Articles, books, conference papers, journals, conferences • Concepts – Words in the titles or texts, terminology, purpose or motivation of a citation Developments in coprus-based translation studies
Methods – Purely scientometric • Publication counting • Citation counting – Imported from other fields of research • • • Word anaylsis Text analysis Network analysis Questionaires Interviews – Mix of quantitative and qualitative methods Developments in coprus-based translation studies
Considerations concerning the counting of publications • Limited to data to be gained from bibliographic databases – – – – Publication rates in a period of time Number of authors Number of publications Number of document types Languages of publications Publishing houses Distribution of articles in journals and collecive volumes – Author's origin in terms of country or affiliation Developments in coprus-based translation studies
What is a publication? • • • Publications available for sale Patents “gray literature” Audio-visual media Articles in online journals Internet documents Reviews Letters to editors Thesis, dissertations Developments in coprus-based translation studies
Counting methods • Simple or weighted? • All types of publications to be treated equally? – – – Articles Monographs Edited volumes Proceedings … • How to deal with co-authorship? Developments in coprus-based translation studies
Word counts in bibliometrical analysis • Words in – Titles – Keywords – abstracts • Terms/categories reflect a very specific perspective of a given body of work – Indexing = external view • Problem of consistency • Indexer effect – Title, abstracts = internal view Developments in coprus-based translation studies
Problem of databases • Coverage – Relevant publications • Selection criteria of database provider – In time Developments in coprus-based translation studies
Databases used for this study • BITRA • TSA (Translations Studies Abstracts, St. Jerome) • TSB (Translation Studies Bibliography, Benjamins) • Abstracts, Keywords • Search query: KW= “corpus” or “corpora” Developments in coprus-based translation studies
Number of publications BITRA TSB TSA 575 691 512 100 90 Obviously publications of 2009 and 2010 not yet covered by databases 80 70 BITRA 60 TSB 50 TSA 40 30 20 10 0 91 19 92 19 93 19 94 19 95 996 997 998 999 000 001 002 003 004 005 006 007 008 009 010 2 19 1 1 1 2 2 2 2 2 1 Developments in coprus-based translation studies
Comparison of keywords in abstracts TSB 1 CORPORA 2 TSA TSB 14 ANALYSIS EXPLICITATION TRANSLATOR 15 LSP MULTILINGUAL 3 LINGUISTICS LANGUAGES 16 LITERARY TARGET 4 RESEARCH LEXICAL 17 DIDACTICS SOURCE 5 DISCOURSE LINGUISTIC 18 LANGUAGE TOOLS 6 CONTRASTIVE TRANSLATIONAL 19 TRANSLATOR TERMINOLOGICAL 7 INTERPRETINGFINNISH 20 MACHINE PORTUGUESE 8 APPROACH 21 PARALLELISM ALIGNMENT 9 TERMINOLOGYLINGUISTICS 22 GRAMMAR METHODOLOGY 10 THEORY TRANSLATORS TSA CONTRASTIVE BILINGUAL 23 LEXICOGRAPHY DICTIONARIES 11 EQUIVALENCE ARTICLE 24 AUTOMATIC SEMANTIC 12 TOOL 25 LINGUISTIC EXPLICITATION COMPARABLE 13 COMPARATIVE EQUIVALENTS Developments in coprus-based translation studies
Keywords diachronically (TSB) KEYWORDS_TSB_2002_ 2003 Key word TRANSLATION CORPUS CORPORA LINGUISTICS RESEARCH TOOL MACHINE CONTRASTIVE AUTOMATIC WEB INTERPRETING WEBSITE TERMINOLOGY APPROACH WWW DISCOURSE PARALLELISM AIDED LSP EQUIVALENCE DIDACTICS INTERNET THEORY TRANSLATOR EMPIRICAL KEYWORDS_TSB_2004_ 2005 Key word CORPUS CORPORA TRANSLATION RESEARCH LINGUISTICS CONTRASTIVE DISCOURSE COMPARATIVE INTERPRETING TERMINOLOGY ANALYSIS DIDACTICS PARALLEL AUTOMATIC MACHINE LEXICON BIBLE THEORY ICT APPROACH METHODOLOGY COMPUTER WEB TOOL STUDIES KEYWORDS_TSB_2006_ 2008 Key word CORPUS CORPORA TRANSLATION RESEARCH CONTRASTIVE LINGUISTICS DISCOURSE TOOL COMPARATIVE MACHINE AUTOMATIC AIDED INTERPRETING TRANSLATOR DIDACTICS LITERARY WEB TRAINING WEBSITE MEDICAL WWW LSP EMPIRICAL DUBBING PARALLEL Developments in coprus-based translation studies KEYWORDS_TSB_2009_ 2010 Key word CORPUS CORPORA TRANSLATION PARALLEL TRANSLATOR DISCOURSE DIDACTICS LINGUISTICS TRAINING EXPLICITATION ANALYSIS LITERARY TEACHING SCIENTIFIC BITEXT TERMINOLOGY INTERPRETING STYLISTICS RESEARCH TRANSLATOR'S TOOL LANGUAGE EVALUATION EXPLANATION MODALITY
Keywords diachronically TSA 1991_2001 Key word CORPORA CORPUS TRANSLATION PARALLEL ENGLISH TRANSLATORS BASED LANGUAGE MULTILINGUAL BILINGUAL TERMINOLOGY TEXTS EQUIVALENTS ALIGNMENT TEXT COMPARABLE TRANSLATOR TRANSLATED TRANSLATIONS TRANSLATIONAL CONTRASTIVE USING EXTRACTION AUTOMATIC CREATIVITY 2002_2003 Key word TRANSLATION CORPORA CORPUS TRANSLATIONAL PARALLEL ENGLISH LINGUISTICS EXTRACTION TRANSLATOR BILINGUAL STUDIES CONTRASTIVE TEXTS LINGUISTIC TRANSLATORS LANGUAGE TERMINOLOGY USING BASED TERMINOLOGICAL TRANSLATIONS TRANSLATED ANALYSIS ALIGNMENT WEB 2004_2005 2006_2008 Key word CORPUS TRANSLATION CORPORA TRANSLATED ENGLISH MULTILINGUAL PARALLEL STUDY FINNISH BASED TERMINOLOGY STUDY NORWEGIAN SPANISH PARALLEL UNIVERSALS EXPLICITATION TRANSLATORS TEXTS VERBS CONTRASTIVE LINGUISTICS TRANSLATIONS CONTRASTIVE UNIVERSALS TRANSLATIONS SWEDISH TEXTS INVESTIGATING MULTILINGUAL EXTRACTING TEXT LANGUAGE SPECIALIZED ITALIAN TRANSLATOR ANALYSIS LEXICAL TECHNICAL LANGUAGE TRANSLATE REPRESENTATIVENESS LANGUAGES ANALYSIS Developments in coprus-based translation studies 2009_2010 Key word CORPUS TRANSLATION CORPORA PARALLEL ENGLISH BASED CONTRASTIVE STUDY CHINESE COMPARABLE TRANSLATOR TRANSLATING TRANSLATIONS TRANSLATORS SPANISH TRANSLATIONAL WEB PHRASEOLOGY TRANSLATED MODAL PARTICLES BILINGUAL SPECIALIZED
Indexing compared TSB - BITRA 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 BITRA GENRE THEORY ENGLISH LINGUISTICS TEACHING TECHNICAL MACHINE_TRANSLATION GRAMMAR DOCUMENTATION PROBLEM DICTIONARIES LITERATURE SPANISH LEGAL GERMAN INTERPRETING FRENCH QUALITY TERMINOLOGY PHRASEOLOGY NORUEGO PORTUGUESE INTERFERENCE MEDICINE NORMALIZATION 137 135 126 98 86 79 72 70 58 54 39 31 27 25 21 21 17 16 16 15 14 13 12 12 11 TSB CORPUS_LINGUISTICS TRANSLATION COMPARATIVE_RESEARCH_CONTRASTIVE_RESEARCH TERMINOLOGY MACHINE_TRANSLATION ANALYSIS RESEARCH TOOL_TRANSLATION_TOOL TRANSLATION_STUDIES EMPIRICAL_RESEARCH_EXPERIMENTAL_RESEARCH_SURVEY GRAMMAR DIDACTICS TEACHING LANGUAGE_FOR_SPECIAL_PURPOSES LSP SEMANTICS STRATEGY_PROCEDURE THEORY_TRANSLATION_THEORY_INTERPRETING_THEORY PARALLELISM LEXICON ASSESSMENT_EVALUATION METHODOLOGY EQUIVALENCE_NON-EQUIVALENCE ALIGNMENT CONTRASTIVE_LINGUISTICS Developments in coprus-based translation studies 155 129 81 78 77 73 64 57 44 41 40 39 39 31 31 30 30 30 29 28 27 27 26 25 25
Indexing diachronic TSB 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 1991 -2001 CORPUS_CORPORA CORPUS_LINGUISTICS TRANSLATION TERMINOLOGY RESEARCH ANALYSIS COMPARATIVE_RESEARCH_CONTRASTIVE_RESEARCH GRAMMAR MACHINE_TRANSLATION_AUTOMATIC_TRANSLATION TOOL_TRANSLATION_TOOL TRANSLATION_STUDIES LINGUISTIC_APPROACH_LINGUISTICS PARALLELISM THEORY_TRANSLATION_THEORY_INTERPRETING_THEORY EMPIRICAL_RESEARCH_EXPERIMENTAL_RESEARCH_SURVEY EQUIVALENCE_NON-EQUIVALENCE LANGUAGE_FOR_SPECIAL_PURPOSES LSP SEMANTICS STRATEGY_PROCEDURE DIDACTICS TEACHING TERM_EXTRACTION ALIGNMENT LITERARY_APPROACH_LITERARY_DISCOURSE_LITERATURE 2004 -2005 CORPUS_CORPORA CORPUS_LINGUISTICS TRANSLATION COMPARATIVE_RESEARCH_CONTRASTIVE_RESEARCH ANALYSIS MACHINE_TRANSLATION_AUTOMATIC_TRANSLATION TERMINOLOGY LEXICON TRANSLATION_STUDIES METHODOLOGY DIDACTICS RESEARCH TEACHING BIBLE_TRANSLATION_BIBLE_RELIGIOUS_DISCOURSE_ COMPUTER_SCIENCE_IT_ICT SEMANTICS STRATEGY_PROCEDURE UNIVERSAL EMPIRICAL_RESEARCH_EXPERIMENTAL_RESEARCH_SURVEY PARALLEL_CORPUS_PARALLEL_CORPORA PARALLEL_TEXT TERM_EXTRACTION THEORY_TRANSLATION_THEORY_INTERPRETING_THEORY TOOL_TRANSLATION_TOOL CASE_STUDY Developments in coprus-based translation studies
Work in progress • Refining query • Interpretation of data • Among the identified problems: – inconsistency of terminology used by scholars (e. g. “parallel”, “comparable” corpus) – Separate contrastive studies publications from translation studies publications Developments in coprus-based translation studies
- Slides: 19