Language Technology and Data Analysis Laboratory LADAL Schoolbased

Language Technology and Data Analysis Laboratory (LADAL) School-based support infrastructure for digital humanities research at UQ Michael Haugh & Martin Schweinberger CRICOS code 00025 B

Who are you & What do you do Michael Haugh is Professor of Linguistics in the School of Languages and Cultures at the University of Queensland, Australia. His research spans pragmatics (language-inuse), intercultural communication and humour studies. He is a leading proponent of the Australian National Corpus and the recent establishment of the Language Technology and Data Analysis Laboratory (LADAL) and is currently working on a national language data commons initiative. Martin Schweinberger is a postdoctoral Research Fellow in Language Technologies at UQ. After obtaining his Ph. D in English linguistics, Martin worked at several German universities and was part of the Language Technology Group at the Computer Science department of Universität Hamburg. Martin specializes in computational approaches to quantitative analyses of language data. In his current role, Martin is one of the leading proponents of LADAL.

Humanities and Computing Potential Vast potential for the humanities by extending computational methods to humanities research • Big Data (e. g. mega corpora) • Multimodal / multipurpose data sets and digital archiving (e. g. Trove) • Mapping (combining data and geographical space) • Real world applications (applying corpus-based research in classrooms)

Humanities and Computing Challenges • Global • Increasing demand for training and education in computational approaches and digital tools • Training in the “digital” is essential for research in the humanities to keep pace with other fields of scientific endeavour and for Australian humanities researchers to remain internationally competitive • Training in such skills also increasingly demanded by students of the humanities

Humanities and Computing Challenges • Within HASS • Best practices and transparency in research (Access to / sharing of data) • (Over-)reliance on existing (commercial) tools • Unwillingness to give up on accustomed practices • Vastly different needs across disciplines (differences in experience and expectations) • Lack of programs, materials and professional training (both for general and specialized audiences)

Humanities and Computing Solutions Strategic combinations of general purpose and specialised training • at various levels of expertise • designed for different audiences • from general introductions to highly specific methods

Language Technology and Data Analysis Laboratory (LADAL) • HASS e. Research support infrastructure for digital HASS the UQ School of Languages and Cultures • Assists in the use of data analytics and digital research tools • Enhances existing research programs • Offers pathways into new research possibilities • Components • Specialist computing lab for language-based computational and experimental work (the Computational and Experimental Workshop) • Online virtual lab (the LADAL website: https: //slcladal. github. io/index. html)

Language Technology and Data Analysis Laboratory (LADAL) • Enables development of skills in • Digital tools and data management • Computational methods and (basic) programming skills • Data extraction / transformation / processing • Data visualization (including geospatial mapping and interactive web apps) • NLP applications (text analysis) and various statistical procedures (including classification and machine learning)

Language Technology and Data Analysis Laboratory (LADAL) • Services • Specialised training/support (workshops) on digital research methods and technologies • Information and self-guided study materials • Hands-on practical tutorials on topics relating to digital tools, computational methods for data extraction and processing, data visualization, and statistical analyses (learning to “code”) • Face-to-face consultations

Language Technology and Data Analysis Laboratory (LADAL)

What we can do for you Aims of LADAL • Increase quality of research (Best Practice, assure high standards quality and replicabilty) • Allow UQ reseachers to make use of digital methods and big data • Enable researchers to pusue new pathways by using innovative methods and new types of data (social media data, machine learning, etc. ) The LADAL is still in the process of being established and we are looking for input to find out what aspects of data analytics are useful not only for UQ SLC researchers but also with respect to skills in graduates that would render them attractive on the job marked and for Australian companies.

Contact Details Professor Michael Haugh | Head of School of Languages and Cultures michael. haugh@uq. edu. au Dr Martin Schweinberger Postdoctoral Research Fellow In Language Technologies School of Languages and Cultures m. schweinberger@uq. edu. au https: //slcladal. github. io/index. html
- Slides: 12