Diachronic Internet Corpus of English DICE Project Presentation 13. 12. 2006
Contents Project group & Client What is DICE? Project Plan & Requirements Specification Things done so far Things to be done next Problems & Solutions Prototype presentation Questions
Project Group Project Managers: - Thanyaporn Lerlerdthaiyanupap - Björn Bielesch Members: - Aleksi Vuorenmaa - Mika Salmi - Salla Kuisma - Tom Eklund - Riki Kawakami (Usability team)
Client Mark Kaunisto - Researcher - School of Modern Languages and Translation (University of Tampere) - Collected the corpus data when doing Ph. D thesis
What is DICE? Online web-based corpus Offering freely available Internet texts for linguistic research Non-fiction texts from the 16 th century to the present day DICE will be released under the GPL
Project Plan Incremental development model 4 iterations Most important features implemented first Additional features in the following increments Feedback from client and usability team
Things done so far In first increment phase – – – Database structure created Prototype Basic search Advanced search Result section
Things to be done next The second increment Update database structure Implement more functions – – – Presenting detailed information Sorting Administration tools System testing and usability evaluation
Problems & Solutions Priorities of implemented functions > Brainstorming > Review with the client and lecturer Generating dynamic searches on the internet > To be discussed Text Copyrights > Contacting authors concerning the copyrights > Consulting experts