Typecraft v2.5

Workshop on Text and Speech Annotation

Revision as of 23:15, 15 August 2010 by Dorothee Beermann

Central Institute of Indian Languages


Linguistic Description and the Creation of Digital Language Resources



Aim of the Workshop

The workshop will address the creation and use of digital speech and text resources, with a particular focus on the role that such resources play in linguistic research. The course will give a general introduction to building data collections, small language corpora and on-line linguistic knowledge bases.

The workshop is aimed at linguists interested in strengthening the empirical underpinning of their typological and/or theoretical work. Graduate students as well as faculty members are welcome.

Our focus will be on collaborative speech and text annotation, making use of the new facilities that modern web technology and linguistic software offer to linguists.

The workshop will feature four days of course work covering practical issues in the linguistic annotation of speech and text, as well as collaborative on-line editing as a means of creating public linguistic knowledge bases.

We will offer hands-on introductory courses on two tools. TypeCraft is a multi-lingual on-line database for text annotation developed at the Norwegian University of Science and Technology (NTNU), Trondheim, Norway, by Dorothee Beermann and Pavel Mihaylov. Praat is a freely available program for speech signal analysis developed by Paul Boersma and David Weenink of the University of Amsterdam, The Netherlands.

In addition to the introduction to text and speech annotation, we are planning several guest lectures on selected linguistic topics of particular relevance to language annotation and linguistic analysis.