Trondheim Workshop in Language Description and Documentation
September 6th - 9th 2010
New: Section: Final Session
© Jørn Adde The Trondheim Workshop in Language Description and Documentation
© Dorothee Beermann
addresses aspects of language description and documentation.
A special focus lies on comparative Bantu studies and the description of tone.
Courses cover practical issues relating to speech and text annotation and methodological aspects of conducting semantic fieldwork, techniques for eliciting data and the interaction between methodology and theory development.
Contents
WORKSHOP PROGRAM
The Workshop is supported by NTNU's Globalization Program
September 6th-9th | Monday | Tuesday | Wednesday | Thursday |
9:30-12:00
|
10:00 - 11:00 Reception 11:00-11:10 Opening of the Workshop by the Dean for the Humanities Kathrine Skretting Keynote: Documentary and descriptive linguistics: interdisciplinary perspectives Felix Ameka room DL33 |
9:30 - 12:00
Comparative Bantu Abdulaziz Lodhi room D118 |
9:30-10:30
Semantic Fieldwork Henrik Bergquist room D137 11:00-12:00 Praat Wim van Dommelen room D137 |
9:30-10:30
Comparative Bantu Absulaziz Lodhi room D111
Tone Constance Kutsch Lojenga' room D111 |
13:15 - 15:30
|
Semantic Fieldwork
Henrik Bergquist room DI172 |
Text Annotation Online
Dorothee Beermann room D112 |
Tone Constance Kutsch Lojenga' room D 137 |
Final Session: 14:00 - 15:30
Digitial Resources, Lexica, Grammars - Who needs what? Open Discussion Beermann, Bergquist, Lars Hellan, Kutsch Lojenga, Lodhi, van Dommelen and all participants Totalrommet, Hovedbygg |
17:15-19:00
coffee break 18:00-18:15 |
Case Study
Community Oriented Language Documentation for the Ghana-Togo-Mountain Languages Felix Ameka room DL31 |
Practical Workshop
Text Annotation Online Dorothee Beermann Language Lab |
Practical Workshop
Praat Wim van Dommelen Language Lab |
sponsored
Workshop Dinner |
Trondheim LDD-Workshop: Background Literature
Please follow the above title-link to get to the Background Literature page.
On that page you find several links for TextExcerts.
TextExcerpts can only be opened with a special password which you will receive in your workshop-folder.
REGISTRATION and GETTING AROUND IN TRONDHEIM
Registration time has passed. If you have question about free student places, please contact us by sending mail to the ldd.workshop.
Reaching Trondheim from the airport
After arrival in Trondheim you will find an airport bus called 'flybus', waiting in from of the terminal building. All flights are connected to buses that take you directly to Trondheim where they stop at several central locations.
The prize of the bus ticket is NOK 90,00.
For those booked in the Fru Schøller hotel, please ask the driver to stop at Dronningensgata.
For those booked in the Youth Hostel, please ask the driver to stop at Lademoen kirke ( Lademoen church).
The google map under STUDENT ACCOMMODATION shows you how to get from the Lademoen-kirke busstop to the Youth Hostel.
Meals and Workshop Dinner
Welcome Evening We have reserved a table for a welcome meeting on Sunday, 5th at the pub Den Gode Nabo. The pub has the biggest selection of beer in Trondheim and serves small meals in the Pizza-Hamburger category.
Hope to see you on Sunday for a welcome drink!
Lunch has been arranged for all registered workshop participants.
New Information: Cold lunch will be served at the Dragvoll mensa, Monday through Thursday.
The lunch on Monday and Thursday is sponsored by the workshop for all registered participants; the other days you are on your own.
On Thursday, right after lunch, a bus will take us from Dragvoll to the Final Session of the Workshop which takes place at the Gløshaugen campus.
The workshop dinner takes place at the Trondheim Festning.
Reception: 17:30
Buffet: 18:00
Those interested in vegetarian food please contact us.
STUDENT ACCOMMODATION
...Reviews of the Youth Hostel
The LDD has made a group reservation at the Trondheim Youth Hostel which allows us to offer very reasonably prized single rooms for the workshop period.
For a reservation of a room at the Trondheim Youth Hostel please send a short mail to: ldd.workshop
If you prefer to book a room in one of the many hotels in Trondheim you find more information at Trondheim's Offical Website
The way from the flight busstop: Lademoen kirke to the Youth Hostel (vandrerhjem):
<googlemap version="0.9" lat="63.436618" lon="10.428085" type="map" zoom="15" width="400" height="300" > 63.433913, 10.424083, Trondheim vandrerhjem AS Weidemanns vei 41 Trondheim, Sør-Trøndelag 6#B2758BC5 63.436489, 10.428097 Flightbus: Stop Lademoen kirke 63.436565, 10.431187 63.433364, 10.427497 63.433863, 10.424149 </googlemap>
MORE ABOUT THE LECTURES AND THE LECTURERS
Semantic Fieldwork
Semantic field work includes a number of issues that relate to methodology, data collection, and analysis.
As a separate branch of the linguistic enterprise, language documentation emphasizes the process of data collection and the annotation of data as central concerns that are treated as separate from the analytical aspect of language documentation.
There are some important implications of such a view that may benefit a wider and deeper semantic analysis of lesser described languages.
The lectures of this workshop will focus on the problem of translation in a documentation context and the role of metadata in the annotation of language materials resulting form language documentation.
Some important differences between language description and language documentation will also be discussed as a motivation for the central issues discussed in the lectures.
Text Annotation Online
Text Annotation or Interlinear Glossing (IG) is one of the most common methods used by linguists across fields.
This is not so surprising since IG results from a long tradition of aligning natural language text with one
or several lines of annotation consisting of a short-hand that indicates the text's internal grammatical structure.
IG is the main means to communicate linguistic data, making Interlinear Glosses the main currency of modern linguistics.
Yet, we as linguists do not share a common understanding concerning the function of IG neither in our daily research nor in our publications.
In this course we treat Interlinear Glosses as a language resource (unfortunately not yet a common assumption).
We discuss linguistic annotation standards, depth of annotation and data pertinence.
We will learn how to build small corpora, and how to be more conscientious about metadata.
The creation of reusable data and practices of data and knowledge sharing will be addressed.
We will make use of practical examples from different languages to show how to build pertinent and reusable linguistic resources.
The participants will be introduced to the online database TypeCraft and to methods of language documentation online.
Praat
This course will give a short introduction to the use of the programme Praat in the annotation and instrumental analysis of speech.
The first part will deal with the task of mapping linguistic units (like phrases, syllables, phonemes) onto the physical speech signal.
In order to be able to perform such a task we need to acquire some knowledge about the acoustic properties of the speech signal.
Apart from the technical aspects of speech annotation, the main challenge is the variability of the speech signal.
It will be discussed how to deal with this variability using phonemic or phonetic transcription.
The second part of this course will focus on the acoustic analysis of the speech signal. We will look into:
- vowel and consonant duration
- vowel quality
- intonation and tone
Please note that instrumental analysis is a demanding task. Therefore, it may be necessary to reduce the number of issues.
For your convenience, the second part of the programme will be flexible and may be changed on the basis of your current research interests.
Community Oriented Language Documentation for the Ghana-Togo-Mountain Groups
In this tutorial we will pay attention to the question “who is language documentation for?”, and discuss the efforts we are making in
collaboration with some Ghana-Togo-Mountain communities to generate documentation products. In the first part we will introduce the
sociolinguistic context of the GTM languages-their multilingual character and the limited domains of use of the community languages.
In the second part we discuss the challenges and unintended consequences of designing orthographies, producing literacy primers,
dictionaries and grammars. We will also consider issues of language use in education in these communities.
Tone
The introduction to this short course on Tone starts with a brief overview on Tone in the world’s languages and Tone in African languages.
The question what a tone language really is needs some discussion.
Some topics related to the difference between surface and underlying structure will be treated. Following that, elicitation, transcription, and methodology will be discussed.
Some main thoughts on typology, and, finally, the lexical and grammatical function of Tone are treated and exemplified with examples from languages from all over Africa.
Bantu
This is a short introduction to Bantu languages, with a brief history of research in Comparative Bantu. The focus is on the Standard Bantu Noun
Classification System, Verbal Extensions and Tense-Aspect Markers/TAM, with special reference to Swahili and a dozen other major Bantu languages.
During sessions 2 and 3 we shall also go through a short comparative lists of selected Proto-Bantu items and their realizations in some languages in
east, central and southern Africa.
The three one hour sessions of the course will deal with
1) Introduction to Bantu languages with a history of research
2) Noun Classification in Bantu languages
3) Verbal Extensions and Tense-Aspect Markers/TAM
Final Session: Digitial Resources, Lexica, Grammars - Who needs what?
A recent conference on Language Documentation and Conservation named “Supporting Small Languages Together"1 as its central theme. Emphasizing the need for communities, linguists, and other academics to work in close collaboration, research and language revitalization efforts are mentioned as central issues. Does that mean that the Language Description and Language Documentation community is mainly interested in small and dying languages? Are languages with several million speakers, like Luganda, an Ugandian language, or Telugu and other South-Asian languages uninteresting in the context of Language Description and Documentation? Many languages in Africa are neither dying nor small but they are categorized as ‘less-resourced languages’, which mainly means that they lack digital resources like developed electronic corpora and digital lexicons, for example. But why is it so important to possess such digital resources? Do these electronic resources make traditional lexicons and grammars obsolete?
Modern Language Description and Language Documentation means multi-media documentation of language, audio- and video files databases, archives …. . LDD is high-tech and the magic phrase is: Rich, accessible records which can benefit both language research and speech communities. But in which way will multi-media resources which can only be viewed using a computer with fancy software be of interest to a speech community with uncertain supply of electricity and only limited access to data-technology? Is recorded and digitalized language the latest type of Relict to be shipped to the 'museums' of the First World?
Which role can a standard linguist play? Wouldn't linguists be more useful if they would simply continue to put their skills into producing grammar books and lexicons that teach native speakers of indigenous languages more about their own language?
Although undoubtedly a very important task, to only concentrate on paper material as a medium might be short-term thinking. In the age of Globalization no country can close itself off against the digital revolution. Digital information is the second most important currency (after the dollar). To be able to secure a place for one’s own language on the global information, is crucial for the language's survival on the long run. But how do we build digital resources? How do we secure and preserve digital lexicons and grammars, and most importantly, how do we as a language community secure local control over these resources?
Answers to these and related questions will be central to how the Field of Language Description and Documentation will develop in the years to come. The discussion in the final session of the LDD workshop will give us a head-start for finding good answers to important questions.
1 http://nflrc.hawaii.edu/ICLDC/2011/index.html
WORKSHOP LOCATION
The LDD workshop is located at Dragvoll the campus of the Humanities (Det humanistiske fakultet (HF)) and the Faculty of Samfunnsvitenskap and Teknologiledelse (SVT) of the Norwegian University of Science and Technology in Trondheim.
At the Dragvoll campus you find next to the University library also a book store, a kiosk with a post service, a Mensa, and a coffee shop.
Find workshop rooms
The workshop will be held in different rooms throughout the week. Rooms are announced in the workshop program above. The following pdf files show the location of the workshop rooms on the Dragvoll campus maps. The only exception is the room for the final session which is on the main campus close to down-town. Transportation will be organized.
Notice that not all rooms we use on the Dragvoll campus are in the same building. D111 refers to the room 111 at Dragvoll, and DL33 refers to the room 33 at Låven which is is the building up the hill on the back-side of the Dragvoll main building.
Room plan: media:DragvollRooms.pdf
Campus plan: media:DragvollCampus.pdf