Typecraft v2.5
Jump to: navigation, search

Difference between revisions of "Classroom:Simon's cat - multi-lingual"

(Simon's Cat in Persian)
(Simon's cat in Akan)
 
(50 intermediate revisions by 11 users not shown)
Line 1: Line 1:
<span style="color:green">'''[http://www.ntnu.no/studieinformasjon/timeplan/v12/?emnekode=LING2208-1&valg=emnekode&bokst LING 2208'''] a course at the [http://www.ntnu.edu/studies/courses/LING2208 Norwegian University of Science and Technology] </span>
+
The material presented here is based on course work at [http://www.ntnu.edu/studies/courses/LING2208 Norwegian University of Science and Technology]  
  
 
==Digital Methods for Speech and Text Processing==
 
==Digital Methods for Speech and Text Processing==
Line 76: Line 76:
  
 
<br/>
 
<br/>
 +
 +
* [http://typecraft.org/TCEditor/2143/# Go to the database for the annotated text]
  
 
*'''Source information''':
 
*'''Source information''':
 
*Speaker: Yutong Pu
 
*Speaker: Yutong Pu
 
*Language: Mandarin Chinese
 
*Language: Mandarin Chinese
*Date: 28.01.2012
+
*Date: 28.01.2013
 
* Size of the records: 2.04 MB
 
* Size of the records: 2.04 MB
 
* Duration of the records: 00:02:14
 
* Duration of the records: 00:02:14
Line 87: Line 89:
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
  
'''Transcript'''  [[Corpus:Xī Méng de Māo (3)]]
+
'''Transcript'''  [[Corpus:Xī Méng de Māo 3 ]]
 
+
  
 +
<br/><br/><br/>
  
 
===Simon's Cat in Ewe===
 
===Simon's Cat in Ewe===
Line 193: Line 195:
 
'''Transcript''' [[Corpus: Enjangu ya Simooni]]
 
'''Transcript''' [[Corpus: Enjangu ya Simooni]]
  
===Simon's cat in Norwegian (Bokmål)===
+
===Simon's cat in Norwegian===
  
 
  <flashmp3>AO L1.mp3</flashmp3>
 
  <flashmp3>AO L1.mp3</flashmp3>
Line 209: Line 211:
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript'''  [[Corpus:Simons katt]]
 
'''Transcript'''  [[Corpus:Simons katt]]
 +
<br>
 +
 +
<br>
 +
 +
<flashmp3>Mtm_simon's_cat.mp3‎ </flashmp3>
 +
<br>
 +
 +
*'''Source information''':
 +
*Speaker: Malin T.M.
 +
*Language: Norwegian
 +
*Date: 07.01.2013
 +
* Size of the records: 1.84 MB
 +
* Duration of the records: 00:02:17
 +
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
 +
*Purpose: Class work LING 2208, exercise L1/P1
 +
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 +
'''Transcript'''  [[Corpus:Katta til Simon]]
 +
<br>
 +
<br>
 +
 +
<flashmp3>SFHSC1.mp3‎‎</flashmp3>
 +
 +
<br>
 +
*'''Source information''':
 +
*Speaker: Sindre Flø Hunnes
 +
*Language: Norwegian
 +
*Date: 07.01.2013
 +
* Size of the records: 2.12 MB
 +
* Duration of the records: 00:02:18
 +
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
 +
*Purpose: Class work LING 2208, exercise L1/P1
 +
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 +
'''Transcript'''  [[Corpus:Simons katt, Volding dialect]]
 +
<br>
 +
 +
<br>
 +
<flashmp3>SFHSC1.mp3</flashmp3>
 +
<br>
 +
*'''Source information''':
 +
*Speaker: Henning Fleischer Rognlien
 +
*Language: Norwegian
 +
*Date: 10.01.2013
 +
* Size of the records: 1.19 MB
 +
* Duration of the records: 00:02:36
 +
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
 +
*Purpose: Class work LING 2208, exercise L1/P1
 +
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 +
'''Transcript'''  [[Corpus:Saimens kætt]]
 +
<br>
 +
 +
<br>
 +
<flashmp3>Simon's_Cat_Nowegian_TRH.mp3</flashmp3>
 +
*[http://typecraft.org/TCEditor/2146/ Go to the database for the annotated text]
 +
*'''Source information''':
 +
*Speaker: Thomas René Hansen
 +
*Language: Norwegian
 +
*Date: 7.01.2013
 +
* Size of the records: 2.16 MB
 +
* Duration of the records: 00:02:21
 +
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
 +
*Purpose: Class work LING 2208, exercise L1/P1
 +
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 +
'''Transcript'''  [[Corpus:Katten til Simon (Vega-dialekt)|Katten til Simon (Vega-dialekt)]]
  
 
===Simon's cat in Akan===
 
===Simon's cat in Akan===
Line 224: Line 289:
 
*Purpose: Class work LING 2013, exercise L1/P1
 
*Purpose: Class work LING 2013, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript'''  [[Corpus:Simon ɔkra]]
 
  
 
===Simon's cat in Slovenian===
 
===Simon's cat in Slovenian===
Line 240: Line 304:
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript'''  [[Corpus: Simonova mačka]]
+
'''Transcript'''  [[Corpus: Simonova mačka (2)]]
  
 
===Simon's cat in Nzema===
 
===Simon's cat in Nzema===
Line 274: Line 338:
  
  
<span style="color:green">'''[http://www.ntnu.no/studieinformasjon/timeplan/v12/?emnekode=LING2208-1&valg=emnekode&bokst LING 2208'''] a course at the [http://www.ntnu.edu/studies/courses/LING2208 Norwegian University of Science and Technology] </span>
 
  
==Digital Methods for Speech and Text Processing==
+
===Simon's Cat in Persian===
  
 +
<flashmp3>AM_L1.mp3</flashmp3>
  
 +
*[http://typecraft.org/TCEditor/1839/#tab1 Go to the database for the annotated text]
  
We are a group of graduate students at the linguistic department at the Norwegian University of Science and Technology that work on text and speech annotation.
 
 
[[File:SimonsCat.png|thumb|300px|right|[http://www.simonscat.com/faqs.html source]]]
 
 
We have used work by the English animator Simon Tofield to create parallel recordings in [ ] languages? We have then annotated these recordings using Praat
 
and TypeCraft.
 
 
You can learn more about Simon and his cat by going to [http://www.simonscat.com/faqs.html the official Simon's cat Website].
 
 
Simon's cat videos can be watched for example on [http://www.youtube.com/watch?v=I1qHVVbYG8Y YouTube].
 
 
===Simon's cat in Brazilian Portuguese===
 
 
<flashmp3>FR L1.mp3</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1364/#tab1 Go to the database for the annotated text]
 
  
 
*'''Source information''':
 
*'''Source information''':
*Speaker: Franciane Rocha
+
*Speaker: Atena Mahmoodan
*Language: Brazilian Portuguese
+
*Language: Persian
*Date: 11.01.2011
+
*Date: 12.01.2012
* Size of the records: 5.54 MB
+
* Size of the records: 1.14 MB
* Duration of the records: 00:04:43
+
* Duration of the records: 00:02:20
* Place: Phonetic laboratory ISK Dragvoll (Trondheim - Norway)
+
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
*Purpose: Class work LING 2008, exercise L1/P1
+
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript'''  [[Corpus:O_gato_de_Simon]]
+
'''Transcript'''  [[Corpus:Simon Gorbe]]
 
+
===Simon's cat in Mandarin Chinese===
+
 
+
<flashmp3>MZ L1.mp3</flashmp3>
+
 
+
* [http://typecraft.org/TCEditor/1361/# Go to the database for the annotated text]
+
 
+
*'''Source information''':
+
*Speaker: Miaomiao Zhang
+
*Language: Mandarin Chinese
+
*Date: 13.01.2011
+
* Size of the records: 1.63 MB
+
* Duration of the records: 00:01:46
+
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2008, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
 
+
'''Transcript'''  [[Corpus:Xīméng de māo]]
+
 
+
<br/><br/><br/>
+
 
+
<flashmp3>QW L1.mp3</flashmp3>
+
  
 
<br/>
 
<br/>
  
* [http://typecraft.org/TCEditor/1837/# Go to the database for the annotated text]
+
<flashmp3>MM_L1.mp3</flashmp3>
 
+
*'''Source information''':
+
*Speaker: Qingqing Wang
+
*Language: Mandarin Chinese
+
*Date: 25.01.2012
+
* Size of the records: 2.07 MB
+
* Duration of the records: 00:02:15
+
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2208, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
 
+
'''Transcript'''  [[Corpus:Xī Méng de Māo (2)]]
+
 
+
<br/><br/><br/>
+
 
+
<flashmp3>YU_L1.mp3</flashmp3>
+
  
 
<br/>
 
<br/>
  
*'''Source information''':
 
*Speaker: Yutong Pu
 
*Language: Mandarin Chinese
 
*Date: 28.01.2012
 
* Size of the records: 2.04 MB
 
* Duration of the records: 00:02:14
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
 
'''Transcript'''  [[Corpus:Xī Méng de Māo (3)]]
 
 
 
 
===Simon's Cat in Ewe===
 
 
<flashmp3>MM_L1.mp3</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1378/#tab1 Go to the database for the annotated text]
 
  
 
*'''Source information''':
 
*'''Source information''':
*Speaker: Mercy Motte
+
*Speaker: Mahboubeh Maleki
*Language: Ewe
+
*Language: Persian
*Date: 11.01.2011
+
*Date: 28.01.2013
* Size of the records: 2.9 MB
+
* Size of the records: 2.03 MB
* Duration of the records: 00:03:07
+
* Duration of the records: 00:02:13
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
+
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
*Purpose: Class work LING 2008, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript'''  [[Corpus:Simon ƒe dadi le ʋegbe me]]
+
 
+
 
+
<br>
+
 
+
<br>
+
 
+
<flashmp3>TS_L1.mp3‎ </flashmp3>
+
 
+
*[http://typecraft.org/TCEditor/1865/#tab1 Go to the database for the annotated text]
+
 
+
*'''Source information''':
+
*Speaker: Theresa M. Sedzro
+
*Language: Ewe
+
*Date: 12.01.2012
+
* Size of the records: 2.12 MB
+
* Duration of the records: 00:02:19
+
* Place: Phonetic laboratory ISK Dragvoll (Trondheim - Norway)
+
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript'''  [[Corpus:Simon ƒé dadi]]
+
'''Transcript'''  [[Corpus:Simon's cat in Persian]]
  
===Simon's Cat in Safaliba===
 
<flashmp3>KM_L1.mp3‎</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1380/#tab1 Go to the database for the annotated text]
 
 
*'''Source information''':
 
*Speaker: Bodua-Mango Kenneth
 
*Language: Safaliba
 
*Date: 11.01.2011
 
* Size of the records: 2.2 MB
 
* Duration of the records: 00:02:26
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript'''  [[Corpus:Simon dɔgtέέ ]]
 
 
===Simon's Cat in Dagaare===
 
<flashmp3>JG_L1.mp3‎</flashmp3>
 
 
*'''Source information''':
 
*Speaker: John Ganaah
 
*Language: Dagaare
 
*Date: 11.01.2011
 
* Size of the records: 2.8 MB
 
* Duration of the records: 00:03:07
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript'''  [[Corpus:SIMON KOKOLAA ]]
 
 
 
 
===Simon's Cat in Runyankore-Rukiga===
 
 
<flashmp3>DT L1.mp3‎</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1864/ Go to the database for the annotated text]
 
*'''Source information''':
 
*Speaker: Daphine Tayebwa
 
*Language: Runyankore-Rukiga
 
*Date: 13.01.2012
 
* Size of the records: 2.21 MB
 
* Duration of the records: 00:02:25
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript''' [[Corpus: Pussy ya Simon]]
 
<br>
 
<br>
 
 
 
<flashmp3>MN L1.mp3‎</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1363/#tab1 Go to the database for the annotated text]
 
*'''Source information''':
 
*Speaker: Misah Natumanya
 
*Language: Runyankore-Rukiga
 
*Date: 11.01.2011
 
* Size of the records: 2.8 MB
 
* Duration of the records: 00:03:07
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript''' [[Corpus: Enjangu ya Simooni]]
 
 
===Simon's cat in Norwegian (Bokmål)===
 
 
<flashmp3>AO L1.mp3</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1389/#tab1 Go to the database for the annotated text]
 
 
*'''Source information''':
 
*Speaker: Are Johansen Ormberg
 
*Language: Norwegian (Bokmål)
 
*Date: 11.01.2011
 
* Size of the records: 1.05 MB
 
* Duration of the records: 00:01:08
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript'''  [[Corpus:Simons katt]]
 
 
 
<span style="color:green">'''[http://www.ntnu.no/studieinformasjon/timeplan/v12/?emnekode=LING2208-1&valg=emnekode&bokst LING 2208'''] a course at the [http://www.ntnu.edu/studies/courses/LING2208 Norwegian University of Science and Technology] </span>
 
 
==Digital Methods for Speech and Text Processing==
 
 
 
 
We are a group of graduate students at the linguistic department at the Norwegian University of Science and Technology that work on text and speech annotation.
 
 
[[File:SimonsCat.png|thumb|300px|right|[http://www.simonscat.com/faqs.html source]]]
 
 
We have used work by the English animator Simon Tofield to create parallel recordings in [ ] languages? We have then annotated these recordings using Praat
 
and TypeCraft.
 
 
You can learn more about Simon and his cat by going to [http://www.simonscat.com/faqs.html the official Simon's cat Website].
 
 
Simon's cat videos can be watched for example on [http://www.youtube.com/watch?v=I1qHVVbYG8Y YouTube].
 
 
===Simon's cat in Brazilian Portuguese===
 
 
<flashmp3>FR L1.mp3</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1364/#tab1 Go to the database for the annotated text]
 
 
*'''Source information''':
 
*Speaker: Franciane Rocha
 
*Language: Brazilian Portuguese
 
*Date: 11.01.2011
 
* Size of the records: 5.54 MB
 
* Duration of the records: 00:04:43
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript'''  [[Corpus:O_gato_de_Simon]]
 
 
===Simon's cat in Mandarin Chinese===
 
 
<flashmp3>MZ L1.mp3</flashmp3>
 
 
* [http://typecraft.org/TCEditor/1361/# Go to the database for the annotated text]
 
 
*'''Source information''':
 
*Speaker: Miaomiao Zhang
 
*Language: Mandarin Chinese
 
*Date: 13.01.2011
 
* Size of the records: 1.63 MB
 
* Duration of the records: 00:01:46
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
 
'''Transcript'''  [[Corpus:Xīméng de māo]]
 
 
<br/><br/><br/>
 
 
<flashmp3>QW L1.mp3</flashmp3>
 
  
 
<br/>
 
<br/>
 +
<flashmp3>Simonscat_alma-1.mp3</flashmp3>
 +
<br/>
  
* [http://typecraft.org/TCEditor/1837/# Go to the database for the annotated text]
 
  
*'''Source information''':
 
*Speaker: Qingqing Wang
 
*Language: Mandarin Chinese
 
*Date: 25.01.2012
 
* Size of the records: 2.07 MB
 
* Duration of the records: 00:02:15
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
  
'''Transcript'''  [[Corpus:Xī Méng de Māo (2)]]
+
*Speaker: Alma Jafarzadeh Marandi
 
+
*Language: Persian
<br/><br/><br/>
+
*Date: 28.01.2013
 
+
* Size of the records: 2.03 MB
<flashmp3>YU_L1.mp3</flashmp3>
+
* Duration of the records: 00:02:03
 
+
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
<br/>
+
 
+
*'''Source information''':
+
*Speaker: Yutong Pu
+
*Language: Mandarin Chinese
+
*Date: 28.01.2012
+
* Size of the records: 2.04 MB
+
* Duration of the records: 00:02:14
+
* Place: Phonetic laboratory ISK Dragvoll (Trondheim - Norway)
+
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 +
'''Transcript'''  [[Corpus:Simon´s Cat in Persian (Farsi)]]
  
'''Transcript'''  [[Corpus:Xī Méng de Māo (3)]]
+
===Simon's Cat in Ganda (Luganda)===
 
+
<flashmp3>SN_L1.mp3</flashmp3>
 
+
 
+
===Simon's Cat in Ewe===
+
 
+
<flashmp3>MM_L1.mp3</flashmp3>
+
 
+
*[http://typecraft.org/TCEditor/1378/#tab1 Go to the database for the annotated text]
+
  
 +
*[http://typecraft.org/tc2wiki/Special:TypeCraft/Editor/1886/#tab1 Go to the database for the annotated text]
 
*'''Source information''':
 
*'''Source information''':
*Speaker: Mercy Motte
+
*Speaker: Samuel Namugala
*Language: Ewe
+
*Language: Ganda (Luganda)
*Date: 11.01.2011
+
* Size of the records: 2.9 MB
+
* Duration of the records: 00:03:07
+
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2008, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript'''  [[Corpus:Simon ƒe dadi le ʋegbe me]]
+
 
+
 
+
<br>
+
 
+
<br>
+
 
+
<flashmp3>TS_L1.mp3‎ </flashmp3>
+
 
+
*[http://typecraft.org/TCEditor/1865/#tab1 Go to the database for the annotated text]
+
 
+
*'''Source information''':
+
*Speaker: Theresa M. Sedzro
+
*Language: Ewe
+
 
*Date: 12.01.2012
 
*Date: 12.01.2012
* Size of the records: 2.12 MB
+
* Size of the records: 2.06 MB
* Duration of the records: 00:02:19
+
* Duration of the records: 00:02:15
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript'''   [[Corpus:Simon ƒé dadi]]
+
'''Transcript''' [[Corpus:Enjangu ya Simooni]]
  
===Simon's Cat in Safaliba===
+
===Simon's Cat in Kinyarwanda===
<flashmp3>KM_L1.mp3‎</flashmp3>
+
  
*[http://typecraft.org/TCEditor/1380/#tab1 Go to the database for the annotated text]
+
<flashmp3>JOK_L1a.mp3‎‎</flashmp3>
  
*'''Source information''':
 
*Speaker: Bodua-Mango Kenneth
 
*Language: Safaliba
 
*Date: 11.01.2011
 
* Size of the records: 2.2 MB
 
* Duration of the records: 00:02:26
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2008, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
'''Transcript'''  [[Corpus:Simon dɔgtέέ ]]
 
 
===Simon's Cat in Dagaare===
 
<flashmp3>JG_L1.mp3‎</flashmp3>
 
  
 
*'''Source information''':
 
*'''Source information''':
*Speaker: John Ganaah
+
*Speaker: Joy Kabibi
*Language: Dagaare
+
*Language: Kinyarwanda
*Date: 11.01.2011
+
*Date: 28.01.2013
* Size of the records: 2.8 MB
+
* Size of the records: 2.06 MB
* Duration of the records: 00:03:07
+
* Duration of the records: 00:02:15
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
*Purpose: Class work LING 2008, exercise L1/P1
+
*Purpose: Class work LING 2208, exercise 2
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript'''  [[Corpus:SIMON KOKOLAA ]]
+
'''Transcript'''  [[Corpus:Injangwe ya Simon]]
  
 +
===Simon's Cat in Runyankole-Rukiga===
  
 +
<flashmp3>JOK_L1b.mp3‎‎</flashmp3>
  
===Simon's Cat in Runyankore-Rukiga===
 
  
<flashmp3>DT L1.mp3‎</flashmp3>
 
 
*[http://typecraft.org/TCEditor/1864/ Go to the database for the annotated text]
 
 
*'''Source information''':
 
*'''Source information''':
*Speaker: Daphine Tayebwa
+
*Speaker: Joy Kabibi
*Language: Runyankore-Rukiga
+
*Language: Runyankole-Rukiga
*Date: 13.01.2012
+
*Date: 28.01.2013
* Size of the records: 2.21 MB
+
* Size of the records: 1.96 MB
* Duration of the records: 00:02:25
+
* Duration of the records: 00:02:09
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
*Purpose: Class work LING 2008, exercise L1/P1
+
*Purpose: Class work LING 2208, exercise 2
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript''' [[Corpus: Pussy ya Simon]]
+
'''Transcript'''   [[Corpus:Enjangu ya Simon]]
<br>
+
<br>
+
  
 +
===Simon's cat in Hijazi Arabic===
  
<flashmp3>MN L1.mp3‎</flashmp3>
+
<flashmp3>MOH_L1.mp3‎</flashmp3>
 
+
*[http://typecraft.org/TCEditor/1363/#tab1 Go to the database for the annotated text]
+
 
*'''Source information''':
 
*'''Source information''':
*Speaker: Misah Natumanya
+
*Speaker: Mohammed A. Mohammed-Yahya .
*Language: Runyankore-Rukiga
+
*Language: Hijazi Arabic
*Date: 11.01.2011
+
* Size of the records: 2.8 MB
+
* Duration of the records: 00:03:07
+
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2008, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript''' [[Corpus: Enjangu ya Simooni]]
+
 
+
===Simon's cat in Norwegian (Bokmål)===
+
 
+
<flashmp3>AO L1.mp3</flashmp3>
+
 
+
*[http://typecraft.org/TCEditor/1389/#tab1 Go to the database for the annotated text]
+
 
+
*'''Source information''':
+
*Speaker: Are Johansen Ormberg
+
*Language: Norwegian (Bokmål)
+
*Date: 11.01.2011
+
* Size of the records: 1.05 MB
+
* Duration of the records: 00:01:08
+
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2008, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript'''  [[Corpus:Simons katt]]
+
 
+
 
+
===Simon's Cat in Norwegian===
+
 
+
<flashmp3>EL_L1.mp3</flashmp3>
+
 
+
*[http://typecraft.org/TCEditor/1838/#tab1 Go to the database for the annotated text]
+
 
+
*'''Source information''':
+
*Speaker: Espen Lønvik
+
*Language: Norwegian
+
*Date: 13.01.2012
+
* Size of the records: 2.28 MB
+
* Duration of the records: 00:02:30
+
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2208, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript'''  [[Corpus:Katta te Simon]]
+
 
+
<br>
+
<br>
+
<br>
+
<flashmp3>IA_L1.mp3</flashmp3>
+
 
+
 
+
*[http://typecraft.org/TCEditor/1866/#tab1 Go to the database for the annotated text]
+
 
+
*'''Source information''':
+
*Speaker: Isak Askheim
+
*Language: Norwegian
+
*Date: 12.01.2012
+
* Size of the records: 2.16 MB
+
* Duration of the records: 00:02:21
+
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2208, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript'''  [[Corpus:Katta hass Simon]]
+
<br>
+
<br>
+
<br>
+
<flashmp3>Mtm_simon's_cat.mp3</flashmp3>
+
 
+
*'''Source information''':
+
*Speaker: Malin T.M.
+
*Language: Norwegian
+
 
*Date: 07.01.2013
 
*Date: 07.01.2013
* Size of the records: 1.84 MB
+
* Size of the records: 2.12 MB
* Duration of the records: 00:02:17
+
* Duration of the records: 00:02:18
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2208, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
 
+
===Simon's Cat in Persian===
+
 
+
<flashmp3>AM_L1.mp3</flashmp3>
+
 
+
*[http://typecraft.org/TCEditor/1839/#tab1 Go to the database for the annotated text]
+
 
+
 
+
*'''Source information''':
+
*Speaker: Atena Mahmoodan
+
*Language: Persian
+
*Date: 12.01.2012
+
* Size of the records: 2.14 MB
+
* Duration of the records: 00:02:20
+
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2208, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript'''  [[Corpus:Simon Gorbe]]
+
 
+
<br/>
+
 
+
<flashmp3>MM_L1.mp3</flashmp3>
+
 
+
<br/>
+
 
+
 
+
*'''Source information''':
+
*Speaker: Mahboubeh Maleki
+
*Language: Persian
+
*Date: 28.01.2013
+
* Size of the records: 1.03 MB
+
* Duration of the records: 00:02:13
+
* Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
+
*Purpose: Class work LING 2208, exercise L1/P1
+
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
+
'''Transcript'''  [[Corpus:Simon Gorbe1]]
+
 
+
===Simon's Cat in Ganda (Luganda)===
+
<flashmp3>SN_L1.mp3</flashmp3>
+
 
+
*[http://typecraft.org/tc2wiki/Special:TypeCraft/Editor/1886/#tab1 Go to the database for the annotated text]
+
*'''Source information''':
+
*Speaker: Samuel Namugala
+
*Language: Ganda (Luganda)
+
*Date: 12.01.2012
+
* Size of the records: 2.06 MB
+
* Duration of the records: 00:02:15
+
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
* Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Purpose: Class work LING 2208, exercise L1/P1
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
 
*Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
'''Transcript''' [[Corpus:Enjangu ya Simooni]]
+
'''Transcript'''   [[Corpus:بسة سايمون بلهجة الحجاز Simon's cat Hijazi Arabic]]

Latest revision as of 21:16, 19 November 2016

The material presented here is based on course work at Norwegian University of Science and Technology

Digital Methods for Speech and Text Processing

We are a group of graduate students at the linguistic department at the Norwegian University of Science and Technology that work on text and speech annotation.

Error creating thumbnail: Unable to save thumbnail to destination

We have used work by the English animator Simon Tofield to create parallel recordings in [ ] languages? We have then annotated these recordings using Praat and TypeCraft.

You can learn more about Simon and his cat by going to the official Simon's cat Website.

Simon's cat videos can be watched for example on YouTube.

Simon's cat in Brazilian Portuguese

  • Source information:
  • Speaker: Franciane Rocha
  • Language: Brazilian Portuguese
  • Date: 11.01.2011
  • Size of the records: 5.54 MB
  • Duration of the records: 00:04:43
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:O_gato_de_Simon

Simon's cat in Mandarin Chinese

  • Source information:
  • Speaker: Miaomiao Zhang
  • Language: Mandarin Chinese
  • Date: 13.01.2011
  • Size of the records: 1.63 MB
  • Duration of the records: 00:01:46
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Xīméng de māo



  • Source information:
  • Speaker: Qingqing Wang
  • Language: Mandarin Chinese
  • Date: 25.01.2012
  • Size of the records: 2.07 MB
  • Duration of the records: 00:02:15
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Xī Méng de Māo (2)



  • Source information:
  • Speaker: Yutong Pu
  • Language: Mandarin Chinese
  • Date: 28.01.2013
  • Size of the records: 2.04 MB
  • Duration of the records: 00:02:14
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Xī Méng de Māo 3


Simon's Cat in Ewe

  • Source information:
  • Speaker: Mercy Motte
  • Language: Ewe
  • Date: 11.01.2011
  • Size of the records: 2.9 MB
  • Duration of the records: 00:03:07
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon ƒe dadi le ʋegbe me

  • Source information:
  • Speaker: Theresa M. Sedzro
  • Language: Ewe
  • Date: 12.01.2012
  • Size of the records: 2.12 MB
  • Duration of the records: 00:02:19
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon Ď dadi

Simon's Cat in Safaliba

  • Source information:
  • Speaker: Bodua-Mango Kenneth
  • Language: Safaliba
  • Date: 11.01.2011
  • Size of the records: 2.2 MB
  • Duration of the records: 00:02:26
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon dɔgtέέ

Simon's Cat in Dagaare

  • Source information:
  • Speaker: John Ganaah
  • Language: Dagaare
  • Date: 11.01.2011
  • Size of the records: 2.8 MB
  • Duration of the records: 00:03:07
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:SIMON KOKOLAA

Simon's Cat in Runyankore-Rukiga

  • Go to the database for the annotated text
  • Source information:
  • Speaker: Daphine Tayebwa
  • Language: Runyankore-Rukiga
  • Date: 13.01.2012
  • Size of the records: 2.21 MB
  • Duration of the records: 00:02:25
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus: Pussy ya Simon

  • Go to the database for the annotated text
  • Source information:
  • Speaker: Misah Natumanya
  • Language: Runyankore-Rukiga
  • Date: 11.01.2011
  • Size of the records: 2.8 MB
  • Duration of the records: 00:03:07
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus: Enjangu ya Simooni

Simon's cat in Norwegian

  • Source information:
  • Speaker: Are Johansen Ormberg
  • Language: Norwegian (Bokmål)
  • Date: 11.01.2011
  • Size of the records: 1.05 MB
  • Duration of the records: 00:01:08
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simons katt


  • Source information:
  • Speaker: Malin T.M.
  • Language: Norwegian
  • Date: 07.01.2013
  • Size of the records: 1.84 MB
  • Duration of the records: 00:02:17
  • Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Katta til Simon


  • Source information:
  • Speaker: Sindre Flø Hunnes
  • Language: Norwegian
  • Date: 07.01.2013
  • Size of the records: 2.12 MB
  • Duration of the records: 00:02:18
  • Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simons katt, Volding dialect


  • Source information:
  • Speaker: Henning Fleischer Rognlien
  • Language: Norwegian
  • Date: 10.01.2013
  • Size of the records: 1.19 MB
  • Duration of the records: 00:02:36
  • Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Saimens kætt

  • Go to the database for the annotated text
  • Source information:
  • Speaker: Thomas René Hansen
  • Language: Norwegian
  • Date: 7.01.2013
  • Size of the records: 2.16 MB
  • Duration of the records: 00:02:21
  • Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Katten til Simon (Vega-dialekt)

Simon's cat in Akan

  • Source information:
  • Speaker: Ernest Boadu Boakye
  • Language: Akan
  • Date: 28.01.2013
  • Size of the records: 2.16 MB
  • Duration of the records: 00:02:21
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2013, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan

Simon's cat in Slovenian

  • Source information:
  • Speaker: Jana Kocjan
  • Language: Slovenian
  • Date: 07.01.2013
  • Size of the records: 2.8 MB
  • Duration of the records: 00:02:20
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus: Simonova mačka (2)

Simon's cat in Nzema

  • Source information:
  • Speaker: Francis Benle
  • Language: Nzema
  • Date: 28.01.2013
  • Size of the records: 2.08 MB
  • Duration of the records: 00:02:15
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2008, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon kusu ne

Simon's cat in Dangme

  • Source information:
  • Speaker: Prosper Teye Akortia
  • Language: Dangme
  • Date: 28.01.2013
  • Size of the records: 02:03 MB
  • Duration of the records: 00:02:13
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2013, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon anɔ ɔ

Simon's Cat in Persian

  • Source information:
  • Speaker: Atena Mahmoodan
  • Language: Persian
  • Date: 12.01.2012
  • Size of the records: 1.14 MB
  • Duration of the records: 00:02:20
  • Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon Gorbe

  • Source information:
  • Speaker: Mahboubeh Maleki
  • Language: Persian
  • Date: 28.01.2013
  • Size of the records: 2.03 MB
  • Duration of the records: 00:02:13
  • Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon's cat in Persian

  • Speaker: Alma Jafarzadeh Marandi
  • Language: Persian
  • Date: 28.01.2013
  • Size of the records: 2.03 MB
  • Duration of the records: 00:02:03
  • Place: Phonetics laboratory - ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Simon´s Cat in Persian (Farsi)

Simon's Cat in Ganda (Luganda)

  • Go to the database for the annotated text
  • Source information:
  • Speaker: Samuel Namugala
  • Language: Ganda (Luganda)
  • Date: 12.01.2012
  • Size of the records: 2.06 MB
  • Duration of the records: 00:02:15
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Enjangu ya Simooni

Simon's Cat in Kinyarwanda

  • Source information:
  • Speaker: Joy Kabibi
  • Language: Kinyarwanda
  • Date: 28.01.2013
  • Size of the records: 2.06 MB
  • Duration of the records: 00:02:15
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise 2
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Injangwe ya Simon

Simon's Cat in Runyankole-Rukiga

  • Source information:
  • Speaker: Joy Kabibi
  • Language: Runyankole-Rukiga
  • Date: 28.01.2013
  • Size of the records: 1.96 MB
  • Duration of the records: 00:02:09
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise 2
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:Enjangu ya Simon

Simon's cat in Hijazi Arabic

  • Source information:
  • Speaker: Mohammed A. Mohammed-Yahya .
  • Language: Hijazi Arabic
  • Date: 07.01.2013
  • Size of the records: 2.12 MB
  • Duration of the records: 00:02:18
  • Place: Phonetic laboratory – ISK Dragvoll (Trondheim - Norway)
  • Purpose: Class work LING 2208, exercise L1/P1
  • Supervisor: Williem A. van Dommelen and Dorothee B. Hellan
Transcript Corpus:بسة سايمون بلهجة الحجاز Simon's cat Hijazi Arabic