Typecraft v2.5
Jump to: navigation, search

Difference between revisions of "Parallel Annotation of Speech and Text"

Line 47: Line 47:
  
 
----
 
----
 
+
<Phrase>on</Phrase>
 
== Test sentences ==
 
== Test sentences ==
 
''(taken from the "Sound to Sense" project)''
 
''(taken from the "Sound to Sense" project)''
Line 56: Line 56:
  
 
<Phrase>10903</Phrase>
 
<Phrase>10903</Phrase>
<flashmp3>PSTA01.mp3</flashmp3>
+
<flashmp3>PSTA01.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA01.mp3]], TextGrid: [[File:PSTA01.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA01.mp3]], TextGrid: [[File:PSTA01.txt]]
  
 
<Phrase>10904</Phrase>
 
<Phrase>10904</Phrase>
<flashmp3>PSTA02.mp3</flashmp3>
+
<flashmp3>PSTA02.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA02.mp3]], TextGrid: [[File:PSTA02.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA02.mp3]], TextGrid: [[File:PSTA02.txt]]
  
 
<Phrase>10905</Phrase>
 
<Phrase>10905</Phrase>
<flashmp3>PSTA03.mp3</flashmp3>
+
<flashmp3>PSTA03.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA03.mp3]], TextGrid: [[File:PSTA03.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA03.mp3]], TextGrid: [[File:PSTA03.txt]]
  
Line 74: Line 74:
  
 
<Phrase>10906</Phrase>
 
<Phrase>10906</Phrase>
<flashmp3>PSTA04.mp3</flashmp3>
+
<flashmp3>PSTA04.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA04.mp3]], TextGrid: [[File:PSTA04.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA04.mp3]], TextGrid: [[File:PSTA04.txt]]
  
 
<Phrase>10907</Phrase>
 
<Phrase>10907</Phrase>
<flashmp3>PSTA05.mp3</flashmp3>
+
<flashmp3>PSTA05.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA05.mp3]], TextGrid: [[File:PSTA05.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA05.mp3]], TextGrid: [[File:PSTA05.txt]]
  
 
<Phrase>10908</Phrase>
 
<Phrase>10908</Phrase>
<flashmp3>PSTA06.mp3</flashmp3>
+
<flashmp3>PSTA06.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA06.mp3]], TextGrid: [[File:PSTA06.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA06.mp3]], TextGrid: [[File:PSTA06.txt]]
  
 
<Phrase>10909</Phrase>
 
<Phrase>10909</Phrase>
<flashmp3>PSTA07.mp3</flashmp3>
+
<flashmp3>PSTA07.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA07.mp3]], TextGrid: [[File:PSTA07.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA07.mp3]], TextGrid: [[File:PSTA07.txt]]
  
 
----
 
----
  
Sentences 8 to 10:
+
Sentences 8 to 10:<br>Speaker dialect: Eastern Norway
 
+
Speaker dialect: Eastern Norway
+
  
 
<Phrase>10910</Phrase>
 
<Phrase>10910</Phrase>
<flashmp3>PSTA08.mp3</flashmp3>
+
<flashmp3>PSTA08.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA08.mp3]], TextGrid: [[File:PSTA08.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA08.mp3]], TextGrid: [[File:PSTA08.txt]]
  
 
<Phrase>10911</Phrase>
 
<Phrase>10911</Phrase>
<flashmp3>PSTA09.mp3</flashmp3>
+
<flashmp3>PSTA09.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA09.mp3]], TextGrid: [[File:PSTA09.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA09.mp3]], TextGrid: [[File:PSTA09.txt]]
  
 
<Phrase>10912</Phrase>
 
<Phrase>10912</Phrase>
<flashmp3>PSTA03.mp3</flashmp3>
+
<flashmp3>PSTA03.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA10.mp3]], TextGrid: [[File:PSTA10.txt]]
 
Download files for viewing in the Praat application: Sound: [[File:PSTA10.mp3]], TextGrid: [[File:PSTA10.txt]]
  
Line 111: Line 109:
 
'''About the TextGrid files:'''
 
'''About the TextGrid files:'''
  
The TextGrid files are opened together with the matching sound file for viewing in the Praat application. The TextGrid files consist of three tiers, 'Word' (rendered in Bokmål orthography) 'Phoneme' (shows underlying segments) and 'Note' (shows surface realisation with IPA symbols, and other notes).
+
The TextGrid files are opened together with the matching sound files for viewing in the Praat application. The TextGrid files consist of three tiers, 'Word' (rendered in Bokmål orthography) 'Phoneme' (shows underlying segments) and 'Note' (shows surface realisation with IPA symbols, and other notes).
  
 
Here is a list of glosses used in the 'Note' tier:
 
Here is a list of glosses used in the 'Note' tier:
  
''Phonology/Phonetics:''
+
''Phonology/Phonetics:''<br>
BrV = Segent realised with breathy voice
+
BrV = Segent realised with breathy voice<br>
CrV = Segent realised with creaky voice
+
CrV = Segent realised with creaky voice<br>
DV = Underlying voiced segment realised devoiced
+
DV = Underlying voiced segment realised devoiced<br>
EPN = Epenthesis
+
EPN = Epenthesis<br>
RD = Reduction of segment (e.g. corner vowel realised as schwa or plosive as fricative).
+
RD = Reduction of segment (e.g. corner vowel realised as schwa or plosive as fricative).<br>
V = Underlying non-voiced segment realised voiced
+
V = Underlying non-voiced segment realised voiced<br>
  
''Morphophonology/Syntax''
+
''Morphophonology/Syntax''<br>
CL = Clitic
+
CL = Clitic<br>
  
''Other''
+
''Other''<br>
ERR = The speaker errs and corrects himself
+
ERR = The speaker errs and corrects himself<br>
HES = (Audible) hesitation from speaker
+
HES = (Audible) hesitation from speaker<br>

Revision as of 16:23, 16 March 2010

PraatTest.png

The following is a test picture (of a picture produced in Praat) The test picture will be replaced by a Praat picture which reflects Asger's text grids. Every Praat picture will correspond to a TC sentence which we can load very easily to this wiki by exporting it from the TC database. The size of the the Praat picture can be adjusted.









Here we can add some text that discusses our annotations, and makes the reader aware of some of the interesting aspects of our annoations.



Not all of your text grids and annotation tables need to be represented equally large. The reader of the this page may always click on one of the Praat representations to view a larger version of the Praat picture, while TC annotations can be searched in the TC database in order to be inspected more carefully.

0.52 - 0.67


PraatTest.png
PraatTest.png



Test sentences

(taken from the "Sound to Sense" project)

Sentences 1 to 3:

Speaker dialect: Bergen



Download files for viewing in the Praat application: Sound: File:PSTA01.mp3, TextGrid: File:PSTA01.txt
Download files for viewing in the Praat application: Sound: File:PSTA02.mp3, TextGrid: File:PSTA02.txt
Download files for viewing in the Praat application: Sound: File:PSTA03.mp3, TextGrid: File:PSTA03.txt

Sentences 4 to 7:

Speaker dialect: Trondheim



Download files for viewing in the Praat application: Sound: File:PSTA04.mp3, TextGrid: File:PSTA04.txt
Download files for viewing in the Praat application: Sound: File:PSTA05.mp3, TextGrid: File:PSTA05.txt
Download files for viewing in the Praat application: Sound: File:PSTA06.mp3, TextGrid: File:PSTA06.txt
Download files for viewing in the Praat application: Sound: File:PSTA07.mp3, TextGrid: File:PSTA07.txt

Sentences 8 to 10:
Speaker dialect: Eastern Norway



Download files for viewing in the Praat application: Sound: File:PSTA08.mp3, TextGrid: File:PSTA08.txt
Download files for viewing in the Praat application: Sound: File:PSTA09.mp3, TextGrid: File:PSTA09.txt
Download files for viewing in the Praat application: Sound: File:PSTA10.mp3, TextGrid: File:PSTA10.txt

About the TextGrid files:

The TextGrid files are opened together with the matching sound files for viewing in the Praat application. The TextGrid files consist of three tiers, 'Word' (rendered in Bokmål orthography) 'Phoneme' (shows underlying segments) and 'Note' (shows surface realisation with IPA symbols, and other notes).

Here is a list of glosses used in the 'Note' tier:

Phonology/Phonetics:
BrV = Segent realised with breathy voice
CrV = Segent realised with creaky voice
DV = Underlying voiced segment realised devoiced
EPN = Epenthesis
RD = Reduction of segment (e.g. corner vowel realised as schwa or plosive as fricative).
V = Underlying non-voiced segment realised voiced

Morphophonology/Syntax
CL = Clitic

Other
ERR = The speaker errs and corrects himself
HES = (Audible) hesitation from speaker