Difference between revisions of "Parallel Annotation of Speech and Text - Part 2"
(13 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
+ | '''NTNU project - 2010''' | ||
+ | |||
go back to [[Parallel Annotation of Speech and Text]] | go back to [[Parallel Annotation of Speech and Text]] | ||
+ | go to [[Parallel_Annotation_of_Speech_and_Text_-_Part_3|Parallel Annotation of Speech and Text - Part 3]] | ||
− | |||
+ | '''Speaker dialect: Trondheim''' | ||
+ | |||
+ | ''Sentence 4'' | ||
<Phrase>10906</Phrase> | <Phrase>10906</Phrase> | ||
<flashmp3>PSTA04.mp3</flashmp3><br> | <flashmp3>PSTA04.mp3</flashmp3><br> | ||
− | Download files for viewing in the Praat application: | + | Download files for viewing in the Praat application: |
+ | *[[Media:PSTA04.mp3| Sound]] | ||
+ | *[[Media:PSTA04.txt|TextGrid]] | ||
+ | |||
+ | <br> | ||
<br> | <br> | ||
+ | ''Sentence 5'' | ||
<Phrase>10907</Phrase> | <Phrase>10907</Phrase> | ||
<flashmp3>PSTA05.mp3</flashmp3><br> | <flashmp3>PSTA05.mp3</flashmp3><br> | ||
− | Download files for viewing in the Praat application: | + | Download files for viewing in the Praat application: |
+ | *[[Media:PSTA05.mp3|Sound]] | ||
+ | *[[Media:PSTA05.txt|TextGrid]] | ||
+ | |||
<br> | <br> | ||
+ | <br> | ||
+ | ''Sentence 6'' | ||
<Phrase>10908</Phrase> | <Phrase>10908</Phrase> | ||
<flashmp3>PSTA06.mp3</flashmp3><br> | <flashmp3>PSTA06.mp3</flashmp3><br> | ||
− | Download files for viewing in the Praat application: | + | Download files for viewing in the Praat application: |
+ | *[[Media:PSTA06.mp3|Sound]] | ||
+ | *[[File:PSTA06.txt|TextGrid]] | ||
+ | |||
+ | <br> | ||
<br> | <br> | ||
+ | ''Sentence 7'' | ||
<Phrase>10909</Phrase> | <Phrase>10909</Phrase> | ||
+ | <br> | ||
+ | A phenomenon that is interesting relative to the sentence above is, how toneme 1 and toneme 2 is used in inflection. | ||
+ | This is truly best documented by combining morphological annotation with speech annotation. Only the later allows a look at the actual pronunciation. | ||
+ | <br> | ||
+ | |||
+ | {| border="1" | ||
+ | |+'''Difference in tone contour as a result of inflection''' | ||
+ | |- | ||
+ | | valign="top"| | ||
+ | [[Image:Taket.jpg|thumb|left|350px|/ta:k+e/]] | ||
+ | | valign="bottom"| | ||
+ | [[Image:Pære.jpg|thumb|left|350px|/pæ:r+e/]] | ||
+ | |} | ||
+ | <br> | ||
+ | Above we have screenshots of two of the words from the sentence, “taket” and “pære” in orthography, viewed in the Praat application. The latter word is part of a compound word, but for the sake of simplicity we can disregard this. The pitch is shown as a blue curve in the middle. Phonemically these words are transcribed /ta:k+e/ and /pæ:r+e/, where the suffix /e/ in the first word denotes definite singular (“the roof”) and in the second indefinite singular (“bulb”). It would seem we have two different morphemes that are realised by the same morph /e/, however, this is not the case. As we can see, the inflected words have different pitch contours, in /ta:k+e/ there is an HLH contour (the word has toneme 2), while in /pæ:r+e/ there is a LH contour (toneme 1). One way to analyse this is to say that the suffix /e/ in /ta:k+e/ carries an extra tone (or tone bearing unit) with it, while the suffix /e/ in /pæ:r+e/ does not. This means that the two suffixes are not phonologically the same. | ||
+ | |||
<flashmp3>PSTA07.mp3</flashmp3><br> | <flashmp3>PSTA07.mp3</flashmp3><br> | ||
− | Download files for viewing in the Praat application: | + | Download files for viewing in the Praat application: |
+ | *[[Media:PSTA07.mp3|Sound]] | ||
+ | *[[Media:PSTA07.txt|TextGrid]] | ||
+ | |||
+ | |||
+ | [[Category:Projects]] |
Latest revision as of 20:10, 18 February 2016
NTNU project - 2010
go back to Parallel Annotation of Speech and Text
go to Parallel Annotation of Speech and Text - Part 3
Speaker dialect: Trondheim
Sentence 4
Foran |
foran |
in.front.ofDIRSRC |
PREP |
kortsida | ||
kort | sid | a |
short | side | DEFSG |
N |
av |
a |
ofPART |
CL |
svampen | |
svamp | en |
sponge | DEFSG |
N |
så |
so |
thenDM |
CONJC |
er |
e: |
bePRES |
V |
det |
de |
3SGNEUT |
PN |
en |
en |
INDEFSGCOMM |
ART |
bøtte | |
bøt: | e |
bucket | INDEFSG |
N |
med |
me |
withINSTR |
PREP |
en |
en |
INDEFSGCOMM |
ART |
hank |
hank |
handleINDEFSG |
N |
på |
po |
onLOC |
ADVm |
og |
o |
and |
CONJC |
halvveis | ||
halv | vei | s |
half | way | N>ADJ |
ADVm |
fylt | |
fyl: | t |
fill | PASTPART |
ADJ |
med |
me |
withINSTR |
PREP |
vann |
van: |
waterINDEFSG |
N |
Download files for viewing in the Praat application:
Sentence 5
Altså |
aso |
that.isDM |
ADVm |
du |
dʉ |
2SG |
PN |
har | |
ha: | r |
have | PRES |
V |
TVen | |
te:v | en |
TV | DEFSG |
N |
som |
som |
that |
PNrel |
er |
e: |
bePRES |
V |
nesten |
nesten |
almost |
ADVm |
helt | |
he:l | t |
whole | ADJ>ADV |
ADVm |
inntil |
inte |
in.againstLOC |
PREP |
nederste | |
nederst | e |
downmostSUP | DEF |
ADJ |
delen | |
de:l | en |
part | DEFSG |
N |
av |
a |
ofPART |
CL |
rektangelet | |
rektanɡel | et |
rectangle | DEFSG |
N |
og |
o |
and |
CONJC |
så |
so |
then |
CONJC |
har | |
ha: | r |
have | PRES |
V |
du |
dʉ |
2SG |
PN |
et |
et |
INDEFSGNEUT |
ART |
område | |
umro:d | e |
area | INDEFSG |
N |
som |
som |
PNrel |
er |
e: |
bePRES |
V |
like | |
li:k | e |
like | ADJ>ADV |
ADVm |
bredt | |
bre: | t |
wide | NEUT |
ADJ |
som |
som |
PNrel |
TVen | |
te:v | en |
TV | DEFSG |
N |
nedenfor |
neafor |
belowDIR |
PREP |
TVen | |
te:v | en |
TV | DEFSG |
N |
igjen |
ijen: |
again |
ADVm |
Download files for viewing in the Praat application:
Sentence 6
Og |
o |
and |
CONJC |
de |
dem |
3PL |
PN |
når | |
no: | r |
reach | PRES |
V |
cirka |
sirka |
circa |
ADVm |
opp |
op |
upDIR |
PREP |
til |
te |
toDIR |
PREP |
hodet | |
hu:d | e |
head | DEFSG |
N |
på |
po |
onDIRSRC |
PREP |
slangen | |
slanɡ | en |
snake | DEFSG |
N |
altså |
aso |
that.isDM |
ADVm |
den |
den |
DEFSGCOMM |
ART |
slangeslangen | |||
slanɡ | e | slanɡ | en |
snake | snake | DEFSG | |
N |
ikke |
ik:e |
not |
ADVm |
vannslangen | ||
van: | slanɡ | en |
water | hose | DEFSG |
N |
Download files for viewing in the Praat application:
Sentence 7
Ved |
ve |
byLOC |
PREP |
siden | |
si:d | en |
side | DEFSG |
N |
av |
av |
ofPART |
PREP |
døra | |
dø:r | a |
door | DEFSG |
N |
så |
so |
thenDM |
CONJC |
henger |
henɡ |
hangPRES |
V |
det |
de |
3SGNEUT |
PN |
fra |
fra |
fromDIRSRC |
PREP |
taket | |
ta:k | e |
ceiling | DEFSG |
N |
rett |
ret: |
straight |
ADVm |
ned |
ne |
down(wards) |
ADVm |
en |
en |
aINDEFSGCOMM |
ART |
en |
en |
aINDEFSGCOMM |
ART |
sånn |
son: |
DM |
ADVm |
lyspære | ||
ly:s | pæ:r | e |
light | bulb | INDEFSG |
N |
A phenomenon that is interesting relative to the sentence above is, how toneme 1 and toneme 2 is used in inflection.
This is truly best documented by combining morphological annotation with speech annotation. Only the later allows a look at the actual pronunciation.
Above we have screenshots of two of the words from the sentence, “taket” and “pære” in orthography, viewed in the Praat application. The latter word is part of a compound word, but for the sake of simplicity we can disregard this. The pitch is shown as a blue curve in the middle. Phonemically these words are transcribed /ta:k+e/ and /pæ:r+e/, where the suffix /e/ in the first word denotes definite singular (“the roof”) and in the second indefinite singular (“bulb”). It would seem we have two different morphemes that are realised by the same morph /e/, however, this is not the case. As we can see, the inflected words have different pitch contours, in /ta:k+e/ there is an HLH contour (the word has toneme 2), while in /pæ:r+e/ there is a LH contour (toneme 1). One way to analyse this is to say that the suffix /e/ in /ta:k+e/ carries an extra tone (or tone bearing unit) with it, while the suffix /e/ in /pæ:r+e/ does not. This means that the two suffixes are not phonologically the same.
Download files for viewing in the Praat application: