Latest revision as of 08:26, 7 October 2019
Krio | |
---|---|
Spoken in | Sierra Leone |
Total speakers | 500 000 native speakers |
Language family | English Creole |
Writing system | Latin |
Language codes | |
ISO 639-1 | None |
ISO 639-2 | – |
ISO 639-3 | kri |
Linguasphere | |

Note: This page may contain IPA phonetic symbols in Unicode.
About the TypeCraft Krio Corpus
The Krio Corpus consists of 33 short stories and transcribed narrations, with a total of 937 phrases (8299 words). You can access the corpus from the TypeCraft Portal of Language.
The list of Krio texts contains texts owned by Beatrice Owusua Nyampong and by the TypeCraft project. The latter corpus is a further development of the Nyampong corpus: some of the annotations of the Nyampong corpus have been corrected and, while preserving the original resource, we have added new layers of annotation that reflect the Krio TMA system in the light of the analysis presented in Beermann 2016. [1]
Krio narratives
We are currently exploring oral narratives with respect to time anchoring and event chains. One of these narratives can be found on the Krio narrative page as a parallel Krio-English text. We have also added the audio source that we transcribed, as an MP3 file. If you are interested in the WAV file, please contact the TypeCraft project.
Download
File:Krio TC-Corpus 06-10-2019.zip
The Krio TC-Corpus is a TC-XML corpus of 86 interlinear glossed Krio texts. The October 2019 release was compiled by Dorothee Beermann, Anna Struck and Beatrice Nyampong, TypeCraft Project, Polytext, Oslo.
About the project
The objective of our study is to model the Krio TMA system using a written Krio corpus. We have studied the system in terms of the grammatical features that are present in our corpus. For our study of the Krio narrative, we have imported these features into the narrative domain, which imposes its own set of features. This leads to a new relationship between feature sets, which we aim to describe.
Our theoretical aim is to present a domain analysis of features in order to model them according to the requirements introduced by the domain.
Our practical aim is to use the model to achieve a more realistic description of the Krio TMA system in terms of its features and the use of its exponents. [2]
Tense, Modality, Aspect
Our Krio corpus has received in-depth morpho-syntactic annotations. In order to describe the Krio TMA features we have made use of the following attributes and values:
Tense | Modality | Aspect |
---|---|---|
Past | Dynamic | Continuous |
Past Perfect | Epistemic | Inceptive |
Perfect | Deontic | Completive |
Future | Conditional | Habitual |
These are the gloss and part-of-speech tags we assigned to the verbal categories:

[Figure: Gloss tags and POS tags]
Morphological template
The verbal tense system has a particular ordering that can be expressed as a formula:
past [0...1] ⊰ MODAL [0...1] ⊰ TNS-perf [0...1] ⊰ ASP [0...2] ROOT ⊱ ASP-compl [0...1]
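The template can be read as an ordered list of slots, each with a minimum and maximum number of occurrences. The following is a minimal sketch of a checker for that reading, not part of the TypeCraft tooling. It assumes that ROOT occurs exactly once and that a verbal complex has already been reduced to a sequence of slot labels; the names `TEMPLATE` and `fits_template` are hypothetical.

```python
import re

# Slots of the template in linear order, with (min, max) occurrence counts.
# Labels follow the formula above; ROOT occurring exactly once is an assumption.
TEMPLATE = [
    ("past", 0, 1),
    ("MODAL", 0, 1),
    ("TNS-perf", 0, 1),
    ("ASP", 0, 2),
    ("ROOT", 1, 1),
    ("ASP-compl", 0, 1),
]


def _build_pattern(template):
    """Turn the slot list into a regular expression over space-terminated labels."""
    parts = []
    for label, lo, hi in template:
        # Each label is followed by a space so that "ASP" cannot match "ASP-compl".
        parts.append("(?:%s ){%d,%d}" % (re.escape(label), lo, hi))
    return re.compile("".join(parts))


PATTERN = _build_pattern(TEMPLATE)


def fits_template(slots):
    """Return True if the sequence of slot labels respects the template."""
    encoded = "".join(label + " " for label in slots)
    return PATTERN.fullmatch(encoded) is not None
```

Under these assumptions, `fits_template(["past", "ASP", "ROOT"])` holds, while a sequence with MODAL before past, or with three ASP slots, is rejected.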
Particularities
The perfect tense
The Krio perfect is formed using dɔn, which is derived from the Krio verb dɔn 'finish'.
As a tense marker, dɔn occurs in preverbal or pre-copular position.
In postverbal position, dɔn marks completive aspect.
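The positional contrast described above can be sketched as a small function. This is an illustration under stated assumptions, not the annotation procedure used for the corpus: the input is a tokenised clause plus the index of the main verb or copula, the function name `classify_don` is hypothetical, and the output labels are plain descriptions rather than corpus gloss tags.

```python
def classify_don(tokens, root_index):
    """Label each occurrence of dɔn by its position relative to the verb root.

    tokens     -- list of word forms in the clause
    root_index -- position of the main verb or copula (assumed to be known)
    """
    labels = []
    for i, tok in enumerate(tokens):
        # Skip other words, and skip dɔn when it is itself the main verb 'finish'.
        if tok != "dɔn" or i == root_index:
            continue
        # Before the root: perfect tense marker; after the root: completive aspect.
        labels.append((i, "perfect" if i < root_index else "completive"))
    return labels
```

The token `"V"` in a call such as `classify_don(["dɔn", "V"], 1)` is a placeholder for any Krio verb, not an attested form.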
The future tense
The Krio future marker is the preverb gò, which is derived from the Krio verb go.
The Krio future refers to a time after the time in focus.
Modality
As a preverb, kin marks habitual aspect; as a modal verb, it expresses dynamic modality (ability) as well as epistemic modality.
It also occurs naturally in conditional constructions, where it communicates contingent possibility.
Annotation profiles
Our overall annotation profile for Krio is shown in Table 2.
Table 2: Krio Gloss and Pos Tags

Gloss Tag | Description | Pos Tag | Description |
---|---|---|---|
FUT | future tense | N | noun |
PRF | perfect tense | NPname | personal name |
PAST | past tense | Np | personal name |
DEF | definitive | PN | pronoun |
EXCL | exclusive | PNrel | relative pronoun |
NEG | negation | PNrefl | reflexive pronoun |
INDEF | indefinitive | PNposs | possessive pronoun |
FOC | focus | PNdem | dem. pronoun |
SBJ | subject | V | verb |
OBJ | object | Vpre | preverb |
DIR | direction | Vmod | modal verb |
PL | plural | Vlght | light verb |
HAB | habitual aspect | V1 | first serial verb |
CONT | continuous aspect | V2 | second serial verb |
INCEP | inceptive aspect | V3 | third serial verb |
DIR | direction | CONJ | conjunction |
LOC | location | CONJC | connect. conjunction |
PURP | purpose clause | CONJS | sub. conjunction |
REDP | reduplication | PUN | punctuation |
CMPL | complement | DET | determiner |
POSS | possessive | PRT | particle |
 | | ADJ | adjective |
 | | PREP | preposition |
 | | COMP | comparative |
 | | COP | copular |
 | | NUM | numeral |
 | | DEM | demonstrative |
 | | ADV | adverb |
 | | ADVplc | placement adverb |
 | | ADVtemp | temporal adverb |
 | | PREPdir | directional preposition |
 | | ORD | ordering |
 | | INTRJCT | interjection |
 | | QUANT | quantifier |
 | | Wh | Question word |
- ↑ Dorothee Beermann (2016) Features and Domains. Presentation at the SLE Conference in Naples, Sep 2016. SLE 2016 Notions of 'features' workshop page.
- ↑ Dorothee Beermann (2016) Features and Domains. Presentation at the SLE Conference in Naples, Sep 2016. The presentation can be found at: