Typecraft v2.5
Jump to: navigation, search

Difference between revisions of "Krio Corpus"

 
(21 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 +
{{Infobox language
 +
|name        = Krio
 +
|nativename  =
 +
|states      = Sierra Leone
 +
|region      =
 +
|speakers    = 500 000 native speakers
 +
|familycolor = Creole
 +
|fam1        = English Creole
 +
|fam2=  Atlantic
 +
|script      = Latin
 +
|nation      =
 +
|iso3= kri
 +
}}
  
  
===A Krio narrative===
+
===About  the TypeCraft Krio Corpus===
Currently, there is a public text from the Corpus, which shows an example of a typical Krio short story. The english translation is listed parallell to the Krio story and makes it easier to look at the differences made in, for example, the Speech-Event time and the marking of tenses.
+
The Krio Corpus consists of 33 short stories and transcribed narrations and a total of 937 phrases (8299 words).You can access the corpus from the TypeCraft [[Special:TypeCraft/PortalOfLanguages|Portal of Language]]
 +
In the list of Krio texts you find texts owned by [[User:Beatrice Owusua Nyampong|Beatrice Owusua Nyampong]] and by the [[User:Typecraft|TypeCraft project]]. The latter corpus is a further development of the Nyampong corpus. Some of the annotations of the Nyampong corpus have been rectified. Preserving the resource, we in addition added new layers of annotation reflecting the Kri TMA system in the light of the analysis presented in Beermann 2016 <ref>Dorothee Beermann (2016) Features and Domains. Presentation at the SLE Conference in Naples, Sep 2016.
 +
[[Notions_of_'feature'_in_linguistic_theory:_cross-theoretical_and_cross-linguistic_perspectives| SLE 2016  ''Notions of 'features''' workshop page ]] .</ref>
  
https://typecraft.org/tc2wiki/A_Krio_narrative
 
  
  
==About the project==
 
  
The objective of our study is to model the Krio TMA system using a written Krio corpus. We have studied the system in terms of its grammatical features. We then imported these features into the narrative domain which imposes its own set of features. This leads to a new relationship between feature sets.
+
===Krio narratives===
 +
Currently, we explore  oral narratives with respect to time anchoring and event chains. One of these  narratives can be found on the [[A_Krio_narrative| Krio narrative page]], as parallel Krio -English text. We also added it audio source which we transcribed, as a mp3 file. If you are interested in the wave file, please contact the TypeCraft project.
 +
 
 +
 
 +
===Download===
 +
 
 +
[[File:Krio TC-Corpus 06-10-2019.zip ]]
 +
 
 +
The Krio TC-Corpus is an TC-XML corpus of 86 interlinear glossed Krio texts. The October release 19 has been compiled by  Dorothee Beermann, Anna Struck and Beatrice Nyampong, TypeCraft Project, Polytext, Oslo.
 +
 
 +
 
 +
 
 +
===About the project===
 +
 
 +
The objective of our study is to model the Krio TMA system using a written Krio corpus. We have studied the system in terms of the  grammatical features that are present in our corpus. For our study of the Krio narrative we have imported these features into the narrative domain which imposes its own set of features. This leads to a new relationship between feature sets which we aim to describe.
  
 
Our theoretical aim is to present a domain analysis of features in order to model them according to the requirements introduced by the domain.  
 
Our theoretical aim is to present a domain analysis of features in order to model them according to the requirements introduced by the domain.  
  
 
Our practical aim is to use the model to achieve a more realistic description  
 
Our practical aim is to use the model to achieve a more realistic description  
of the Krio TMA system in terms of its features and the use of its exponents. <ref>As quoted by Professor Dorothee Beermann at the SLE Conference in Naple, Sep 2016</ref>
+
of the Krio TMA system in terms of its features and the use of its exponents. <ref>Dorothee Beermann (2016) Features and Domains. Presentation at the SLE Conference in Naples, Sep 2016. The presentation can be found at: </ref>
 +
 
 +
 
 +
====Tense, Modality, Aspect====
 +
 
 +
Our Krio corpus has received  in-depth morpho-syntactic annotations. In order to describe the Krio TMA features we have made use of the following attributes and values: 
 +
 
 +
{| class="wikitable"  width: 70%;"
 +
! colspan="3" style="color: black; text-align:center;" cellpadding=''10'' | Tense        Modality          Aspect
 +
|-
 +
|<tt>Past</tt> || <tt>Dynamic</tt> || <tt>Continuous</tt>
 +
|-
 +
|<tt>Past Perfect</tt> || <tt>Epistemic</tt> || <tt>Inceptive</tt>
 +
|-
 +
|<tt>Perfect</tt> || <tt>Deontic</tt> || <tt>Completive</tt>
 +
|-
 +
|<tt>Future</tt> || <tt>Conditional</tt> || <tt>Habitual</tt>
 +
|}
 +
 
 +
 
 +
 
 +
This are the gloss and part of speech tags we assigned to the verbal categories:
 +
 
 +
 
 +
 
 +
[[File:Imageboth.png|thumb|500px|left|Gloss tags and POS tags]]
 +
 
 +
<br/>
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
  
  
==About the Corpus==
 
The Krio Corpus consists of 33 short stories and transcribed narrations and a total of 937 phrases (8299 words). For all phrases in the corpus that have been annotated, it is layered in a systematical fashion of three analytical tiers, a meaning tier and a Gloss and POS tier.
 
<ref>The original resource has been created by Beatrice Owusua Nyampong, a native speaker of Krio.</ref><ref>Preserving the resource, we added new layers of annotation reflecting the Kri TMA system in the light of the analysis presented here.</ref>
 
  
===The Tense, Modality, Aspect features===
 
  
The annotations in Krio have been made in consideration to a tense, modality and aspect features system. Each category contains four tags.
+
==== Morphological template====
  
*Tense
 
**<tt>Past</tt>
 
**<tt>Past Perfect</tt>
 
**<tt>Perfect</tt>
 
**<tt>Future</tt>
 
*Modality
 
**<tt>Dynamic</tt>
 
**<tt>Epistemic</tt>
 
**<tt>Deontic</tt>
 
**<tt>Conditional</tt>
 
*Aspect
 
**<tt>Continuous</tt>
 
**<tt>Inceptive</tt>
 
**<tt>Completive</tt>
 
**<tt>Habitual</tt>
 
  
 
The verbal tense system has a particular ordering that can be expressed as a formula:  
 
The verbal tense system has a particular ordering that can be expressed as a formula:  
  
past [0...1] ⊰  MODAL [0...1] ⊰  TNS-perf [0...1] ⊰ ASP [0...2]      ROOT  ⊱ASP-compl [0...1]
+
:<span style="font-size:100%; line-height: 3.0em;"> '''past [0...1] ⊰  MODAL [0...1] ⊰  TNS-perf [0...1] ⊰ ASP [0...2]      ROOT  ⊱ASP-compl [0...1]''' </span>
  
  
  
===Particularities===
 
  
:'''''The perfect tense'''''
+
====Particularities====
  
 +
<u> '''The perfect tense''' </u> <br/>
 
The Krio perfect is formed using dɔn which is derived from the Krio verb dɔn 'finish'.  
 
The Krio perfect is formed using dɔn which is derived from the Krio verb dɔn 'finish'.  
 
As a tense marker the verb occurs in a preverbal or pre-copular position.
 
As a tense marker the verb occurs in a preverbal or pre-copular position.
In a post verbal position dɔn functions as a light verb and marks the completive aspect.  
+
In a post verbal position dɔn marks the completive aspect.  
  
  
:'''''The future tense'''''
+
<u> '''The future tense''' </u> <br/>
 
+
 
The Krio future marker is the preverb gò which is derived from the Krio verb go.  
 
The Krio future marker is the preverb gò which is derived from the Krio verb go.  
 
The Krio future refers to a time after the time in focus.  
 
The Krio future refers to a time after the time in focus.  
  
  
:'''''Modality'''''  
+
<u> '''Modality''' </u> <br/>
 
+
''Kin'' marks as a preverb habitual aspect; as a modal verb it expresses  dynamic modality (ability), as well as epistemic modality.  
Kin marks as a preverb habitual aspect; as a modal verb it expresses  dynamic modality (ability), as well as epistemic modality.  
+
 
It also naturally occurs in conditional construction, communicating contingent possibility.
 
It also naturally occurs in conditional construction, communicating contingent possibility.
  
  
 +
====Annotation profiles====
  
 +
Our overall annotation profile for Krio is shown in Table 2. 
  
===Annotation profiles===
+
{| class="wikitable sortable"
 
+
|+ '''Table 2  Krio Gloss and Pos Tags'''
{| border="1" cellpadding="2"
+
|+ '''Krio Gloss and Pos Tags'''
+
|-valign="top"
+
|width="15%"|'''Gloss Tag'''
+
|width="35%"|'''Description'''
+
|width="15%"|'''Pos Tag'''
+
|width="35%"|'''Description'''
+
|-valign="top"
+
 
|-
 
|-
|FUT || ''future tense'' ||"xx"||"xx"
+
!width="15%" style="background-color:navajowhite" |'''Gloss Tag'''
 +
!width="35%"class="unsortable"|'''Description'''
 +
!width="15%" style="background-color:navajowhite"|'''Pos Tag'''
 +
!width="35%" class="unsortable|'''Description'''
 
|-
 
|-
|PRF || ''perfect tense''
+
|FUT || ''future tense'' || N || ''noun''
 
|-
 
|-
|PAST || ''past tense''
+
|PRF || ''perfect tense'' || NPname || ''personal name''
 
|-
 
|-
|DEF || ''definitive''
+
|PAST || ''past tense'' || Np || ''personal name''
 
|-
 
|-
|EXCL || ''exclusive''
+
|DEF || ''definitive'' || PN || ''pronoun''
 
|-
 
|-
|NEG || ''negation''
+
|EXCL || ''exclusive'' || PNrel || ''relative pronoun''
 
|-
 
|-
|INDEF || ''indefinitive''
+
|NEG || ''negation'' || PNrefl || ''reflexive pronoun''
 
|-
 
|-
|FOC || ''focus''
+
|INDEF || ''indefinitive'' || PNposs || ''possessive pronoun''
 
|-
 
|-
|SBJ || ''subject''
+
|FOC || ''focus'' || PNdem || ''dem. pronoun''
 
|-
 
|-
|OBJ || ''object''
+
|SBJ || ''subject'' || V || ''verb''
 
|-
 
|-
|DIR || ''direction''
+
|OBJ || ''object'' || Vpre || ''preverb''
 
|-
 
|-
|PL || ''plural''
+
|DIR || ''direction'' || Vmod || ''modal verb''
 
|-
 
|-
|HAB || ''habitual aspect
+
|PL || ''plural'' || Vlght || ''light verb''
 
|-
 
|-
|CONT || ''continuous aspect''
+
|HAB || ''habitual aspect'' || V1 || ''first serial verb''
 
|-
 
|-
|INCEP || ''inceptive aspect''
+
|CONT || ''continuous aspect'' || V2 || ''second serial verb''
 
|-
 
|-
|DIR || ''direction''
+
|INCEP || ''inceptive aspect'' || V3 || ''third serial verb''
 
|-
 
|-
|LOC || ''location''
+
|DIR || ''direction'' || CONJ || ''conjunction''
 
|-
 
|-
|PURP || ''purpose clause''
+
|LOC || ''location'' || CONJC || ''connect. conjunction''
 
|-
 
|-
|REDP || ''reduplication''
+
|PURP || ''purpose clause''|| CONJS || ''sub. conjunction''
 
|-
 
|-
|CMPL || ''complement''
+
|REDP || ''reduplication''|| PUN || ''punctuation''
 
|-
 
|-
|POSS || ''possessive''
+
|CMPL || ''complement''|| DET || ''determiner''
|}
+
 
+
 
+
{| class=''wikitable'' style="text-align:left;"
+
!colspan=''2'' | POS tags
+
 
|-
 
|-
!scope=''row'' | Tag
+
|POSS || ''possessive''|| PRT || ''particle''
!scope=''row'' |Word
+
 
|-
 
|-
|N || ''noun''  
+
| || ||ADJ || ''adjective''
 
|-
 
|-
|V || ''verb''
+
| || ||PREP || ''preposition''
 
|-
 
|-
|PN || ''pronoun''
+
| || ||COMP || ''comparative''
 
|-
 
|-
|PUN || ''punctuation''
+
| || ||COP || ''copular''
 
|-
 
|-
|DET || ''determiner''
+
| || ||NUM || ''numeral''
 
|-
 
|-
|Vpre || ''preverb''
+
| || ||DEM || ''demonstrative''
 
|-
 
|-
|Vmod || ''modal verb''
+
| || || ADV || ''adverb''
 
|-
 
|-
|Vlght || ''light verb''
+
| || || ADVplc || ''placement adverb''
 
|-
 
|-
|V1 || ''first serial verb''
+
| || || ADVtemp || ''temporal adverb''
 
|-
 
|-
|V2 || ''second serial verb''
+
| || || PREPdir || ''directional preposition''
 
|-
 
|-
|V3 || ''third serial verb''
+
| || || ORD || ''ordering''
 
|-
 
|-
|PRT || ''particle''
+
| || || INTRJCT || ''interjection''
 
|-
 
|-
|ADJ || ''adjective''
+
| || || QUANT || ''quantifier''
 
|-
 
|-
|PREP || ''preposition''
+
| || || Wh || ''Question word''
|-
+
|ADV || ''adverb''
+
|-
+
|CONJ || ''conjunction''
+
|-
+
|CONJC || ''connect. conjunction''
+
|-
+
|CONJS || ''sub. conjunction''
+
|-
+
|PNrel || ''relative pronoun''
+
|-
+
|PNposs || ''possessive pronoun''
+
|-
+
|NPname || ''personal name''
+
|-
+
|COP || ''copular''
+
|-
+
|NUM || ''numeral''
+
|-
+
|DEM || ''demonstrative''
+
|-
+
|COMP || ''comparative''
+
|-
+
|Np || ''personal name''
+
|-
+
|ADVplc || ''placement adverb''
+
|-
+
|ADVtemp || ''temporal adverb''
+
|-
+
|PNrefl || ''reflexive pronoun''
+
|-
+
|PREPdir || ''directional preposition''
+
|-
+
|ORD || ''ordering''
+
|-
+
|INTRJCT || ''interjection''
+
|-
+
|QUANT || ''quantifier''
+
|-
+
|PNdem || ''demonstrative pronoun''
+
|-
+
|Wh || ''Question word''
+
 
|}
 
|}
  
 +
<!--Do not delete this
  
 
+
{{#ev:youtube|IldnXit6_f4}} -->
  
  
  
 
<references/>
 
<references/>

Latest revision as of 08:26, 7 October 2019

Krio
Spoken in Sierra Leone
Total speakers 500 000 native speakers
Language family English Creole
  • Atlantic
    • Krio
Writing system Latin
Language codes
ISO 639-1 None
ISO 639-2
ISO 639-3 kri
Linguasphere


About the TypeCraft Krio Corpus

The Krio Corpus consists of 33 short stories and transcribed narrations and a total of 937 phrases (8299 words).You can access the corpus from the TypeCraft Portal of Language In the list of Krio texts you find texts owned by Beatrice Owusua Nyampong and by the TypeCraft project. The latter corpus is a further development of the Nyampong corpus. Some of the annotations of the Nyampong corpus have been rectified. Preserving the resource, we in addition added new layers of annotation reflecting the Kri TMA system in the light of the analysis presented in Beermann 2016 [1]



Krio narratives

Currently, we explore oral narratives with respect to time anchoring and event chains. One of these narratives can be found on the Krio narrative page, as parallel Krio -English text. We also added it audio source which we transcribed, as a mp3 file. If you are interested in the wave file, please contact the TypeCraft project.


Download

File:Krio TC-Corpus 06-10-2019.zip

The Krio TC-Corpus is an TC-XML corpus of 86 interlinear glossed Krio texts. The October release 19 has been compiled by Dorothee Beermann, Anna Struck and Beatrice Nyampong, TypeCraft Project, Polytext, Oslo.


About the project

The objective of our study is to model the Krio TMA system using a written Krio corpus. We have studied the system in terms of the grammatical features that are present in our corpus. For our study of the Krio narrative we have imported these features into the narrative domain which imposes its own set of features. This leads to a new relationship between feature sets which we aim to describe.

Our theoretical aim is to present a domain analysis of features in order to model them according to the requirements introduced by the domain.

Our practical aim is to use the model to achieve a more realistic description of the Krio TMA system in terms of its features and the use of its exponents. [2]


Tense, Modality, Aspect

Our Krio corpus has received in-depth morpho-syntactic annotations. In order to describe the Krio TMA features we have made use of the following attributes and values:

Tense Modality Aspect
Past Dynamic Continuous
Past Perfect Epistemic Inceptive
Perfect Deontic Completive
Future Conditional Habitual


This are the gloss and part of speech tags we assigned to the verbal categories:


Gloss tags and POS tags











Morphological template

The verbal tense system has a particular ordering that can be expressed as a formula:

past [0...1] ⊰ MODAL [0...1] ⊰ TNS-perf [0...1] ⊰ ASP [0...2] ROOT ⊱ASP-compl [0...1]



Particularities

The perfect tense
The Krio perfect is formed using dɔn which is derived from the Krio verb dɔn 'finish'. As a tense marker the verb occurs in a preverbal or pre-copular position. In a post verbal position dɔn marks the completive aspect.


The future tense
The Krio future marker is the preverb gò which is derived from the Krio verb go. The Krio future refers to a time after the time in focus.


Modality
Kin marks as a preverb habitual aspect; as a modal verb it expresses dynamic modality (ability), as well as epistemic modality. It also naturally occurs in conditional construction, communicating contingent possibility.


Annotation profiles

Our overall annotation profile for Krio is shown in Table 2.

Table 2 Krio Gloss and Pos Tags
Gloss Tag Description Pos Tag Description
FUT future tense N noun
PRF perfect tense NPname personal name
PAST past tense Np personal name
DEF definitive PN pronoun
EXCL exclusive PNrel relative pronoun
NEG negation PNrefl reflexive pronoun
INDEF indefinitive PNposs possessive pronoun
FOC focus PNdem dem. pronoun
SBJ subject V verb
OBJ object Vpre preverb
DIR direction Vmod modal verb
PL plural Vlght light verb
HAB habitual aspect V1 first serial verb
CONT continuous aspect V2 second serial verb
INCEP inceptive aspect V3 third serial verb
DIR direction CONJ conjunction
LOC location CONJC connect. conjunction
PURP purpose clause CONJS sub. conjunction
REDP reduplication PUN punctuation
CMPL complement DET determiner
POSS possessive PRT particle
ADJ adjective
PREP preposition
COMP comparative
COP copular
NUM numeral
DEM demonstrative
ADV adverb
ADVplc placement adverb
ADVtemp temporal adverb
PREPdir directional preposition
ORD ordering
INTRJCT interjection
QUANT quantifier
Wh Question word



  1. Dorothee Beermann (2016) Features and Domains. Presentation at the SLE Conference in Naples, Sep 2016. SLE 2016 Notions of 'features' workshop page .
  2. Dorothee Beermann (2016) Features and Domains. Presentation at the SLE Conference in Naples, Sep 2016. The presentation can be found at: