
Data-driven Valence Typology

Revision as of 18:03, 22 December 2017


TypeCraft


The multilingual Interlinear Glossed Text (IGT) Bank.

With TypeCraft you can freely access grammatically glossed examples (IGT) from more than 150 languages (see: Portal of Languages). Examples can be exported in various formats. You can also use TypeCraft to create your own Interlinear Glossed Text for any language. You can store the data in your own private space, or share your work with a group. The TypeCraft Wiki consists of 289 articles, which mostly discuss less-described languages and address linguistic questions, often embedding Interlinear Glossed Texts drawn from the database. Viewing existing data in the database or reading articles in the wiki does not require a login, whereas creating data and writing in the wiki do (upper right corner). To access the database and the data editing functions, click the TypeCraft Tools button in the upper left corner. To access the wiki pages, type part of the name of the page you want into the Search Mediawiki box in the upper right corner. The Quick Start page gives a general introduction to the use of TypeCraft, and information about system updates.

IGT data and the TypeCraft Database


The TypeCraft database currently consists of 3065 texts, each made up of a smaller or larger number of phrases. There are at present 145 unique languages in the database. Our Portal of Languages is a dynamic list of those languages that have more than 5 public texts in the database. Through the Portal you can download texts and phrases in different export formats, as explained on the Quick Start page. The TypeCraft IGT comes in the form illustrated below, an annotation of a phrase from Akan:

TypeCraft data can be searched on any of the levels illustrated here, from the text level down to the morph level, using TypeCraft Tools -> TypeCraft Search -> "Text search" or "Phrase search". Search results can be freely downloaded in various formats, for instance to TC's own wiki as in the example. The Quick Start page explains how to search and export, as well as how to create IGTs.

The TypeCraft Wiki


The category page Languages offers an overview of the TypeCraft Wiki pages ordered by language, with Runyankore-Rukiga as an example of a category of language-specific pages. The following pages provide views into typological variation: Typological Features, Grammar squibs. One area treated in depth is valency across languages, with contributions accessible at Valence - general and Valence by language. Various research projects are presented under Projects.






Mary Esther Kropp Dakubu and Lars Hellan

Nov. 7, 2011

Data-driven Valence Typology (DVT) is a project in which we seek to represent the characteristic sentence construction types of a language, called its c-profile, in a transparent, detailed and non-theory-biased format, drawing on a common, restricted repertory of analytic-descriptive primitives, cf. [1]. Because it adheres to a common classification system, DVT in principle allows its data to be searched in a relational database. DVT has so far been developed with a view to covering significantly different languages (Ga from the Kwa branch of Niger-Congo, Norwegian from Germanic, and Kistaninya from Ethio-Semitic). In its current phase the project has a more ‘micro-comparative’ focus, showing how the c-profile of one language in a given family can be derived from the c-profile of another language in the same family. In Germanic we envisage such extensions with regard to English and German, and in Kwa/Gur with regard to Dangme and Gurene.
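
As a rough illustration of what searching c-profiles in a relational database could look like, the following is a minimal sketch using Python's built-in sqlite3 module. The table names, the construction labels (`intr`, `tr`, `ditr`) and the languages (`LangA`, `LangB`) are placeholders invented for this example; they are neither the actual DVT label inventory nor TypeCraft's database layout.

```python
import sqlite3

# Hypothetical schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE construction_type (
    label       TEXT PRIMARY KEY,   -- shared classification label
    description TEXT
);
CREATE TABLE c_profile (
    language TEXT NOT NULL,
    label    TEXT NOT NULL REFERENCES construction_type(label),
    PRIMARY KEY (language, label)
);
""")

conn.executemany("INSERT INTO construction_type VALUES (?, ?)", [
    ("intr", "intransitive"),
    ("tr", "transitive"),
    ("ditr", "ditransitive"),
])
conn.executemany("INSERT INTO c_profile VALUES (?, ?)", [
    ("LangA", "intr"), ("LangA", "tr"), ("LangA", "ditr"),
    ("LangB", "intr"), ("LangB", "tr"),
])


def shared_types(conn, lang_a, lang_b):
    """Construction labels present in the c-profiles of both languages."""
    rows = conn.execute("""
        SELECT label FROM c_profile WHERE language = ?
        INTERSECT
        SELECT label FROM c_profile WHERE language = ?
        ORDER BY label
    """, (lang_a, lang_b))
    return [row[0] for row in rows]


def only_in(conn, lang_a, lang_b):
    """Construction labels in lang_a's c-profile that lang_b lacks."""
    rows = conn.execute("""
        SELECT label FROM c_profile WHERE language = ?
        EXCEPT
        SELECT label FROM c_profile WHERE language = ?
        ORDER BY label
    """, (lang_a, lang_b))
    return [row[0] for row in rows]


print(shared_types(conn, "LangA", "LangB"))
print(only_in(conn, "LangA", "LangB"))
```

Because both profiles draw on the same label inventory, comparing two languages reduces to set operations on one table. The difference query gives one simple way to picture the micro-comparative step of relating one profile to another.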


Among current projects and initiatives, DVT is perhaps most directly related to VerbNet [2], its non-computational predecessor in Levin's work [3], and a cross-linguistic development of the latter, the Leipzig Valency Classes Project [4].

In future publications we will show how an inventory of verb classes in the Levin approach can be derived from a DVT c-profile and an accompanying verb construction lexicon, such as those available for Ga [5] and for Norwegian [6]. We will also assess the notion of ‘valence alternation’ as a unit of comparison, a notion that is in itself notoriously difficult to define, and show that among the 150 most salient frames in Ga, none are interconnected by any of the ‘alternation’ patterns commonly applied in the European setting. We will advocate DVT as a sounder general basis for valence typology, since it does not depend directly on notions like ‘alternation’.
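
The derivation itself is left to those future publications. Purely as an illustration of the general idea, the sketch below groups verbs into classes according to the set of frames each verb occurs in, which is one simple reading of a Levin-style class. The lexicon, the verb names and the frame labels are hypothetical, and this grouping should not be taken as the authors' procedure.

```python
from collections import defaultdict

# Hypothetical verb construction lexicon: verb -> set of frame labels.
# Neither the verbs nor the labels come from the Ga or Norwegian lexicons.
lexicon = {
    "verb1": {"intr", "tr"},
    "verb2": {"intr", "tr"},
    "verb3": {"tr", "ditr"},
    "verb4": {"intr"},
}


def verb_classes(lexicon):
    """Group verbs that occur in exactly the same set of frames."""
    classes = defaultdict(list)
    for verb, frames in lexicon.items():
        # frozenset is hashable, so a frame set can serve as a dict key
        classes[frozenset(frames)].append(verb)
    return {frames: sorted(verbs) for frames, verbs in classes.items()}


classes = verb_classes(lexicon)
for frames, verbs in sorted(classes.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(frames), verbs)
```

Here `verb1` and `verb2` fall into one class because they share the frame set {intr, tr}, while `verb3` and `verb4` each form a class of their own. Note that the grouping relies only on frame membership and makes no reference to alternations between frames.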


Further pages at this site giving information about the project include:

1. The three parts of [1], consisting of: The system, Ga Appendix, Norwegian Appendix

2. Verbconstructions cross-linguistically - Introduction, a predecessor of [1], which introduces the system particularly as applied to Norwegian, with wiki pages illustrating the c-profile of Norwegian as established at that time, and with annotated examples for each type.

3. The following TypeCraft annotated texts:

 Ga sentence types	                Mary Esther Kropp Dakubu
 Norwegian verb constructions	        Lars Hellan
 Verb constructions in Kistaniniya	Bedilu Debela


References

  1. Hellan and Dakubu 2010 Identifying Verb Constructions Cross-linguistically, SLAVOB series 6:3, University of Ghana
  2. VerbNet
  3. Levin 1993 English Verb Classes and Alternations, University of Chicago Press, Chicago, IL
  4. Leipzig Valency Classes Project
  5. Dakubu 2011 Ga Verbs and their Constructions
  6. Hellan 2011 Norwegian Verbs and their Constructions