Difference between revisions of "Data-driven Valence Typology"
|Line 22:||Line 22:|
<div class="box box-sapphire">
<div class="box box-sapphire">
<h4 class="text-center">IGT data</h4>
<h4 class="text-center">IGT data </h4>
The TypeCraft IGT comes in the form illustrated below, from Akan
The TypeCraft IGT comes in the form illustrated below, from Akan:
Revision as of 10:22, 22 December 2017
The multilingual Interlinear Glossed Text (IGT) Bank.
With TypeCraft you can freely access grammatically glossed examples (IGT) from more than 150 languages (see:Portal of Languages). Examples can be exported in various formats. You can also use TypeCraft to create your own Interlinear Glossed Text for any language. You can store the data in your own privat space, or share your work with a group. The TypeCraft Wiki consists of 286 articles which discuss mostly less described languages and address linguistic questions, often embedding Interlinear Glossed Texts drawn from the database. Viewing existing data in the database or reading articles in the wiki do not require a login, whereas creation of data and writing in the wiki requires a login (upper right corner). To access the database and data editing functions, push the TypeCraft Tools button in the upper left corner, and to access the wiki pages, write part of the name of the page wanted into the Search Mediawiki slot in the upper right cormer. The Quick Start page gives a general introduction to the use of TypeCraft, and information about system updates.
IGT data and the TypeCraft Database
The TypeCraft database as of now consists of 2848 texts, each consisting of a smaller or larger number of phrases. There are at present 145 unique languages in the database. Our Portal of Languages is a dynamic list of those languages that have more than 5 public texts in the database. Through the Portal you can download texts and phrases using different export formats, as explained in Quick Start page.The TypeCraft IGT comes in the form illustrated below, an annotation of a phrase from Akan:
TypeCraft data can be searched on any of the levels here illustrated - from the text to the morph level, using TypeCraft Tools -> TypeCraft Search -> "Text search" or "Phrase search". Search results can be freely downloaded in various formats, for instance to TC's own wiki as in the example. The Quick Start page provides information on how to proceed for search and export, as well as for creating IGTs.
The TypeCraft Wiki
The category page Languages offers an overview over the TypeCraft Wiki pages ordered by language, with Runyankore-Rukiga as an example of a category of language specific pages. The following pages instantiate views into typological variation: Typological Features, Grammar squibs. Specific areas treated in depth are for instance valency across languages, with contributions accessed at Valence - general and Valence by language. Various research projects are presented under Projects. .
The TypeCraft Database
The TypeCraft database as of now consists of 2848 texts. There are at present 145 unique languages in the database. Our Portal of Languages is a dynamic list of those languages that have more than 5 public texts in the database. Through the Portal you can download texts and phrases using different export formats, as explained in Quick Start page .
Nov. 7, 2011
Data-driven Valence Typology (DVT) is a project where we seek to represent the characteristic sentence construction types of a language – called its c-profile - in a transparent, detailed and non-theory-biased format, drawing from a common, restricted repertory of analytic-descriptive primitives, cf. . By adhering to a common classification system, DVT in principle allows for its data to be searchable in a relational database. DVT has so far been developed with a view to cover significantly different languages (Ga from the Niger-Congo family Kwa, Norwegian from Germanic, and Kistaninya from Ethio-Semitic), while in a current phase the project has a more ‘micro-comparative’ focus, in showing how a profile for one language of a given family can be derived from the c-profile of another language in the same family. In Germanic we envisage such extensions with regard English and German, and in Kwa/Gur with regard to Dangme and Gurene.
In situating DVT among current projects and initiatives, it can perhaps be most directly related to VerbNet , its non-computational predecessor in Levin's work , and a cross-linguistic development of the latter, the Leipzig Valency Classes Project.
In future publications we will show how an inventory of verb classes in the Levin approach can be derived from a DVT c-profile and an accompanying verb construction lexicon, as are available for Ga , and for Norwegian . We will also assess the notion of ‘valence alternation’ as a comparison unit, by itself notoriously difficult to define, and show that for the 150 most salient frames in Ga, none of them are interconnected by any of the ‘alternation’ patterns which are commonly applied in the European setting. We will advocate DVT as offering a sounder general basis for valence typology, not being directly dependent on notions like 'alternation'.
Further pages at this site giving information about the project include:
2. Verbconstructions cross-linguistically - Introduction, a predecessor of , and introducing the system particularly as applied to Norwegian, with wiki pages illustrating the by then established c-profile of Norwegian, with annotated examples for each type.
3. The following TypeCraft annotated texts:
Ga sentence types Mary Esther Kropp Dakubu Norwegian verb constructions Lars Hellan Verb constructions in Kistaniniya Bedilu Debela
- Hellan and Dakubu 2010 Identifying Verb Constructions Cross-linguistically, SLAVOB series 6:3, University of Ghana, 2010
- Levin 1993 English Verb Classes and Alternations, University of Chicago Press, Chicago,IL
- Leipzig Valency Classes Project
- Dakubu 2011 Ga Verbs and their Constructions
- Hellan 2011 Norwegian Verbs and their Constructions