Typecraft v2.5
Jump to: navigation, search

Difference between revisions of "TypeCraft:About"

Line 2: Line 2:
 
Since the mid eighties, groups of researchers and students at the [http://www.ntnu.no/english/ Norwegian University of Science and Technology] have explored the use of formal and computational linguistic methods for natural language applications. The formalization and encoding of morpho-syntactic and semantic information, both at lexical and phrasal level, has been a central theme for a group which in 2004 took the name '''LingLab'''.
 
Since the mid eighties, groups of researchers and students at the [http://www.ntnu.no/english/ Norwegian University of Science and Technology] have explored the use of formal and computational linguistic methods for natural language applications. The formalization and encoding of morpho-syntactic and semantic information, both at lexical and phrasal level, has been a central theme for a group which in 2004 took the name '''LingLab'''.
  
At present '''LingLab''' has two focal areas: '''Grammar Engineering''' and '''Language Documentation'''.
+
At present '''LingLab''' has two focal areas: '''Grammar Engineering''' and '''Language Development'''.
  
 
====Grammar Engineering at LingLab====
 
====Grammar Engineering at LingLab====
Line 11: Line 11:
  
 
A further effort to represent lexical and construction level information is the Construction Labeling Project [[Verbconstructions cross-linguistically - Introduction]], a system for encoding construction types across languages [http://www.hf.ntnu.no/hf/isk/Ansatte/lars.hellan/personInfo.html Lars Hellan].
 
A further effort to represent lexical and construction level information is the Construction Labeling Project [[Verbconstructions cross-linguistically - Introduction]], a system for encoding construction types across languages [http://www.hf.ntnu.no/hf/isk/Ansatte/lars.hellan/personInfo.html Lars Hellan].
 +
 +
 +
===Language Development===
 +
Between 1996 - 2009 the former linguistic department at NTNU and the linguistic departments at the University of Ghana, Legon cooperated in a project that was sponsored by the Norwegian NUFU programme, which is now terminated. The name '' Legon Trondheim Linguistics Project'' is that by which the project is usually referred to, while its official name is Computational Lexicography, Typology and Adult Literacy. The Ghanaian coordinator was the Head of the Linguistics Department in office. For more information about this project consult also the following [[The Legon Trondheim Linguistics Project| page]].
 +
 +
As part of this project the TypeCraft development started in 2005 with the goal to allow a better project internal management for linguistic data, and in order to offer a linguistic tool for the creation of morpheme-level glossed data from African languages. The idea was tha the tool should be tailored to the needs of students working with lesser-described languages, and that the students should be able to exchange and discuss the data. 
 +
The first prototype of the system was developed in cooperation with Businesscape, an NTNU spin-off IT-company, led by [https://www.linkedin.com/in/atleprange Atle Prange]]
  
 
===TypeCraft===
 
===TypeCraft===
TypeCraft itself is a product of LingLab's effort in Language Documentation. A prototype of TypeCraft was developed in 2006 by
+
TypeCraft itself is a product of LingLab's effort in supporting data-driven Language Description and analysis. After an early prototype of TypeCraft, presented at the University of Ghana, Legon and at the Texas Linguistic Society in Austin in 2006. TypeCraft v.1.0  was developed as a joint-effort of [[User:Pavel|Pavel Mihaylov]] and [[User:Dorothee|Dorothee Beermann]]. In August 2014, TypeCraft v.2.0 was presented which is a co-development of the TypeCraft team and CIDLeS, the [http://www.cidles.eu/ Interdisciplinery Centre of Social and Language Documentation]
[http://www.hf.ntnu.no/hf/isk/Ansatte/dorothee.beermann/personInfo.html Dorothee Beermann] and Atle Prange at [http://fourspark.no/ Fourspark]. The present beta version of TypeCraft represents a joined effort by [[User:Pavel|Pavel Mihaylov]] and [[User:Dorothee|Dorothee Beermann]].
+
 
  
 
'''A short description of TypeCraft'''
 
'''A short description of TypeCraft'''
Line 24: Line 31:
 
TC has been designed for projects on minority languages. TypeCraft features an intuitive user interface and allows distributive usage. The application is written in Java using PostgreSQL database. It is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim.  
 
TC has been designed for projects on minority languages. TypeCraft features an intuitive user interface and allows distributive usage. The application is written in Java using PostgreSQL database. It is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim.  
  
TypeCraft can be freely used. To use the TypeCraft Editor the user needs to be login. Click on login in the right upper corner of  your browser.
+
TypeCraft can be freely used. To use the TypeCraft Editor the user needs to be login.

Revision as of 19:54, 1 August 2014

Background

Since the mid eighties, groups of researchers and students at the Norwegian University of Science and Technology have explored the use of formal and computational linguistic methods for natural language applications. The formalization and encoding of morpho-syntactic and semantic information, both at lexical and phrasal level, has been a central theme for a group which in 2004 took the name LingLab.

At present LingLab has two focal areas: Grammar Engineering and Language Development.

Grammar Engineering at LingLab

In Grammar Engineering the main application developed by LingLab is the Norwegian computational grammar NorSource (Lars Hellan, Dorothee Beermann and Ben Waldron). Together with partners in the DELPH-IN network LingLab applies Head-Driven Phrase Structure Grammar (Pollard and Sag (1994) and Minimal Recursion Semantics (MRS)(Copestake et.al.2005) to advance deep natural language processing.

As part of this work Pavel Mihaylov developed for LingLab an LKB multi-script interface called Trollet.

A further effort to represent lexical and construction level information is the Construction Labeling Project Verbconstructions cross-linguistically - Introduction, a system for encoding construction types across languages Lars Hellan.


Language Development

Between 1996 - 2009 the former linguistic department at NTNU and the linguistic departments at the University of Ghana, Legon cooperated in a project that was sponsored by the Norwegian NUFU programme, which is now terminated. The name Legon Trondheim Linguistics Project is that by which the project is usually referred to, while its official name is Computational Lexicography, Typology and Adult Literacy. The Ghanaian coordinator was the Head of the Linguistics Department in office. For more information about this project consult also the following page.

As part of this project the TypeCraft development started in 2005 with the goal to allow a better project internal management for linguistic data, and in order to offer a linguistic tool for the creation of morpheme-level glossed data from African languages. The idea was tha the tool should be tailored to the needs of students working with lesser-described languages, and that the students should be able to exchange and discuss the data. The first prototype of the system was developed in cooperation with Businesscape, an NTNU spin-off IT-company, led by Atle Prange]

TypeCraft

TypeCraft itself is a product of LingLab's effort in supporting data-driven Language Description and analysis. After an early prototype of TypeCraft, presented at the University of Ghana, Legon and at the Texas Linguistic Society in Austin in 2006. TypeCraft v.1.0 was developed as a joint-effort of Pavel Mihaylov and Dorothee Beermann. In August 2014, TypeCraft v.2.0 was presented which is a co-development of the TypeCraft team and CIDLeS, the Interdisciplinery Centre of Social and Language Documentation


A short description of TypeCraft

TypeCraft is an online application consisting of a natural language database and a INTERLINEAR GLOSSING EDITOR; the user adds linguistic annotation to written material which is stored in a relational database from where it can be retrieved using multiple views. The system is wrapped into a customised mediawiki which serves as an entrance port to the system.

Texts as well as annotations are in Unicode. Annotated data can be exported to standard text editors (WORD, Open Office and LaTex)as well as to XML format. TC has been designed for projects on minority languages. TypeCraft features an intuitive user interface and allows distributive usage. The application is written in Java using PostgreSQL database. It is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim.

TypeCraft can be freely used. To use the TypeCraft Editor the user needs to be login.