A short history
Since the mid eighties, groups of researchers and students at the Norwegian University of Science and Technology have explored the use of formal and computational linguistic methods for natural language applications. The formalization and encoding of morpho-syntactic and semantic information, both at lexical and phrasal level, has been a central theme for a group which in 2004 took the name LingLab which then became the research group in digital linguistics
At present the group has two focal areas: Grammar Engineering and Language Development.
Grammar Engineering at LingLab
In Grammar Engineering the main application developed by LingLab is the Norwegian computational grammar NorSource (Lars Hellan, Dorothee Beermann and Ben Waldron). Together with partners in the DELPH-IN network LingLab applies Head-Driven Phrase Structure Grammar (Pollard and Sag (1994) and Minimal Recursion Semantics (MRS)(Copestake et.al.2005) to advance deep natural language processing.
As part of this work Pavel Mihaylov developed for LingLab an LKB multi-script interface called Trollet.
A further effort to represent lexical and construction level information is the Construction Labeling Project Verbconstructions cross-linguistically - Introduction, a system for encoding construction types across languages Lars Hellan.
Support for lesser-described languages
Between 1996 - 2009 the former linguistic department at NTNU and the linguistic departments at the University of Ghana, Legon cooperated in a project that was sponsored by the Norwegian NUFU programme, which is now terminated. The name Legon Trondheim Linguistics Project is that by which the project was usually referred to, while its official name is Computational Lexicography, Typology and Adult Literacy. The Ghanaian coordinator was the Head of the Linguistics Department in office. For more information about this project consult also the following page.
As part of this project the TypeCraft development started in 2005 with the goal to allow a better project internal management for linguistic data, and in order to offer a linguistic tool for the creation of morpheme-level glossed data from African languages. The idea was tha the tool should be tailored to the needs of students working with lesser-described languages, and that the students should be able to exchange and discuss the data. The first prototype of the system was developed in cooperation with Businesscape, an NTNU spin-off IT-company, led by Atle Prange]
TypeCraft itself is a product of LingLab's effort in supporting data-driven language description and analysis. After an early prototype of TypeCraft, presented at the University of Ghana, Legon and at the Texas Linguistic Society in Austin in 2006. TypeCraft v.1.0 was developed as a joint-effort of Pavel Mihaylov and Dorothee Beermann. In August 2014, TypeCraft v.2.0 was presented which is a co-development of the TypeCraft team and CIDLeS, the Interdisciplinery Centre of Social and Language Documentation
A short description of TypeCraft
TypeCraft is an online application consisting of a natural language database and a linguistic editor for interlinear glossing. The user adds linguistic annotation to written material which is stored in a relational database from where it can be retrieved using multiple views. The system is wrapped into a customised mediawiki which serves as an entrance port to the system.
Texts as well as annotations are in Unicode. Annotated data can be exported to standard text editors (WORD, Open Office and LaTex)as well as to XML format. TC has been designed for projects on minority languages. TypeCraft features an intuitive user interface and allows distributive usage. The application is written in Java using PostgreSQL database. It is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim.
TypeCraft can be freely used. To use the TypeCraft Editor the user needs to be logged in.