Typecraft v2.5
Jump to: navigation, search

About TypeCraft

Revision as of 12:37, 28 October 2008 by Dorothee Beermann (Talk | contribs)

under construction :=)

Background

Since the mid eighties, a group of researchers and students at the Linguistics Department at the Norwegian University of Science and Technology has explored the use of formal and shallow linguistic methods for natural language applications. The formalization and encoding of morpho-syntactic and semantic information, both at lexical and phrasal level has been a central theme for work done by our research consortium. In 2003 we took the name LinLab.

At present we have two focal areas: Grammar Engineering and Language Documentation.

Grammar Engineering at LingLab

In Grammar Engineering the main application developed by LinLab is the Norwegian computational grammar NorSource (Hellan, Beermann, Waldron). Together with other partners in the DELPH-IN network LinLab applies Head-Driven Phrase Structure Grammar (HPSG (Pollard and Sag (1994)) and Minimal Recursion Semantics (MRS)(Copestake et.al.2005) to advance deep natural language processing.

As part of this work LingLab has developed an LKB multi-script interface called Trollet (by Pavel Mihaylov).

As a further effect to represent lexical and construction level information is the Verbconstructions cross-linguistically - Introduction Construction Labeling Project, a system for encoding construction types across languages (Hellan).

TypeCraft

TypeCraft itself is a product of LingLab's effort in Language Documentation. A prototype of TypeCraft was developed in 2006 by Dorothee Beermann and Atle Prange at Businesscape. The present beta version of TypeCraft has been developed through a joined effort by Pavel Mihaylov and Dorothee Beermann.

Here a short description of TypeCraft:

TypeCraft is an INTERLINEAR GLOSSING EDITOR; the user adds linguistic annotation to written material which is stored in a relational database from where it can be retrieved using multiple views.

Texts as well as annotations are in Unicode. Annotated data can be exported to standard text editors (WORD, Open Office and LaTex) as well as to XML format. TC has been designed for projects on minority languages, joining linguists and native speakers. It is intuitive to use and allows wide and distributive usage.

TC uses PostgreSQL as database format. The database is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim. The application is written in the Java.

TypeCraft can be freely use online with Mozilla Firefox.