Typecraft v2.5

Difference between revisions of "Help:QuickStart"


Revision as of 19:52, 29 September 2014

This is the TypeCraft QuickStart page

For more information about linguistic online editing with TypeCraft, search in the TypeCraft database, exporting TypeCraft data, or editing the TypeCraft wiki, please consult one of the following TypeCraft wiki pages:


Help with the TC Editors

Annotation using the TypeCraft 2.0 editor

You enter the TC2 editor by clicking on New text on the TypeCraft navigation bar. The editor opens a dialog window with the following message:

Your text will use the new and better TC editor.
If for some reason you want to use the old editor choose  
*No* below. 

The word "text" in the phrase "Your text" above refers to any piece of digital writing that has not yet been linguistically annotated. To annotate this text in the new editor, press *Yes*.

the Editor text area-no text loaded yet

The editor's text area opens (see screenshot to the left). The Metadata matrix is accessible to the right. We offer a default Metadata template and a template for the Norwegian Centre for Writing and Writing Research. (You will find a *Change Metadata set* button at the end of the template.)

You enter your text by copying and pasting it from a file or from an online site into the text area. Next, you select the text's language in the metadata template and provide the rest of the metadata. You can always come back to this and fill in missing information.

Back in the text area, you now define what we call Phrases for annotation. TypeCraft phrases may be sentences or fragments. Text does not need to be annotated sentence by sentence, although that is also possible by .............. ............... .

You select an element for annotation by highlighting it. You then press the New Phrase button, which puts the selected element into the Phrase list. In the text area, this phrase now appears in green.

Now you can start the core annotation. You can do that in two ways: either double-click on one of the instantiated elements in the text area (the phrases shown in green), or open the Phrase list by clicking on *View Phrase list*.

When the phrase opens, a dialogue window pops up with the following message:

 TypeCraft wants to know 
 which of the following options you prefer:
 1. to insert full forms into the table directly (recommended).
      Choosing this option allows you to separate affixes from word stems
      in the input mask below. Insert hyphens "-" or spaces " " to indicate morph 
      or word boundaries and then click OK.
 2. to manually insert words from your phrase into the table.
      For this option, click *Cancel* and an empty table will appear.
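
The boundary convention in option 1 can be illustrated with a small Python sketch. It is not TypeCraft's own code: the function name `segment_phrase` and the row layout are hypothetical, and the only thing taken from the message above is that spaces mark word boundaries and hyphens mark morph boundaries.

```python
def segment_phrase(marked_phrase):
    """Split a phrase into words (on spaces) and each word into morphs (on hyphens).

    Hypothetical helper for illustration only; the row layout is an assumption.
    """
    table = []
    for word in marked_phrase.split():
        # Drop empty pieces caused by stray or doubled hyphens.
        morphs = [m for m in word.split("-") if m]
        table.append({"word": "".join(morphs), "morphs": morphs})
    return table


for row in segment_phrase("the dog-s bark-ed"):
    print(row["word"], row["morphs"])
```

Each resulting row corresponds to one word column of a pre-filled table, with its morphs listed in order.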

The tabular annotation editor

Depending on whether you chose to work with a pre-filled table that reflects your choice of morph boundaries or to start with a clean table, the tabular editor opens either pre-filled or empty.

You annotate by navigating through the table. We recommend that you add annotations to the tiers vertically by making use of the space bar. This method is in our experience the fastest.

 To learn more about the tiers and annotation tags and levels go to
         Multi-level linguistic annotation with TypeCraft

The *WORD* and the *MORPH* tiers feature a menu bar which allows you to modify existing entries. The menu bar appears when you activate the field you want to change. From the menu you can also add words or change a word's segmentation. *Gloss* and *POS* tags are chosen from a predefined list. You find an overview of all Gloss and POS tags on your navigation bar. These lists are auto-generated and can be ordered by category at your convenience. The lists also provide short definitions for each tag.

The annotation table is supplemented by a large note field. The content of the note field can also be searched, so you can use a designated marker to flag sentences that you would like to target in a search. For example, if the annotation of a phrase is questionable, you could add a question mark to the Note field. A search for "?" in the note field will then return only sentences with questionable annotations.
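
The marker idea can be sketched in Python on data held outside TypeCraft, for example after an export. The dictionary keys `text` and `note` and the function name are assumptions made for this illustration; they are not a documented TypeCraft format.

```python
def phrases_with_note_marker(phrases, marker="?"):
    """Return the phrases whose note field contains the marker.

    Hypothetical helper: the 'note' key is an assumed field name.
    """
    return [p for p in phrases if marker in p.get("note", "")]


# Invented sample data, for illustration only.
sample = [
    {"text": "First phrase", "note": "checked"},
    {"text": "Second phrase", "note": "? gloss of second word uncertain"},
    {"text": "Third phrase"},
]

for phrase in phrases_with_note_marker(sample):
    print(phrase["text"])
```

Choosing a marker that does not otherwise occur in your notes keeps such a search precise.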


Annotation for Sense

The focus of manual corpus annotation is changing, and linguists have recently taken a new interest in text annotation. Alongside research on semantic and pragmatic phenomena, writing research also has an interest in reliably annotating fragments larger than a single sentence, in order to determine writer proficiency from an early age. The new TypeCraft editor allows you to annotate text fragments in the context of the text that contains them, and to annotate words or word sequences for discourse senses.

   To learn more about the discourse senses in TypeCraft go to
         Multi-level linguistic annotation with TypeCraft


In order to add discourse senses to your annotation, go to your tabular editor. At the bottom of the annotation table you find the function *Add Discourse Senses*. You can use three additional tiers to add discourse annotations. At present we offer one experimental set of annotation tags for discourse senses. These tags can be accessed from the annotation bar, and just like POS and Gloss tags, the list of discourse sense tags is updated automatically from the database.



Annotation for Valence using the Valence description template

You enter the valence annotation mode from the tabular editor by pressing *Change* to the right of the label Valence, which you find above the word- and morph-level annotation table.

An additional annotation window, as shown in the screenshot to the left, appears and allows you to specify valency attributes using a predefined vocabulary.

Valence Annotation Schema

While the valence annotation schema is still under development, we allow at this point the input of the following attributes:

   * Syntactic Argument Structure   * Salient Sentence Pattern
   * Situation Type                 * Force & Eventuality
   * Diathesis                      * Modality
   * Adjunct of Interest            * Sentence Aspect
Drop-down window for the attribute Syntactic argument structure

Each of these attributes has a set of possible values. Some of the values for the attribute Syntactic Argument Structure are shown to the right as they appear in the drop-down menu.


   To learn more about Valence annotation in TypeCraft go to
         Multi-level linguistic annotation with TypeCraft



The Valence annotation is highlighted in yellow


When you have finished annotating valency, you return to your tabular annotation editor, where the valence values now appear as a hyphenated string showing the valency specifications that you have chosen for the phrase under annotation. This is illustrated in the screenshot to the left, where the Valence annotation is highlighted in yellow.










Annotation using the TypeCraft 1.0 editor

We recommend the use of the TC 2 editor.


Click *My Text* in the navigation bar. The TC Editor opens. You may now enter or copy-and-paste a text into the left part of the editor window. Do not add morph boundaries yet; at this point the TC Editor only accepts text strings. Before you start to tokenize your text strings, set the language of your text by going to the *CHANGE* button. TC uses the ISO-639-1 code for languages. Please use the drop-down window to select one of the ISO language names. Give your text a title and a title translation if appropriate. Text title and title translation will inform *Text search* and should therefore be chosen with care.

Tokenization

You can tokenise your text into sentences. This generally works quite well. At this point TypeCraft still has problems with periods, for example in titles like Mr. or Dr., and with semicolons. In order to tokenise your text or collection of sentences, press *CREATE PHRASES*; this will initiate the tokenization. Inspect the result before you choose *Yes* from the dialogue box. If you have not highlighted parts of your text, TC will ask you whether you would like to tokenize the whole text. Say *Yes*. The tokenization can be repeated several times until you are content with the result.
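
Why periods in titles are awkward for sentence tokenization can be seen in a minimal Python sketch. This is not how TypeCraft tokenizes; the abbreviation list and the function name `split_sentences` are assumptions chosen for illustration. The sketch splits after a token ending in ".", "!" or "?" unless that token is a known abbreviation, and it never splits on semicolons.

```python
# Assumed, deliberately short list of abbreviations that end in a period.
ABBREVIATIONS = {"Mr.", "Mrs.", "Dr.", "Prof."}


def split_sentences(text):
    """Naive sentence splitter that does not break after listed abbreviations."""
    sentences = []
    current = []
    for token in text.split():
        current.append(token)
        ends_sentence = token.endswith((".", "!", "?"))
        if ends_sentence and token not in ABBREVIATIONS:
            sentences.append(" ".join(current))
            current = []
    if current:
        # Keep trailing material that lacks final punctuation.
        sentences.append(" ".join(current))
    return sentences


print(split_sentences("Dr. Smith arrived. Mr. Jones left early; nobody noticed."))
```

A splitter without the abbreviation check would cut after "Dr." and "Mr.", which is the kind of result to look for when you inspect the tokenization before confirming it.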

Morph break-up

Select a sentence from the set of tokenised sentences which have now appeared on the right-hand side of your editor window. Click on one sentence. This will open a dialogue box. Follow the instructions in the dialogue box to insert morph boundaries into the annotation table.

Annotation Table

Navigate through the annotation levels vertically by making use of the space bar. The *WORD* and the *MORPH* tiers feature a menu bar which allows you to delete words or morphs; it appears when you click on the field that you would like to change in those rows. From the menu you can also add words or change the word segmentation. *Gloss* and *POS* tags are chosen from a predefined list. You find an overview of all Gloss and POS tags on your navigation bar. These lists are auto-generated and can be ordered by category at your convenience.