Typecraft v2.5
Jump to: navigation, search

Difference between revisions of "Norwegian Valency Corpus"

(Version 1.0 - trial version)
Line 21: Line 21:
  
 
The present version is a trial of a methodology described in [[to appear]], which potentially allows for a rapid increase in corpus size. The present version has clear errors, to be improved for a next stage.
 
The present version is a trial of a methodology described in [[to appear]], which potentially allows for a rapid increase in corpus size. The present version has clear errors, to be improved for a next stage.
 
 
<Phrase>622490</Phrase>
 
<Phrase>622491</Phrase>
 
<Phrase>622492</Phrase>
 
<Phrase>622493</Phrase>
 
<Phrase>622494</Phrase>
 
<Phrase>622495</Phrase>
 
<Phrase>622496</Phrase>
 
<Phrase>622497</Phrase>
 
<Phrase>622498</Phrase>
 
<Phrase>622499</Phrase>
 
<Phrase>622500</Phrase>
 
<Phrase>622501</Phrase>
 
<Phrase>622502</Phrase>
 
<Phrase>622503</Phrase>
 
<Phrase>622504</Phrase>
 
<Phrase>622505</Phrase>
 
<Phrase>622506</Phrase>
 
<Phrase>622507</Phrase>
 
<Phrase>622508</Phrase>
 
<Phrase>622509</Phrase>
 
<Phrase>622510</Phrase>
 
<Phrase>622511</Phrase>
 
<Phrase>622512</Phrase>
 
<Phrase>622513</Phrase>
 
<Phrase>622514</Phrase>
 
<Phrase>622515</Phrase>
 
<Phrase>622516</Phrase>
 
<Phrase>622517</Phrase>
 
<Phrase>622518</Phrase>
 
<Phrase>622519</Phrase>
 
<Phrase>622520</Phrase>
 
<Phrase>622521</Phrase>
 
<Phrase>622522</Phrase>
 
<Phrase>622523</Phrase>
 
<Phrase>622524</Phrase>
 
<Phrase>622525</Phrase>
 
<Phrase>622526</Phrase>
 
<Phrase>622527</Phrase>
 
<Phrase>622528</Phrase>
 
<Phrase>622529</Phrase>
 
<Phrase>622530</Phrase>
 
<Phrase>622531</Phrase>
 
<Phrase>622532</Phrase>
 
<Phrase>622533</Phrase>
 
<Phrase>622534</Phrase>
 
<Phrase>622535</Phrase>
 
<Phrase>622536</Phrase>
 
<Phrase>622537</Phrase>
 
<Phrase>622538</Phrase>
 
<Phrase>622539</Phrase>
 
<Phrase>622540</Phrase>
 
<Phrase>622541</Phrase>
 
<Phrase>622542</Phrase>
 
<Phrase>622543</Phrase>
 
<Phrase>622544</Phrase>
 
<Phrase>622545</Phrase>
 

Revision as of 19:24, 9 September 2017

Version 1.0 - trial version

--Typecraft (talk) 10:03, 18 July 2017 (CEST)

The corpus consists of 22000 sentences imported from the Leipzig Corpus Collection, all with the standard TypeCraft IGT annotation and with valency information for each verb occurrence, given in the form exemplified for ditransitive:

SAS: NP+NP+NP
FCT: ditransitive
SIT: ternaryRel
ConstructionLabel: v-ditr 

Here 'SAS' stands for 'syntactic argument structure', 'FCT' stands for 'functional characterization', 'SIT' for situation structure, and 'ConstructionLabel' for a code described at Verbconstructions cross-linguistically - Introduction. The valency information is stated relative to the ACTIVE form of the verb, even if the example provided is in passive. When doing search you can use either of these types of labels. The array of options within each type is explained and exemplified as follows:

SAS at Valency label 'SAS'
FCT at Valency label 'FCT'
SIT at [[]] 
ConstructionLabel at Valence Profile Norwegian (for illustrations using English, see
Valence Profile English).

Joint illustrations of them all are given in Valency code illustrations.

You can search relative to valency type in general, or specifically for a given verb, where the verb can be stated by citation form or by its actually occurring form. The search interface is the standard one for TypeCraft:

TypeCraft Tools (in upper left corner) -> TypeCraft Search -> Phrase search.

On this page choose 'Norwegian Bokmål' from the Language menu; at 'Phrase level', write (or glue) the valency label into the slot 'Phrase description'. If you want to search also relative to verb, enter the exact form of the verb under 'Word level - Exact form'. (The slot for its citation form is 'Morpheme level - Exact base form', however this search option is temporarily disabled. The same holds for any other search for morphological properties when done in conjunction with 'Phrase description'.)

The present version is a trial of a methodology described in to appear, which potentially allows for a rapid increase in corpus size. The present version has clear errors, to be improved for a next stage.