[Prev][Next][Index]
[XTAG Summary]: feature errors
-
To: anoop@unagi.cis.upenn.edu
-
Subject: [XTAG Summary]: feature errors
-
From: Anoop Sarkar <anoop@unagi.cis.upenn.edu>
-
Date: Thu, 29 Jul 1999 20:32:21 -0400 (EDT)
-
Reply-to: anoop@linc.cis.upenn.edu
Minutes from the XTAG meeting on 22nd July 1999
-----------------------------------------------
Errors found in XTAG feature structures after type-checking against a
signature.
There were four main types of errors found:
1. Grammar-based: ambiguous names, or ambiguous functions for
features.
2. Typos.
3. Undocumented.
4. Incorrect or obsolete feature.
The errors discussed during the meeting were the following. There are
a few others in the list, but they are of the same types as the ones
below.
1. <assign-comp> has some non-overlap in feature values compared to <comp>
<assign-comp> extras = { ecm, ppart_nil, none }
<comp> extras = { nil }
- type of error: possibly ambiguous, grammar-based
- fix: possible duplication of effort for <assign-comp>
- cannot fix without detailed knowledge of <assign-comp>
2. <case> = nom ==> nominative
<mode> = nom ==> nominal/adjectival predicate
- type of error: typo
- fix: change <mode> = nom to <mode> = pred
3. <rel-pron> has overlap with <mode> but has extra values
<rel-pron> extra values = { ppart, ger, adj_clause }
- type of error: possibly ambiguous, grammar-based
- fix: dunno
4. <relpron> exists alongside <rel-pron>
- type of error: typo
- change <relpron> to <rel-pron>
5. <disc-conj> not documented
- type of error: documentation
- fix: add to tech-report
- prolly stands for discourse conjunction
6. <predet> occurs in noun and det signs
- type of error: incorrect feature
- fix: should be removed since Carlos thinks it was widely removed
- from the grammar at some point and remainder were skipped
7. <gen> occurs ambiguously, stands for 'gender' as well as 'genitive'
- type of error: ambiguous, grammar-based
- fix: should change all occurences of <gen> in noun signs to
- another name, say <gendr>
8. In one tree, NP.b:<assign-case> = N.t:<case>
- type of error: typo
- fix: change <assign-case> to <case>
9. <punct> is defined as a complex feature, e.g. <punct contains
comma> but instances of <punct> are found in the grammar.
D_r.b:<punct bal> = Punct_1.t:<punct>
- type of error: incorrect feature
- fix: dunno
10. In tree PsPu:
S_f.t:<assign-comp> = ind/...
- type of error: unknown
- check lexical features to see if 'ind' is allowed for
- <assign-comp>
11. Should we have case on sentential subjects?
- type of error: grammar-based
- fix: remove all instance of <case> = nom or <assign-case> = nom
- in sentential subject tree families.