LinguisticFrames0

advertisement
Linguistic Frames v0 WORKING DRAFT
1 Introduction
BPP11 Project 4 task 2 is developing fact extraction techniques to turn NL sentences,
such as the SYNCOIN corpus, into Controlled English (CE).
A key aspect of this work is the development of “linguistic frames” to specifying how
syntactic expressions in an NL (or CNL) may be structured and may be linked to the
ACM semantics. It is proposed that these frames be used for both the NL parsing and the
configuration of the extended CE parser.
The processing relies upon a set of models, as described in [1], and it is these models,
together with the linguistic frames, that embody our current theory of linguistic
processing.
[1] The CE linguistic models, Mott, D., November 2011, ITACS
https://www.usukitacs.com/node/1852
2 CE sentences to define the linguistic frames
The linguistic frames are written in CE sentences which gather together all of the
syntactic and semantic information and which is used to configure a revised version of
the CE parser; thus allowing the potential for human configuration of the syntax and
semantics of their desired dialect of CNL. One aim of the linguistic frames was to ensure
that the same parse trees were constructed for the CE parser as for the Stanford NL
parser, in keeping with our plan to use standard English grammar structures to parse the
CNL. However we have added an "annotation" capability to allow the syntax trees to be
provided with further structural information, enhancing the understandability of the
phrase structures.
The following structures are contained in the linguistic frame:





the name of the structure being defined
the syntactic sequence of words and phrases
the semantic statements that link items in the syntactic sequence with entities in
the ACM
examples
annotations (that are added to the parse tree)
The set of possible sentences that may be used to express the semantics include:
TBD
D Mott (IBM), S. Poteet, (Boeing)
1
Linguistic Frames v0 WORKING DRAFT
3 Sample Linguistic Frames
The following frames have been defined to represent the basic syntactic structures in the
current CE specification, as exemplified by “there is a person named Joe”, “the person
Joe is married to the person Jane”, “the person John has the person Fred as brother”. In
addition a simple extension to allow predicated adjectives is provided (“the person john is
tall”) .
These are experimental and under development.
--===================== START ==========================================
there is a linguistic frame named start that
defines the root ROOT and
has the sequence ( the sentence phrase SENT_start )
as syntactic pattern and
has the statement that
( the root ROOT expresses the same as the sentence phrase SENT_start )
as semantic statement
.
--===================== SENTENCE =====================================
there is a linguistic frame named s1 that
defines the sentence phrase SENT_s1 and
has the sequence ( the noun phrase SUBJ_s1 , and the verb phrase COMP_s1 )
as syntactic pattern and
has the statement that
( the noun phrase SUBJ_s1 is predicated on the thing TSUBJ_s1 ) and
( the verb phrase COMP_s1 is predicated on the thing TCOMP_s1 ) and
( the thing TSUBJ_s1 is the same as the thing TCOMP_s1 )
as semantic statement
.
--===================== VERB PHRASE =====================================
there is a linguistic frame named vp0 that
defines the verb phrase VP_vp0 and
has 'is the dog Fido' as example and
has copula as annotation and
has the sequence ( the copula VB_vp0 , and the noun phrase OBJ_vp0 )
D Mott (IBM), S. Poteet, (Boeing)
2
Linguistic Frames v0 WORKING DRAFT
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the noun phrase OBJ_vp0 is predicated on the thing TOBJ_vp0 ) and
( the thing X is the same as the thing TOBJ_vp0 )
as semantic statement
.
there is a linguistic frame named vp0a that
defines the verb phrase VP_vp0a and
has 'is red' as example and
has adj as annotation and
has the sequence ( the copula VB_vp0a , and the adjective ADJ_vp0a )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the adjective ADJ_vp0a expresses the entity concept EC_vp0a ) and
( the thing X is categorised as the entity concept EC_vp0a )
as semantic statement
.
-------------------------------------------------------------------------there is a linguistic frame named vp1 that
defines the verb phrase VP_vp1 and
has 'hits the dog' as example and
has complement as annotation and
has the sequence ( the present third singular verb VB_vp1 , and the noun phrase
OBJ_vp1 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the verb VB_vp1 expresses the relation concept RC_vp1 ) and
( the noun phrase OBJ_vp1 is predicated on the thing TOBJ_vp1 ) and
( the thing TOBJ_vp1 is categorised as the entity concept EC_vp1 ) and
( the relation concept RC_vp1 has the entity concept EC_vp1 as range ) and
( there is a situation named SVP1 that
is categorised as the concept RC_vp1 and
has the thing X as agent role and
has the thing TOBJ_vp1 as patient role )
as semantic statement
D Mott (IBM), S. Poteet, (Boeing)
3
Linguistic Frames v0 WORKING DRAFT
.
------------------------------------------------------------------------there is a linguistic frame named vp2 that
defines the non finite verb phrase VP_vp2 and
has 'named John' as example and
has namingpastpart as annotation and
has the sequence ( the naming past participle VBN_vp2 , and the noun phrase
NAME_vp2 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the noun phrase NAME_vp2 is predicated on the thing TNAME_vp2 ) and
( the thing X is the same as the thing TNAME_vp2 )
as semantic statement
.
------------------------------------------------------------------------there is a linguistic frame named vp3 that
defines the verb phrase VP_vp3 and
has 'has the person John as father' as example and
has possession as annotation and
has the sequence ( the possession verb HAS_vp3 , the noun phrase NP_vp3 , and the as
prepositional phrase PP_vp3 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the noun phrase NP_vp3 is predicated on the thing TNP_vp3 ) and
( the prepositional phrase PP_vp3 is predicated on the attribute concept AC_vp3 ) and
( there is a possession named S_vp3 that
is categorised as the attribute concept AC_vp3 and
has the thing X as agent role and
has the thing TNP_vp3 as patient role )
as semantic statement
.
--===================== NOUN PHRASE =====================================
D Mott (IBM), S. Poteet, (Boeing)
4
Linguistic Frames v0 WORKING DRAFT
there is a linguistic frame named np1 that
defines the noun phrase NP_np1 and
has 'the person PAULA' as example and
has the sequence ( the determiner DET_np1 , the noun COMMON_np1 , and the
proper noun NAME_np1 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the noun COMMON_np1 expresses the entity concept EC_np1 ) and
( the thing X is categorised as the concept EC_np1 and
is the same as the proper noun NAME_np1 ) and
( the noun phrase NP_np1 stands for the thing X )
as semantic statement
.
-------------------------------------------------------------------------there is a linguistic frame named np2 that
defines the noun phrase NP_np2 and
has 'there' as example and
has the sequence ( the existential EX_np2 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the thing X is an individual )
as semantic statement
.
-------------------------------------------------------------------------there is a linguistic frame named np3 that
defines the noun phrase NP_np3 and
has 'a person' as example and
has the sequence ( the determiner DET_np3 , and the noun COMMON_np3 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the noun COMMON_np3 expresses the entity concept EC_np3 ) and
( the thing X is categorised as the concept EC_np3 ) and
( the noun phrase NP_np3 stands for the thing X )
as semantic statement
.
D Mott (IBM), S. Poteet, (Boeing)
5
Linguistic Frames v0 WORKING DRAFT
------------------------------------------------------------------------there is a linguistic frame named np4 that
defines the noun phrase NP_np4 and
has 'a person named Joe' as example and
has 'postmod' as annotation and
has the sequence ( the noun phrase HEAD_np4 , and the non finite verb phrase
POSTMOD_np4 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the noun phrase HEAD_np4 is predicated on the thing THEAD_np4 ) and
( the non finite verb phrase POSTMOD_np4 is predicated on the thing
TPOSTMOD_np4 ) and
( the thing X is the same as the thing THEAD_np4 ) and
( the thing X is the same as the thing TPOSTMOD_np4 )
as semantic statement
.
-------------------------------------------------------------------------there is a linguistic frame named np5 that
defines the noun phrase NP_np5 and
has 'Joe' as example and
has the sequence ( the proper noun NAME_np5 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the thing X is the same as the proper noun NAME_np5 )
as semantic statement
.
--------------------------------------------------------------------------there is a linguistic frame named np6 that
defines the noun phrase NP_np6 and
has 'she' as example and
has the sequence ( the personal pronoun P_np6 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the thing X is an individual )
as semantic statement
.
D Mott (IBM), S. Poteet, (Boeing)
6
Linguistic Frames v0 WORKING DRAFT
--===================== PREP PHRASE =====================================
there is a linguistic frame named pp1 that
defines the as prepositional phrase PP_pp1 and
has 'as father' as example and
has the sequence ( the as preposition AS_pp1 , and the noun N_pp1 )
as syntactic pattern and
is predicated on the thing X and
has the statement that
( the noun N_pp1 expresses the attribute concept X )
as semantic statement
.
D Mott (IBM), S. Poteet, (Boeing)
7
Download