The Design and Knowledge Representation of
the Professional Activity Domain Sentence Category
Miao Jian-ming (Signal and Information Processing)
Directed by Zhang Quan
Abstract
Chinese
sentences are constructed by the meaning of the words. We take the sentence
group as the example. The sentence group is the sentences which have the central
subject, and each sentence indicate the characteristic and knowledge of the
subject. The knowledge is called the domain knowledge by the HNC theory. The
domain knowledge both must be able to manifest the semantic role of the each
main constituent part, and have to be able to point out its grammar ingredient.
We use the sentence category expression to formalize the domain knowledge.
This dissertation uses the formalized
method of the sentence category expression to organize the domain knowledge, and
forms the domain sentence category knowledge which can be used by computer in
the sentence group processing. This dissertation takes the symbolism of concept
element as the starting point, and summarizes the domain knowledge which be
contained in the concept extending structure, through the effective
organization, and expressed by the way of the domain sentence category
expression, and produces the correlation the concept connection expression.
As to methodology, this dissertation
focuses on analysis and induction, and the process of analysis embodied in
description of the concept nodes and the extended structure, and the process of
induction embodied in the further reorganization of the knowledge of the nodes
analysis.
According to the many kinds of designs
content of the HNC concept extended structure, this dissertation proposed the
corresponding four design principle of the domain sentences category, and
proposed the process of domain sentence category design further refined into
four steps. The steps are the analysis of concept node, the induction of the
domain knowledge, the design of the domain sentence category expression and the
design of the concept connection expression. The knowledge library of the domain
sentence category is used in the promotion of the sentence category analysis
system to the sentence group unit extract.
The main points of the contribution in
this dissertation are listed following:
(1)
Formed the description method of the domain sentence category knowledge in the
HNC concept element symbolism. The concept element symbolism has revealed the
semantics and the systematic characteristic of the concept element, and
described the connection characteristic between the concepts. This dissertation
defines the corresponding extended structure of the concept and it¡¯s the
concept connection knowledge; the sentence category expression can effectively
manifest the semantic in-depth structure of the sentence. This dissertation
takes the domain concept in the symbolism of concept element as the starting
point, and takes the sentence category expression type as the outline, and
formed the formalized method of the domain sentence category expression.
£¨2£©Formed
the general method of the domain sentence category knowledge design in the HNC
concept element symbolism foundation, and proposed the design concrete steps.
This dissertation takes the extended structure in the symbolism of concept
element as the starting point, through the concrete analysis of the extended
structure, and obtains the outline of the knowledge design of the high level
concept node; and further carries on the analysis to the first floor concept
node, and take the Action-Effect chain as the center, and induce the domain
knowledge of the lower level extended structure; and took the sentence category
knowledge as the instruction, and assigned the semantic role for the semantics
block content, and determined its sentence category code, and finally forms the
domain sentence category expression; Under the overall frame of the domain
sentence category, this dissertation analysis the semantic block content and the
concept node itself, and finally obtained various concepts connection knowledge,
through the processing of the HNC mapping mark formalization, and produced the
concept connection expression.
(3)
Proposed the design principle of the domain sentence category expression. The
design principle of the graduation system has effectively solved the boundary
problem of the domain knowledge induction concentration knowledge; The design
principle of the Action-Effect has guaranteed the overall extrication to the
Action-Effect chain criterion in the domain knowledge design process; The design
principle of the extended structural made the corresponding design principle to
the differently extended structure node design; the design principle of the
sentence conformity has solved the application question which the domain
sentence category knowledge be used in the sentence category analysis system.
(4)
Realized the induction and formalized expression with the domain knowledge of
the four big domains concepts forest in the professional activity domain.
Through the statistical analysis to the real news language material, these four
big domains concept tree (general character, politics, economy, culture) have
covered approximately 56% domain space. We designed the corresponding domain
sentence category expression for each domain concept, and have disposed the
concept connection expression. These research results will become the main body
content of the domain sentence category knowledge library, and will builds the
solid foundation for the completion of the domain sentence category knowledge
library.
(5)
Explored the application of the domain sentence category knowledge in the
sentence category analysis system. The domain sentence category knowledge
finally serves the sentence category analysis system, on the one hand the domain
sentence category knowledge may enhance the sentence group handling ability of
the sentence category analysis system, on the other hand, and simultaneously the
domain sentence category knowledge may enhance the handling ability to the new
word and the semantics cutting fuzzy of sentence kind of analysis system, and so
on. The domain sentence category knowledge will make the sentence category
analysis system surmounted from the first referral level to the first referral
level.
In
summary, under the HNC theory frame, this dissertation has studied the design
question of the domain sentence category, and proposed the corresponding design
procedure and the principle of design, and have carried on the domain sentence
category concrete design of the four big domains concepts forest in the
professional activity. This dissertation research results will be helpful the
related sentence group processing research in a deepened sentence category
analysis.
Key words: Hierarchical Network of Concepts(HNC) theory; Sentence Category annlysis(SCA); Sentence Group(SG); Domain Sentence Category(DSC); Concept connection expression