The Design and Knowledge Representation of 

the Professional Activity Domain Sentence Category

Miao Jian-ming (Signal and Information Processing)
Directed by Zhang Quan


Abstract

Chinese sentences are constructed by the meaning of the words. We take the sentence group as the example. The sentence group is the sentences which have the central subject, and each sentence indicate the characteristic and knowledge of the subject. The knowledge is called the domain knowledge by the HNC theory. The domain knowledge both must be able to manifest the semantic role of the each main constituent part, and have to be able to point out its grammar ingredient. We use the sentence category expression to formalize the domain knowledge.

This dissertation uses the formalized method of the sentence category expression to organize the domain knowledge, and forms the domain sentence category knowledge which can be used by computer in the sentence group processing. This dissertation takes the symbolism of concept element as the starting point, and summarizes the domain knowledge which be contained in the concept extending structure, through the effective organization, and expressed by the way of the domain sentence category expression, and produces the correlation the concept connection expression.

As to methodology, this dissertation focuses on analysis and induction, and the process of analysis embodied in description of the concept nodes and the extended structure, and the process of induction embodied in the further reorganization of the knowledge of the nodes analysis.

According to the many kinds of designs content of the HNC concept extended structure, this dissertation proposed the corresponding four design principle of the domain sentences category, and proposed the process of domain sentence category design further refined into four steps. The steps are the analysis of concept node, the induction of the domain knowledge, the design of the domain sentence category expression and the design of the concept connection expression. The knowledge library of the domain sentence category is used in the promotion of the sentence category analysis system to the sentence group unit extract.

The main points of the contribution in this dissertation are listed following:

(1) Formed the description method of the domain sentence category knowledge in the HNC concept element symbolism. The concept element symbolism has revealed the semantics and the systematic characteristic of the concept element, and described the connection characteristic between the concepts. This dissertation defines the corresponding extended structure of the concept and its the concept connection knowledge; the sentence category expression can effectively manifest the semantic in-depth structure of the sentence. This dissertation takes the domain concept in the symbolism of concept element as the starting point, and takes the sentence category expression type as the outline, and formed the formalized method of the domain sentence category expression.

2Formed the general method of the domain sentence category knowledge design in the HNC concept element symbolism foundation, and proposed the design concrete steps. This dissertation takes the extended structure in the symbolism of concept element as the starting point, through the concrete analysis of the extended structure, and obtains the outline of the knowledge design of the high level concept node; and further carries on the analysis to the first floor concept node, and take the Action-Effect chain as the center, and induce the domain knowledge of the lower level extended structure; and took the sentence category knowledge as the instruction, and assigned the semantic role for the semantics block content, and determined its sentence category code, and finally forms the domain sentence category expression; Under the overall frame of the domain sentence category, this dissertation analysis the semantic block content and the concept node itself, and finally obtained various concepts connection knowledge, through the processing of the HNC mapping mark formalization, and produced the concept connection expression.

(3) Proposed the design principle of the domain sentence category expression. The design principle of the graduation system has effectively solved the boundary problem of the domain knowledge induction concentration knowledge; The design principle of the Action-Effect has guaranteed the overall extrication to the Action-Effect chain criterion in the domain knowledge design process; The design principle of the extended structural made the corresponding design principle to the differently extended structure node design; the design principle of the sentence conformity has solved the application question which the domain sentence category knowledge be used in the sentence category analysis system.

(4) Realized the induction and formalized expression with the domain knowledge of the four big domains concepts forest in the professional activity domain. Through the statistical analysis to the real news language material, these four big domains concept tree (general character, politics, economy, culture) have covered approximately 56% domain space. We designed the corresponding domain sentence category expression for each domain concept, and have disposed the concept connection expression. These research results will become the main body content of the domain sentence category knowledge library, and will builds the solid foundation for the completion of the domain sentence category knowledge library.

(5) Explored the application of the domain sentence category knowledge in the sentence category analysis system. The domain sentence category knowledge finally serves the sentence category analysis system, on the one hand the domain sentence category knowledge may enhance the sentence group handling ability of the sentence category analysis system, on the other hand, and simultaneously the domain sentence category knowledge may enhance the handling ability to the new word and the semantics cutting fuzzy of sentence kind of analysis system, and so on. The domain sentence category knowledge will make the sentence category analysis system surmounted from the first referral level to the first referral level.

In summary, under the HNC theory frame, this dissertation has studied the design question of the domain sentence category, and proposed the corresponding design procedure and the principle of design, and have carried on the domain sentence category concrete design of the four big domains concepts forest in the professional activity. This dissertation research results will be helpful the related sentence group processing research in a deepened sentence category analysis.

Key words:  Hierarchical Network of Concepts(HNC) theory; Sentence Category annlysis(SCA); Sentence Group(SG); Domain Sentence Category(DSC); Concept connection expression