GUIDELINES FOR AUTHORS

advertisement
INTELLIGENT EXPANSION OF THE GEOINFORMATION SYSTEM
A. Ryumkin1, A. Yankovskaya2
1 Tomsk
State University of Architecture and Building,
2, Solyanaya Square, 634003, Tomsk, Russia, yank@tisi.tomsk.su, yank@tsuab.ru
2 Tomsk State University, 36, Lenin pr., 634050, Tomsk, Russia, rai@sibgeoi.tomsk.ru
The intelligent expansion tradition vector GIS with the use of intelligent recognizing system is proposed. Integration of two systems is put in a basis of intelligent expansion GIS –
typical vector GIS and the intelligent recognizing system based on software tool IMSLOG
with the purpose of union of functionalities of both. Original test methods of pattern recognition are used.
Introduction
There is a fairly wide list of commercial software products in the field of geoinformation
system (GIS) in the world scale as well as markets. The basic world leaders in the given
sphere of the software are products of the
American firms: ESRI, MapInfo, ERDAS, Autodesk, Intergraph [1]. Prominent features of
this software are: 1) presence of the advanced
means of representation of the graphic information both vector formats and raster; 2) opportunity of use of databases such widespread
DBMS as Oracle, MS SQL Server, dBase,
FoxPro and a number of others. The majority of
systems of the given class give users support of
a big number of the various geographical projections, advanced means of the analysis of the
graphic information represented in various formats, advanced means of graphic search.
However the enumerated software products do
not contain components using knowledge bases
for the solution of wide a number of problems,
and are not equipped with intelligent means,
which does not allow to solve practically important problems for which heterogeneity of the
processable information, an fuzzybility and
great volume of the data and knowledge are essential. The problems of decision-making at
planning, development of territories of cities or
regions, estimations of the real estate, an ecological condition, expediency of investments
[2] are typical applications. Research on application of methods of an artificial intellect in
GIS are at the initial stage. In widespread raster
GIS ERDAS IMAGINE, intended for processing images (including for GIS), subsystem
IMAGINE Expert Classifier using production
rules of knowledge representation at the solution of problems of classification has appeared
recently. In some vector GIS there are also elements of AI, but all this has trial character.
Problem of investigation developed by us is
construction of the geoinformation system possessing possibilities of representation, processing and application of knowledge on the
basis of original methods of an artificial intellect.
Bases of intelligent expansion of
geoinformation system
Integration of two systems is put in a basis of
intelligent expansion GIS – typical vector GIS
such as ArcVier [1] or the GrafIn [3] and the
intelligent recognizing system based on software tool IMSLOG [4] with the purpose of union of functionalities of both: effective means
of extraction, systematization, representation
and data processing about the territories realized in GIS, and intelligent means of the data
and knowledge analysis [5]; revealings of a various regularities in the data and knowledge; decision-making on the basis of a combination of
various schemes (mechanisms) of a inference
logic-combinatory (l-c) [5], logic-probabilistic
(l-p) [5,6], logic-combinatorial-probabilistic (lc-p) [7] and voting procedures on a set of solv-
ing rules constructed on a set of logic tests (unconditional and mixed, ones which are a compromise between unconditional and conditional
components) at each scheme of a inference as
well as various, oriented on users of different
qualification, graphic (including cognitive)
means of visualization of information structures, regularities, decision-making and substantiation of the decision-making results [8]
realized in intelligent system.
The structure traditional GIS usually includes
the advanced graphic editor and the block of
management of the attribute data, allowing for
the description of a subject area to use a composition of graphic and attribute components. In
vector GIS for this purpose a set of graphic
primitives (a point, a line, polygon) is used.
Each them is characterized by a set of the attribute data. In aggregate they form the digital
terrain models (DTM) equivalent to usual not
spatial databases. In GIS the modules of the
spatial analysis using geometrical operations
above graphic objects, as well as usual for
DBMS operations above tables of the data are
realized. Compositions of these operations, mutual transitions between them give the user
convenient means of work with (DTM) in the
dialogue. It is easy to see that similar models of
the data are also convenient for the application
to this problem area of methods of AI. One of
the major in construction GIS is the stage of
formation DTM on materials of aerial photograph, space shooting, or on available cartographical materials. Here, and also in many
other problems GIS methods of pattern recognition and images analysis [9] are used. Formalizing typical for GIS spatial relations and conditions it is possible to receive typical statement
of a problem.
It is supposed to carry out realization and preliminary debugging of the intelligent block of
system on the basis of methods of test patterns
recognition.
Applied methods
The test approach to pattern recognition assumes performance of the following stages.
1. Adaptive code conversion of characteristic
features of a different type (quantitative, nominal, serial) with the purpose of the maximal
partition of classes (patterns) and reduction a
kind used in a matrix methods of data and
knowledge representation [5,6].
2. The analysis of a database and knowledge on
consistency (check of paired crossing of descriptions of objects from different patterns
(classes on each mechanism of classification)).
3. Revealing of knowledge representativenes in
two ways: logic-combinatory with deep optimizing transformations and statistical methods
on the basis of so-called divergence of Koulbac
information; an estimation (on a basis of
Koulbac divergence) of additional volume of
training knowledge with a purpose of obtaining
reliable conclusions.
4. Construction of irredundant implication matrix U' (with simultaneous revealing constant,
steady, noninformative, obligatory, alternative
and dependent features and calculation of
weight coefficients of all features), assigning
either necessary and sufficient conditions or
sufficient conditions of distingrushability of
any pair the objects belonging to different patterns (to classes at each mechanism of classification) [5].
5. The finding of all shortest (all or a part irredundant) columns coverings of matrix U', to
each of which corresponds minimal (irredundant) distinguishing subset of features (minimal (irredundant) test), assigning necessary and
sufficient (sufficient) conditions of distingrushability of any pair of objects from different patterns. Revealing of nonexistent and pseudoobligatory (at construction only of a part irredundant column coverings of matrix U') features. Construction of all minimal (all or a part
irredundant) the unconditional and mixed tests
[10].
6. Construction of a set of the solving rules taking into account all found regularities and realizing a set of independent ways of recognition
(schemes of a logic inference) of the same ob-
ject under investigation (OUI). The number of
recognition ways is equal to the number of tests
used for recognition [5].
7. Recognition of the OUI by one of approaches
(at the l-c approach – on the basis of similarity
coefficients and taking into account of an admissible error of decision-making assigned by
the user [11]; at l-p approach - on the basis of a
partial implication at a partial ortogonalization
of some DNF of Boolean functions describing a
pattern in space of features included in the minimal (irredundant) test [6]; at l-c-p approach –
on the basis of similarity coefficients taking
into account of values probability of some features of the OUI) [6].
8. Acceptance of the final decision on results of
voting on a set of ways of recognition (tests)
and approaches.
9. Application of various cognitive means of
acceptance and a substantiation of decisions
and graphic means of visualization of information structures and the revealed regularities
[8].
We list the basic approaches to construction of
tests for 2-, 3-and k-values features: 1) with
construction irredundant implication matrix U';
2) with partial construction of the matrix U'; 3)
without construction of the matrix U'.
A particular approach is applied. This depends
on the dimension of knowledge (the number of
characteristic, classification features, the number of rows of matrix Q), of knowledge, and its
representativeness. In the first approach, one of
the following algorithms can be executed: a)
the search for all shortest column coverings
with simultaneous detection of the regularities;
b) the search of irredundant tests with simultaneous detection of obligatory features, with calculation of the weight coefficients feature and
with the use of a step-cyclic algorithm; c) the
search of the minimum and irredundant unconditional tests with the use of genetic algorithms.
At the second and third approaches the algorithms similar to algorithms b,c from the first
approach are used. Addition of columns at construction of minimal and irredundant descriptions matrices is carried out to nonexcluded
subset of columns, corresponded to the core of
diagnostic tests, and at removal columns the
ones corresponded to the core are not removed.
Algorithms of construction unconditional and
mixed minimal and irredundant tests at the first
approach in case of realization of algorithm a)
include the following steps: 1) calculation of a
distinguishing vector-function for the next pair
a class-class (an object-class, object-object)
from different classes at the fixed classification
mechanism (a pattern-pattern, an object-pattern,
object-object from different patterns for nonpartitioned matrix U'); 2) calculation of weight
features coefficients; 3) revealing of obligatory
features and, at presence of which, their inclusion in a core; 4) addition of the next value of a
distinguishing vector function to the current
matrix U; consecutive formation of the vector
used for revealing of constant features and vectors for each patterns (a class on each classification mechanism) for revealing steady features
and removing in the current matrix U of covering rows; 5) construction of matrix U' with
simultaneous revealing a part of regularities; 6)
Construction of a set of all possible shortest
(optimum) column coverings of a matrix U'
with the use of features weight coefficients; 7)
construction of the minimal (optimum) unconditional diagnostic test on each from the shortest (optimum) column coverings; 8) construction of the mixed test on the base of unconditional test with inclusion of obligatory features
in a unconditional component of the test; 9)
calculation of weights of diagnostic tests.
Let's note that features are considered obligatory if in matrix U there are rows containing only
one unit in columns corresponding to these features. Features are considered to be alternative
if corresponding to them columns matrix U are
equal. Features to which zero columns of matrix U correspond are not informative as far as
they do not distinguish any pair of objects from
different patterns. If the i-th the column of matrix U is covered by j-th (not equal to the i-th)
column corresponding i-th column the feature
distinguishes only some pairs objects from a set
of pairs, distinguished by j-th feature, and it is
considered a dependent one.
The features, which have not been included in
all shortest (irredundant) coverings of the matrix U', are considered nonessential at decisionmaking on the basis of minimal (irredundant)
tests and are not used for the description of OUI
and decision-making.
Search of all shortest column coverings of matrix U' is based on construction on matrix U' the
hierarchical system of submatrices being a tree
of search, and are reduced to selection of all
nonrecurrent shortest ways in a tree of search.
In a basis of algorithms of construction of a tree
of the mixed test [12] at the first approach (of a
partial tree of the mixed test on presentation of
the object description at the second approach)
on the unconditional test procedure of partition
of rows of a descriptions matrix on a set of reactions on values of obligatory and pseudoobligatory features lies.
Partition coefficient C is used for forming of a
conditional component of the mixed test. The
partition coefficient reflect a degree of nonintersection of patterns subsets corresponding to
different feature values. The feature with maximum coefficient C is chosen on every step of
construction of a tree of the mixed test. The
calculation procedure of a number of voices
given for the test is used for a final solution.
Conclusion
Intelligent expansion of a geoinformation system will allow to develop the existing traditional possibilities of geoinformation system directed at visualization of territory data, the spatial analysis with the use of solving rules allowing to carry out classification of spatial situations and to take administrative solution for the
specialized applications.
This work was supported by the Russian Foundation for Basic Research, project nos. 01-0100772 and 01-01-01050).
References
1. Kashkarev A.V., Tikounov V.S. Geoinformatics. М.,
Kartgeotzentr-Geodesisdat, 1993.
2. Rjumkin A.I., Choumichev I.I. Integration of geoinformation technologies and data of remote sounding
in problems of management of steady development of
a region // Interkarto-4. GIS for optimization of wildlife management with a view of steady development
of territories. Materials of the International conference. Barnaul, 1998, Pp. 232-240.
3. Skvortzov A.V. Tool GIS GraphIn // Geoinformatics
- 2000. Works of the international conference,
Tomsk, 2000, Pp. 90-96.
4. Yankovskaya A.E., Gedike A.I., Ametov R.V.,
Bleikher A.M. IMSLOG-2002 Software Tool for
Supporting Information Technologies of Test Pattern
Recognition // Pattern Recognition and Image Analysis. - 2003. - Vol. 13. - No. 2. - Pp. 243-246.
5. Yankovskaya A.E. Logical Test in Knowledge-Based
Recognizing system // Pattern Recognition and Image
Analysis. - 2001. - Vol. 11, No. 1. – p. 127.
6. Yankovskaya A.E. An Automaton Model, Fuzzy
Logic, and Means of Cognitive Graphics in the Solution of Forecast Problems // Pattern Recognition and
Image Analysis. - 1998. - Vol. 8, No. 2. - Pp. 154156.
7. Yankovskaya A.E. Logic-Combinational Probabilistic
Recognition Algorithms // Pattern Recognition and
Image Analysis. - 2001. - Vol. 11, No. 1. - Pp. 123126.
8. Yankovskaya A.E. Decision-making and argument of
decisions with the use of methods of cognitive
graphics on the basis of knowledge of experts of various qualification // Isv. The Russian Academy of Science. The theory and control systems. - 1997. - № 5 Pp. 125-128.
9. Ryumkin A.I, Kabanov M.M., Kapustin S.N., Fuks
A.L., Chumichev I.I Control of the Territory Condition using the Space Survey in Optical Range // Pattern Recognition and Image Analysis 1999, vol. 9 №
2, p. 380.
10. Yankovskaya A.E., Gedike A.I. Construction and
Evaluation of Compressed Descriptions of Patterns in
an Intelligent Recognizing System // Pattern Recognition and Image Analysis. - 1999. - Vol. 9, No. 1. - Pp.
124-127.
11. Yankovskaya A.E. Minimization of Orthogonal Disjunctive Normal Forms of Boolean Function to be
Used as a Basis for Similarity and Difference Coefficients in Pattern Recognition Problems // Pattern
Recognition and Image Analysis. - 1996. - Vol. 6, No
1. - Pp. 60-61.
12. Yankovskaya A.E., Kouzovatkin A.N. Decisionmaking in intelligent software tool IMSLOG 2002 on
the basis of mixed diagnostic tests // Information systems and technologies. Materials of the international
conference. Tom 3. - Novosibirsk: NGTU, 2003. - Pp.
182-186.
Download