Springer

advertisement
The ICOM Project
Computational Intelligence: Principles and Applications
Gerson Zaverucha, Valmir Barbosa, Felipe França, Ines de Castro Dutra, Luis Alfredo de Carvalho, Luiz Adauto
Pessoa and Priscila Lima
COPPE-Sistemas Universidade Federal do Rio de Janeiro
Marley Vellasco and Marco Aurélio Pacheco
ICA - Pontifícia Universidade Católica do Rio de Janeiro
Teresa Ludermir and Wilson de Oliveira
DI- Universidade Federal de Pernambuco
Victor Navarro da Silva
CEPEL
Mario Fiallos Aguilar
EE - Universidade Federal do Ceará
Suzana Fuks
CNPS – EMBRAPA
Alberto de Souza and Marcos Negreiros
DI- Universidade Federal do Espirito Santo
Abstract: In this paper, we present the results obtained by the ICOM project. We have developed and implemented some neural,
symbolic and hybrid propositional and first order theory refinement systems.We also describe our work on ANN compilers and
parallelization, on genetic algoritms and computational neuroscience. The systems developed in this project were applied to:
computational biology, real world electric power systems fault diagnosis, load forecasting and overflow control of electric generator
reservoir, inductive design of Distributed Object Oriented Database, Data Mining, agriculture, image classification and circuit design.
Finally, we present some conclusions.
Keywords: Computational Intelligence, Hybrid Systems, Artificial Neural Networks, Theory Refinement Systems, Inductive Logic
Programming, Applications.
1. Introduction
This project‘s main objective was the research in Computational Inteligence, its principles and applications.
The development and application of Computation Intelligence systems has been one of the main growing areas of
research in Computer Science [Kandel and Langholz 92], which can be seen by the ever increasing number of
conferences and workshops. These systems use and integrate a number of AI techniques in a new paradigm to reproduce
aspects of human behaviour such as reasoning, learning and perception. Among these techniques we have: Neural
Networks; Classical, Fuzzy and Nonmonotonic Logics; Inductive Logic Programming; Genetic Algorithms and Bayesian
Networks.
Each one of these AI techniques have individually shown its importance and its strengths. But it is through their
integration, considering each one of their characteristic strengths, that one can build powerful and robust hybrid
intelligent systems, allowing to efficiently solve problems in different areas that were not possible with conventional
methods.
Taking into account the evolution of computer science for massively parallel architectures [Barbosa 93], intelligent
systems, to be effective, could be based on some artificial neural network’s model. Notwithstanding, they should be able
to justify their decision making process while learning, acquiring a higher level of formalism. Therefore, it is very
important to outline tight correlation between symbolic knowledge and artificial neural networks, assigning to the later an
“explanation capability”.
“Right now we are caught in the middle; neither purely connectionist nor purely symbolic systems seem able to
support the sorts of intellectual performances we take for granted even in young children. I’ll argue that the solution lies
somewhere between these two extremes, and our problem will be to find out how to build a suitable bridge... Above all,
we currently need more research on how to combine both types of ideas” [Minsky 91].
1
“Connectionism and Automated Inferencing are two such subfields which have been developed as if they were
independent although they both are concerned with high-level reasoning. Automated Inference Systems can only be
adequate if they are based on a massively parallel computational model. The super computing community is convinced
that by the year 2000 the fastest computers will be massively parallel. Hence, massive parallelism may serve as a vehicle
to reconcile the research in Connectionism and Automated Inferencing. We do not just want to build systems which show
intelligent behavior, we also want to understand how humans do it.” [Holldobler 93]
In this paper, we present the results obtained by the ICOM project. We have developed and implemented some neural,
symbolic and hybrid propositional and first order theory refinement systems, which are described in section 2.1; in
section 2.2 we describe our work on ANN compilers and parallelization, on genetic algoritms and computational neuroscience.
Section 3 presents some important applications of systems developed in this project, among them: real world electric
power systems fault diagnosis, load forecasting, overflow control of electric generator reservoir, inductive design of
Distributed Object Oriented Database, Data Mining, agriculture, image classification and electronic circuit design.
Finally, section 4 draws some conclusions.
2. Principles
2.1 Theory Refinement Systems
Theory refinement systems combine analytical and inductive learning, that is given a theory, describing a domain
problem, which can be incorrect and/or incomplete, it is refined with examples by applying a learning algorithm such that
it correctly classifies all of them and is able to generalize to unseen ones. The use of background (prior) knowledge has
shown to lead to faster training and better generalization, specially when there is a reduced number of training examples
[Pazzani, Kibler 92].
Knowledge-based neural networks (KBNN) are propositional theory refinement systems using neural networks that
promotes the integration of symbolist and connectionist paradigms of AI ([Shavlik 94]. Propositional symbolic
knowledge (like rules) describing a domain problem is mapped into a neural network through an insertion algorithm, then
this network is trained with examples (refinement) and finally an extraction algorithm is applied to it generating a
refined propositional symbolic knowledge. KBNN systems like KBANN [Towell and Shavlik 94], RAPTURE [Mahoney
and Mooney 93] and CASCADE ARTMAP [Tan, Ah-Hwee 97].have been shown to outperform purely neural and
symbolic inductive learning systems.
2.1.1 The Connectionist Inductive Learning and Logic Programming System
In this project, [Garcez and Zaverucha 99, Garcez et al 96,97] developed a KBNN called the Connectionist Inductive
Learning and Logic Programming (CIL2P) system. Starting with the background knowledge represented by a
propositional normal (general) logic program [Lloyd 87], a translation algorithm is applied generating a partially
recurrent three layer neural network ([Jordan 86]) network:
 the number of hidden units is equal to the number of clauses in the logic program;
 for each clause, its head is mapped to an output unit and each literal that appears in the body its atom is mapped to an
input unit with weight sign corresponding to the literal sign;
 for each atom that appears both in the head and the body of a clause, a recurrent link (with fixed weight equal to one)
is created from its correpondent output unit to its correpondent input unit.
Moreover, it imposes a well defined structure through the setting of threshold and weight values such that each unit in
the hidden layer works as an AND gate, being activated if and only if all units in the input layer connected to it are
activated, and each unit in the output layer works as an OR gate, being activated if and only if at least one unit in the
hidden layer connected to it is activated. This is similar to a set of clauses in a normal logic program: a consequent of a
clause is satisfied if and only if all of its antecedents are satisfied, and it is necessary to have at least one clause satisfied
in order to have the literal in its head proved. It is important to notice that, despite the fact that the units work as AND or
OR gates, their activation functions are bipolar semi-linear activation functions (tanh).
When the propositional normal logic program is acceptable [Protti and Zaverucha 98], a theorem was proved showing
that the neural network computes exactly what the logic program computes - its stable model [Gelfond and Lifschitz 88],
as a massively parallel system for Logic Programming.
This neural network can be trained with examples and the results obtained with this refined network can be explained
by extracting a revised logic program from it . This way, CIL2P offers [Holldobler and Kalinke 94]’s parallel model for
Logic Programming the capability to perform inductive learning from examples and background knowledge.
The symbolic knowledge extracted from the neural network can be more easily analyzed by a domain knowledge
expert that decides if it should feedback to the system, closing the learning cycle, or not, or even be used in analogous
domain applications.
The CIL2P system has been implemented in Delphi (Windows 3.x/95/NT) [Basilio 97, Basilio and Zaverucha 97] and
in the Stutgart Neural Network Simulator (SNNS) [Baptista and Kalinowksi 99].
The system have obtained very successful experimental results in two real-world problems in the molecular biology
domains from the Human Genome Project, specifically the promoter and splice-junction problems. Comparisons to the
results obtained by other neural (standard three layer feedforward network), symbolic (including FOCL ), and hybrid
inductive learning systems (including KBANN [Towell and Shavlik 94]) in the same domain knowledge showed the
effectiveness of CIL2P.
2.1.2 Rule Extraction From Neural Networks
The explanation capability of neural networks can be obtained by the extraction of symbolic knowledge from it,
through the so called “Rule Extraction” methods. It makes the knowledge learned accessible for an expert’s analysis and
allows the justification of the decision making process. This way, it allows ANN systems to be applied to safety-critical
problem domains, the verification and debugging of ANN components in software systems, improvement of the
generalisation of ANN solutions, data exploration and the induction of scientific. Moreover, since artificial neural
networks have shown to outperform symbolic machine learning methods in many application areas [Thrun et al. 91]
[Towell, Shavlik 94a], the development of techniques for extracting rules from trained neural networks gives rise to an
effective learning system that may contribute to the solution of the “Knowledge Acquisition Bottleneck” problem. The
knowledge extracted can be directly added to the knowledge base or used in the solution of analogous domains problems.
It is known that the knowledge acquired by a neural network during its training phase is encoded as: (i) the network
architecture itself; (ii) the activation function associated to it; and (iii) the value of its weights. As stated in [Andrews et
al. 95], the task of extracting explanations from trained artificial neural networks is the one of interpreting in a
comprehensible form the collective effect of (i), (ii), and (iii).
2
In knowledge-based neural networks, like KBANN and C-IL P, the training process, however, disturbs the structure,
by changing threshold and weight values in order to refine the knowledge based on the examples used for training. The
units no longer work as AND or OR gates, and, hence, the rule structure is lost. With this change, it is not trivial to
identify the knowledge encoded in the network.
An alternative to obviate the often high computational cost of extraction algorithms is to keep the rule structure during
training. This would allow the use of a decompositional extraction method - one that considers the topology of the
network, as opposed to pedagogical methods, which treat the network as a “black box” ([Andrews et al 95]). Pedagogical
extraction algorithms are “unbiased” methods, since no assumptions on the topology of the network are made. This often
leads these methods to be O(2n), for they consider as many subsets of the input space as necessary, to assure some
reliability.
In this project, [Menezes et al 98, 97] presented an approach for knowledge extraction which takes advantage of the
“rule structure” imposed to knowledge-based networks by the theory insertion algorithm. Keeping this structure during
2
the training process makes rule extraction an easier task. A penalty function was used for this purpose. The C-IL P
penalty function works similarly to the one in GDS ([Blasig 94]). Some differences, however, should be noted. GDS has
2
weights and thresholds converging to specific values, whereas C-IL P penalty function allows these values to vary in a
range. This increases the flexibility of the training process, and probably leads to better generalization results. Another
difference is the fact that it uses the same function to penalize weights and thresholds, what makes easier the
implementation of the method.
The investigation of the rule extraction problem in the RAM model of ANN known as Boolean Neural Networks
(BNN) [Ludermir et al 98] was also carried out in this project [Ludermir and Oliveira 98a]. Two methods have been
proposed for extracting rules from trained BNN [Ludermir and Oliveira 98b].The first one deals with the extraction of
symbolic grammatical rules from trained BNN [Ludermir and Oliveira 97]. The second one produces Boolean expression
rules from trained feedforward BNN [Barros et al 98, Ludermir 98]. The methods have been used with some benchmarks
and practical problems [1]. Proposal of a general theory of computation for dynamical systems, in general, and ANN, in
particular, via symbolic dynamics [Oliveira and Ludermir 98].
2.1.3 The Combinatorial Neural Model
Fuzzy neural networks have been introduced as a possibility to meet many of the requirements for hybrid intelligent
systems [Buckley & Eslami 96] [Buckley & Hayashi 94] [Buckley & Hayasi 95] [Gupta & Rao 94] [Kandel & Langholz
92] [Keller et al 92] [Lee & Lee 75] [Machado & Rocha 93] [Pedrycz 93] [Pedrycz & Rocha 93] [Rocha 92]. Such
networks, based on fuzzy logic theory, directly encode structured knowledge in a numerical framework, and can be seen
as trainable evidence-aggregation networks that utilize families of connectives from fuzzy set theory as neurons.
Because fuzzy sets lend themselves well to both a symbolic computational approach and a numeric one, they constitute
an attractive tool for merging neural networks and symbolic processing together.
3
In this research we have dealt with the Combinatorial Neural Model (CNM), a variation of fuzzy neural networks
introduced to tackle classification tasks and the more general task of establishing a mapping between two fuzzy
multidimensional spaces [Machado & Rocha 91, 94 a, 94b]. CNM has been investigated and analyzed from various
perspectives, and appears to point towards answers in the context of the issues that are relevant to the field. A discussion
of its properties related to the consultation process, such as uncertainty management, inquiry of new information,
explanation of conclusions, and evidence censorship, can be found in [Machado & Rocha 93, 97]. or properties related to
the construction of networks from expert knowledge and the refinement of their knowledge by means of examples, the
reader is referred to [Machado & Rocha 94]. CNM has also been combined with semantic networks for the construction
of expert systems [Kandel & Lanholz 92] [Rocha 92], and employed successfully in several problems of medical
diagnosis [Leão et al 93,95,96] and credit scoring [Pree et al 97].
We have focused on the various types of learning associated with CNM networks, namely the learning of membership
functions, of the network topology, and of connection weights, with strong emphasis on the latter. Our main contribution
has been the introduction of new learning algorithms, and a comparative study of their performance on a large-scale
classification problem from the field of image analysis. In addition to various results on the relative merits of the new
algorithms, we have shown that they outperform networks trained by error back-propagation on the same problem.
Details on theoretical and experimental results are to be found in [Machado et al 98].
A CNM network comprises n+m+1 neurons, arranged into an input layer with n neurons, a hidden layer with m
neurons, and a single-neuron output layer. Each input-layer neuron has state in [0,1], and connects forward to a (possibly
empty) subset of neurons of the hidden layer. Each of these connections is also called a synapse. Each hidden-layer
neuron has state given by the fuzzy AND of the states of at least one input-layer neuron, with provisions for both
excitatory and inhibitory synapses. The connectivity of a hidden-layer neuron is the number of input-layer neurons that
connect forward to it. The output-layer neuron has state given by the fuzzy OR of the weighted states of hidden-layer
neurons. Each hidden-layer neuron is thought of as being associated with an input-to-output pathway; the weight of the
corresponding connection to the output-layer neuron is referred to as the weight of that pathway.
In classification problems, a feature of the pattern under analysis can be either categorical (i.e., the feature can only
take on a finite number of values) or, more generally, numeric (i.e., the feature may take on any value from a certain real
domain). Inputing a categorical feature to a CNM network requires at most one input-layer neuron for each of the
feature's possible values (it may require less, depending, for example, on whether and how inhibitory synapses are used).
The state of each such neuron is either 0 or 1.
A numeric feature, on the other hand, cannot in general be input to the network directly, but rather has to be
approached by first partitioning its possible values into, say, "low," "medium," and "high" values. The way to input this
feature to the network is then to compute its degree of membership in each of the appropriate fuzzy sets (three, in our
example), and then use such degrees as inputs to the network. What this takes is one input-layer neuron for each fuzzy
set, each neuron having as activation function the desired membership function.
From the standpoint of a specific application, the question of interest is whether CNM networks can be guaranteed to
provide satisfactory approximations of functions in [0,1] whose domain is that of the application. What we have shown is
that CNM networks without inhibitory synapses are universal approximators. The role of inhibitory synapses is to allow
fewer input-layer neurons for some features, thereby potentially simplifying the network. CNM networks are trained by
supervised learning procedures along three different learning dimensions: learning of the membership functions, learning
of the network topology, and learning of the pathway weights. Learning along the former two dimensions is to be
regarded as a sort of preprocessing to the learning along the latter dimension. Also, depending on the problem at hand,
not every one of these learning dimensions has to be explored. For example, the learning of membership functions is only
meaningful when the patterns of the problem domain are described by numeric features. Similarly, if the topology of the
network is known, or if one may assume that the connectivity of pathways is limited by some upper bound, then one
proceeds directly to pathway-weight learning. We have in this research concentrated on the learning of pathway weights.
We have introduced six learning algorithms for CNM networks, called SS, RSS, DWD, H, IRP, and SRP. Of these,
the first four are based on subgradient procedures of nondifferentiable optimization, while the two others employ the
concepts of rewards and punishments. RSS is derived from SS in much the same way as RProp is derived from standard
error back-propagation. While RSS is conceptually equivalent to SS, the simplification it embodies accounts for
significant performance improvements over that algorithm. The other two subgradient-based algorithms, DWD and H,
are also conceptual descendants of SS, which has then provided the foundations for all the subgradient-based algorithms
we have investigated. SRP is the one-step version of IRP, and is used mostly as a start-up iteration that can precede any
of the other algorithms.
The problem we have used to illustrate the performance of our CNM learning algorithms is the interpretation of
Landsat-5 satellite images, aimed at monitoring the deforestation in the Amazon region [Barbosa et al 94]. This is a
large-scale problem, and automating its solution can be very beneficial in a variety of contexts related to environmental
issues. In addition, the choice of this problem for our performance evaluation is convenient because the data we have
employed is the same used previously to evaluate systems based on error back-propagation and RProp.
Our overall approach proceeds in two phases. The first phase, contrasting with the usual pixel-by-pixel approaches,
consists of the segmentation of the image into spectrally homogeneous portions (called segments, generated by a regiongrowing technique). In the second phase, each segment is then classified into one or more of the thematic categories
Water (W), Deforested area (D), Forest (F), Savanna (S), Cloud (C), and Shadow (Sh). The categories W, D, F, and S are
referred to as the basic categories. The remaining categories (C and Sh), called interfering categories, are needed to
account for the interference caused by clouds and shaded areas in the classification process.
In order to allow the treatment of complex situations, the natural way to approach the classification of segments is to
follow a fuzzy-logic approach, that is, allow a segment to belong to multiple categories with partial degrees of
membership. Such complex situations include transition phenomena (e.g., the regeneration of a deforested area) and
cases of multiple classification (e.g., a savanna overcast by clouds).
The architecture of our classifying system is centered on the construction of a specific CNM network for each
thematic category. The input to all CNM networks are the numeric features of the image segment to be classified. Eightytwo numeric features are used for each image segment, being of spectral, geometric and textural nature. For each segment
there are six spectral features, 22 geometric features, and 54 textural features.
The first layer of each CNM network performs the "fuzzification" of these numeric features by means of a learning
method that allows the design of specific fuzzy partitions of each feature domain for each thematic category. Note that
the fuzzy partition of the feature domain is dependent upon the category under consideration, and then different networks
may have different membership functions for the same feature. The method we have employed for the learning of
membership functions is based on the determination of singularities (minima, maxima, and null plateaus) of an entropylike function of the feature under consideration. By employing a measure of "relevance," these singularities are ranked
and only the most "relevant" give rise to a membership function. The eventual usefulness of each membership function
thus created depends on how the network's topology is obtained and on the learning of weights. Both processes allow for
the elimination of pathways, so that the corresponding membership functions may end up being done away with
completely. In several cases, this is what happens to membership functions which are too narrow.
Approximately 17,000 segments were used in our experiments, divided into approximately two-thirds to constitute the
training set and the remaining one-third to constitute the test set. These segments were obtained from five images of the
Amazon region, chosen for containing a great variety of the possible situations, including many of great complexity. The
five images come from the states of Acre, Mato Grosso, Para, and Rondonia, and from around Tucuruidam.
All segments were previously classified by a photointerpreter into the six categories, using (for simplicity) only five
values (0, 0.25, 0.5, 0.75, 1) for the degrees of membership of a segment in each of the categories. This classification
was subject to the constraints that the degrees of membership of a segment in the basic categories must add up to one,
and that its degrees of membership in all six categories must add up to some number between one and two.
A very welcome by-product of the structure of CNM as a collection of independent neural networks (one per category)
is that each network can be trained separately. Not only is this attractive from a purely computational standpoint (smaller
networks can be trained faster, and the various neural networks can be spread out for training over a network of
workstations or the processors of a parallel-processing machine), but also it allows the training set to be especially
calibrated to provide the most efficient training for each module. Specifically, each CNM network can be trained on an
extension of the training set obtained for that network by the unbiased replication of some of its members [Leporace et al
94]. This replication is aimed at correcting the uneven distribution of examples concurring toward and against
establishing the membership in the category of the network being trained.
Because expert knowledge on image interpretation was not available, we had to adopt learning from scratch in this
study. What this means is that an initial CNM network had to be built for the learning algorithms to act upon. For the
determination of this initial network, we have confined ourselves to the following. The hidden layer of the initial network
is such that there exist pathways leading out of all possible combinations of the input-layer neurons that do not violate a
predefined upper bound on connectivity.
Our results in a selected subset of all experimentation possibilities suggest that, with the exception of IRP, all our
algorithms have a significant role to play. However, it has been virtually impossible to select one of them as the absolute
best in all cases. Instead, depending on the category at hand, all of RSS, DWD, and H have performed best. The direction
these conclusions point in is that of building systems of a hybrid nature. In such systems, each module is tuned to its
specific needs, and may be based on models and techniques altogether different from those employed in other modules.
The hybrid system we have built along these directions is referred to as CNM*, and comprises, for each of the six
categories of the image classification problem, the best network obtained for that category. This has resulted in three
networks trained by DWD, two by RSS, and one by H. CNM* has shown a superior performance when compared to
systems trained by standard error back-propagation, and has in general performed comparably to RProp. In addition, we
note incidentally that the training of CNM* took significantly less time than that of RProp, itself significantly faster than
error back-propagation.
CNM networks trained by our algorithms have revealed a surprising degree of robustness when low-weight pathways
are pruned off them. What results are smaller networks and possibly indications as to which input features are indeed
important (input neurons that get disconnected in the pruning process are needless).
2.1.4 MMKBANN and MMCIL2P Systems
[Hallack et al 98] combined ideas from KBNN systems, like KBANN and CIL2P, and fuzzy NN, like CNM,
developing KBNN using neurons with min/max activation functions trained with subgradient descent methods. They
were applied in the same molecular biology domain, the promoter problem, and also to the chess game, obtaining good
results. Their extension to the first-order logic case is presented in section 2.8.
5
2.1.5 Neuro-Fuzzy Hierarchical Quadtree Models
[Souza et al 97, 99] proposed and developed a new hybrid neuro-fuzzy model, named Hierarchical Neuro-Fuzzy
Quadtree Model (HNFQ), which uses a recursive partitioning of the input space, called quadtree. The quadtree
partitioning was inspired in the quadtree structures proposed by Finkel and Bentley in 1974 and has been widely used in
the area of manipulation and compression of images. The model consists of an architecture, its basic cell, and its
learning algorithm. The HNFQ system has been evaluated in two well know benchmarks applications, The Mackey-Glass
chaotic series forecast and the Sinc(x, y) function approximation, and in real applications.
2.1.6 Representing First-order Logic in Symetric Neural Networks
[Pinkas 91,92] defined a bi-directional mapping between propositional logic formulas and energy functions of
symmetric neural networks. He showed that determining whether a propositional logic formula is satisfiable is equivalent
to finding whether the global minimum of its associated energy function is equal to zero. He also defined how to
transform a first-order resolution-based proof of a formula (query Q) from a given set of formulas (knowledge base KB)
into a set of constraints described by a set of propositional logic formulas C. Then he showed that the satisfaction of C is
sound and complete with respect to a first-order resolution-based proof of Q from KB. Therefore, finding that the global
minimum of the energy function associated to C is equal to zero is sound and complete with respect to a first-order
resolution-based proof of Q from KB.
Pinkas did not implement his system. In this project, [Kilkerry et al 98, 99] pointed out some adjustments to C and
presented an implementation of the revised system. The implementation has a user friendly interface - it is very easy to
present the query to the user-specified knowledge base and also to customize the network for it visually displays the
auxiliary matrices of neurons described by the theory. Besides using the system as a massively parallel theorem prover
(see also [Lima 94, 96]), it can be applied in optimizations problems where the user can describe the problem
declaratively (through logical formulas), instead of having to specify the network’s energy function, and the system will
explain the reason for the given answer.
The simulators developed are also being converted to ANDORRA-PROLOG in order to run in the ANDORRA
parallel environment [Yang et al 1991]. This way, the simulators will also be used as testbed for different processallocation mechanisms and strategies [Santos Costa et al 1996,1997).
[Carvalho98b] introduced a synthesis method for binary Hopfield neural networks according to which learning is
viewed as an optimization problem. A theory for concept representation is developed and synthesis criteria, used to
define the optimization problem's objective function and constraints, are presented. Experimental results are provided
based on the use of simulated annealing to solve the optimization problem.
2.1.7 First Order Symbolic Theory Refinement Systems
Most Inductive Logic Programming (ILP) systems restrict their language to definite or semi-definite programs [Lavrac
and Dzeroski 94,Muggleton and de Raedt94]. The subsumption mechanism introduces a syntactic notion of generality
between definite clauses that can be used to prune the search space of hypotheses. These properties form the basis of
most ILP systems that induce definite programs and also play an important role in the computation of heuristics.
However, they do not hold for normal programs in general. In this project, [Fogel and Zaverucha 98] identified a subclass
of normal logic programs, called strict and call-consistent, that can be handled and proposed training sets for the topdown induction of these logic programs (the gain heuristic needed to be changed).
Recently, there has been increased interest in the multiple predicate learning problem resulting in a number of systems,
including MPL [DeRaedt et al 93]. [Fogel and Zaverucha 98] investigated the problem of inducing normal programs of
multiple predicates in the empirical ILP setting. They have also presented a top-down algorithm for multiple predicate
learning, called NMPL, which is based on MPL and improved it in the following ways: the sets of training examples are
iteratively constructed; the language of definite clauses has been extended to strict and call-consistent normal programs.
Finally, wthey discussed the cost of the MPL's refinement algorithm and presented theoretical and experimental results
showing that NMPL can be as effective as MPL and is computationally cheaper.
2.1.8 Towards Extending KBNN to First-order
[McCarthy 88] pointed out that neural networks applications have all the basic predicates unary and even applied to a
fixed object and a concept is a propositional function of these predicates. Therefore they cannot perform concept
description, but only discrimination. An example can be found in [Hinton 86].
Extending KBNN to first-order involves two well-known and unsolved problems on neural networks: the
representation of structured terms and their unification. Existing models like [Shastri and Ajjanagade 93], [Holldobler 93]
and [Pinkas 92] have in common the use of architectures and resources that hinders their ability to perform learning,
which is our main goal. [Shavlik 94] pointed out that first-order theory refinement with neural networks is an open
problem.
An alternative way to induce relational concepts is to transform the original first-order theory into a propositional one
using the LINUS system [Lavrac, Dzeroski 94], such that propositional symbolic learning algorithms like ASSISTANT
and NEWGEN are applied.
In this project, [Basilio et al 98] used neural networks as the propositional learning algorithm. This way, neural
networks can induce relational concepts descriptions in a subset of First-Order Logic, namely constrained DHDB
(Deductive Hierarchical Database) formalism [Lloyd 87]. Given a DHDB background knowledge and a set of training
examples (ground facts of the target predicate), LINUS transforms a first-order problem into an attribute-value form. The
possible applications of the background predicates on the arguments of the target relation are determined, taking into
account argument types. Each such application introduces a new attribute. The attribute-value tuples are generalizations
(relative to the given background knowledge) of the individual facts about the target relation. The system picks a target
ground predicate from the examples, one at a time, and applies its terms in all attributes. If each resulting ground
predicate can be proved by the DHDB, it is set to TRUE, otherwise it is set to FALSE. This information is sent to a
neural network, which generates an attribute-value hypothesis and apply a rule extraction algorithm. Finally, the
knowledge obtained is converted back to DHDB form. Various examples from Machine Learning literature described in
[Lavrac, Dzeroski 94] were used to analyze the performance of neural networks’ learning via LINUS: family
relationships, the concept of an arch, rules that govern card sequences and the description of robots. In all the
experiments, a standard multilayer feedforward backpropagation neural network of three layers has been used. It is
important to notice that any neural network architecture could be used.
Also in this project, [Hallack et al 98] proposed a hybrid connectionist-symbolic model of first-order theory
refinement called MinMax Connectionist Inductive Logic Programming (MMCILP) system. Basically:
 the AND and OR usually achieved in KBNN through the setting of wieghts and thresholds are replace by the
fuzzy AND (min) and OR (max) functions;
 the standard neuron (computational element) is replaced by a node with extended capabilities inspired by [Sun]
“discrete neuron“;
 each link has a function mapping complex messages (set of tuples) into comples messages, besides having an
associated weight;
 learning is performed by subgradient descent methods together with ILP techniques.
2.2 ANN Compilers and Parallelization, Evolutionary Design of Circuits, Genetic Algorithms and
Computational Neuroscience
2.2.1 Artificial Neural Networks Compilers and Silicon Compilers
One of the achievements of this work consisted of the development of compilers which, from the description of a
target problem in logical clauses, produce a specification of a digital circuit in VHDL (a Hardware Description
Language) able to process the corresponding inference in an asynchronous and very agile way. Artificial Neural Network
(ANN) models are used as intermediate languages, or virtual machines, in the translation process. Two compilers were
developed and each one can be described as a concatenation of two translators: (i) Logical clauses to ANN (LA)
translator, and; (ii) ANN to VHDL (AV) translator. The first LA translator implemented [Aiello et all, 1995a, b] is able to
compile sets of propositional clauses into custom ANN architectures based on Caianiello's model [Caianiello, 1989]. A
second LA translator was developed and is able to compile sets of first order logic clauses limited to problems that only
require local search [Lima 1992] into customized Hopfield ANN architectures. For each LA translator, a particular AV
translator was built [Ferreira and França, 1997 a, b)].
Once a VHDL specification is obtained, via the first or the second compiler, many existing Computer Aided
Electornic (CAE) tools, e.g., Synopsysª, Mentor Graphicsª, Cadenceª, etc, can be used to synthesize a digital circuit using
Field Programmable Gate Arrays (FPGAs) or Application-Specific Integrated Circuits (ASICs) as substract. In particular,
an ALTERAª FPGA development system is the first host of the circuits generated by our compilers. A next step is to try
the Xilinxª development system, based on a fine-grain (internal gate architecture) FPGA family, in order to compare
implementation costs.
2.2.3 Parallelization of the Backpropagation Learning Algorithm in Clusters of Workstations
[Melcíades et al 98] describes the parallelization of the algorithm of learning backpropagation in clusters of
workstations. The studied modes are: online, batch e block [Coetzee 96]. The experimental results show that the
parallelization in the batch mode is the best alternative.
The backpropagation algorithm is one of the most used methods for training Artificial Neural Networks (ANN)
[Haykin 91]. Unhappily in many situations, its training time becomes unacceptable. To try to avoid this drawback many
7
studies have been carried through with the objective of using parallel machines and/or clusters of computers to speed up
the involved calculations in the backpropagation algorithm [Chaves et al 95] [Resk 93].
As a matter fact clusters of workstations and PCs have also shown its effectiveness in the parallelization of other in
Science and Engineering [Pfister 98]. Nowadays clusters of computers represent an alternative sufficiently consolidated
[Pfister 98]. The programming of these clusters can be carried through with some tools like PVM (Parallel Virtual
Machine) [PVM3] and MPI (Message Passing Interface) [Pacheco 96]. With these tools is possible to use a virtual
parallel environment of a network of heterogeneous machines. The algorithm can be discomposed in several parts that
will be executed in different processors simultaneously.
The parallelization of the algorithm of backpropagation for ANN allowed to get an acceleration of calculations
satisfactory, especially in batch mode. The parallelization of the algorithm in block mode showed that this mode has a
worse performance, but also allows to get acceleration of calculations.
The parallelization in this work is being used in computational intensive problems, such as the described classification
of fingerprints.
2.2.4 Performance and Convergence of Genetic Algorithms
[Costa 97] investigated the performance and convergence of Genetic Algorithms (GAs): solutions and related
measures. The work presented a discussion on the most important questions about the mathematical foundations of GAs:
the objections pointed by Thorton (1995) about the coherence of the association between schema and building blocks; the
contradiction between the Schema Theorem and the Price Theorem, mentioned by Altemberg (1994); and the new ideas
raised by the variance of fitness concept. The work also discussed aspects related to Deception and Epistasis: the Walsh
coeficients, deceptive functions such as 6-bits fully and 3-bits deceptive function by Deb, deceptive but easy and nondeceptive but hard by Greffenstette, F2 and F3 by Whitley, NK functions and Royal Road functions. Alternative GAs
such as: Messy GA (Goldberg), Structured GA (Dasgupta), Augmenated GA (Grefenstette), and GA’s of Paredis were
rewied. Finally, the work investigated two classes of convergence measures: probabilistic and based on landscapes, with
special attention to the FDC (Fitness Distance Correlation) proposed by Forrest (1995), which is evaluated on a set of
mathematical functions. The environment used was the GENEsYs 1.0 (Thomas Back), which has been adapted and
expanded to supply the purposes of this work.
2.2.4 Computational Neuroscience
2.2.4.1 Adaptive Resonance Theory in the Search of the Neuropsychological Patterns of
Schizophrenia
Neuropsychology is a clinical methodology which, through patient testing, assess cognitive performance leading to the
localization of brain dysfunctional areas. This methodology shows itself useful in clinical research because a
neurofunctional substrate is supposed to exist embedded in every psychiatric disorder. Schizophrenia is a chronic and
highly debilitating mental disease about which little is known except its social, familiar and personal costs. There seems
to exist consistent functional patterns in the presentation of this disorder. These patterns define at least three symptom
clusters or dimensions of schizophrenic patients which are the psychotic, the negative, and the disorganized. They must
be well interpreted as the concept of psychiatric syndrome denotes a pattern of correlated symptoms which the expert
qualitatively defines as a cluster. In this respect, artificial neural networks, mainly those that, like adaptive resonance
theory, are capable of unsupervisioned learning, indicate a highly promising direction for clustering the
neuropsychological dimensions of schizophrenia. The parameters for the neural networks cluster analysis were chosen
based on neurobiological studies of the disease, trying to highlight the damaged brain circuits. The correlation between
the neuropsychological patterns defined by the neural network and the known psychopathology was determined,
bringing interesting original results to the understanding of the exixting sub-types of this disorder [Carvalho97,
Carvalho98a]. This research was awarded by the World Biological Psychiatry Society with the Latin America
Biological Psychiatry Prize of 1998.
2.2.4.2 A Model for Autism
Autism is a mental disorder characterized by deficits in socialization, communication, and imagination. Along with the
deficits, autistic children may have savant skills (“islets of ability”) of unknown origin. The present article proposes a
neurobiological model for the autism. A neural network capable of defining neural maps simulates the process of
neurodevelopment. The computer simulations hint that brain regions responsible for the formation of higher level
representations are impaired in autistic patients. The lack of this integrated representation of the world would result in
the peculiar cognitive deficits of socialization, communication, and imagination and could also explain some “islets of
abilities”. The neuronal model is based on plausible biological findings and on recently developed cognitive theories of
autism. Close relations are established between the computational properties of the neural network model and the
cognitive theory of autism denominated “weak central coherence”, bringing some insight to the understanding of the
disorder [Carvalho99a].
2.2.4.3 A Model for Chronic Stress
This work defines a model for the neurobiological substrates of the chronic stress and its relation to depression. It is
based on the observation that there is an anatomical loop between two important pontine nuclei which release
norepinephrine and corticotropin-releasing factor. These two substances, deeply associated with the referred disorders,
have their action integrated by the mutually connecting nuclei. The loop forms a positive-feedback type circuit which
works as a local memory for stressful events, facilitating future actions in face of repetitive environmental challenges.
Under certain circumstances, the system there generated maintains a state of chronic neural hyperactivity,resulting in an
excessive release of norepinephrine and corticotropin-releasing factor, similar to what is observed in chronic stress and
depression. A mathematical model is developed, and some structural and physiological hypotheses are established and
supported by the experimental records found in the literature. Computer simulations, especially designed to incorporate
the mathematical equations and the model assumptions, are performed to show how the facilitatory effect and the chronic
hyperactivity state develop. The results of the model represent a contribution to the understanding of the neurobiological
etiology of the disorders of chronic stress and some forms of depression [Carvalho99b].
3. Applications
3.1 Electric Power Systems Diagnosis
The increasing complexity of Power Systems demands the development of sophisticated supervision, control and
automation techniques. Among the most commonly found applications are the faults detection and diagnosis systems
based on the processing of the target electric system's alarms.
Alarm processing in a large scale electric system requires the treatment of a great amount of information. When a
disturbance happens, a large number of isolated events occurs, making it difficult for the operator to recognize the cause
of this disturbance (make the diagnostic) and to determine the corrective actions that shall be taken.
In this project we analysed intelligent systems for alarm processing problem: two loosely coupled Hybrid System and
an extension to the C-IL2p (“Connectionist Inductive Learning and Logic Programming System”) that allows the
implementation of Extended Logic Programs in Neural Networks.
The hybrid systems developed consisted in a Neural Network module to pre-process the alarms, informing which
faults are more probable in the power system with that alarm configuration. The output is fed to:
 the Fuzzy Logic [Silva and Zebelum 96, Ribeiro et al 98] module to interpret the results (that means that from the
fault selection made by the Neural Network and from a simple rule set, the Fuzzy system returns a safer and more
precise diagnostic). In order to verify its validity, the intelligent system was subjected to an analysis from a
specialist in protection. During the tests the system correctly diagnosed 99% of the presented cases and is capable
of identifying simple and multiple faults, besides being able to manipulate noisy data. It is portable both to the
WINDOWS/DOS and the UNIX environments;
 an Expert system with similar objectives [Biondi et al 96, 97], closely related to [Silva et al 95, 96].
2
[Garcez et al 97] presented an extension of C-IL P system that allows the implementation of Extended Logic
Programs [Gelfond and Lifschitz 91] in Neural Networks. As a case example, the system was applied for fault diagnosis
of a simplified power system generation plant, obtaining good preliminary results results and an accuracy of 97% in
noisy multiple faults.
3.2 Electric Load Forecasting



Having a precise load-forecasting model is very important to the electric power system operation. The disposability
of load values estimations to the next hours or days to the system operator allows him to manage the generation
units, optimizing the actual power resources.
Several works relating computational intelligent techniques to load forecast have been published in recent years.
This project analyses the results obtained in the development of load forecasting system using different Artificial
Intelligence techniques.
[Ribeiro et al 98]developed a system that predicts hourly loads up to a weak ahead. A calendar of events is
introduced in the system, where special days (holidays, sportive events, etc) are inserted, which allows the system to
9





treat them in a special way, improving its performance. Its fast fuzzy rule generation made possible its training in
real time with the more recent data and allowed the kno wledge of the last trend of the considered series. This
characteristic together with the calendar of events made the system very flexible, fitting any utility very easily. The
procedure is applied to some utilities in Brazil using observations collected during several years. The results obtained
in these applications indicate that the proposed system is adequate to the matter. It is simple, matches to different
utilities very easily and the forecasts produced present small mean absolute percentage error (MAPE).
Another system using Fuzzy Logic has also been developed. We have implemented a Fuzzy Logic Forecasting
System using Matlab to predict load for the CEMIG company in 10-minutes interval, 144 steps ahead. In a fuzzy
systems for time series forecasting, the rule extraction is achieved from the numeric data. In this case, the own series
plays as the specialist. There are two basic methodologies for rule extraction: the data is used to define the fuzzy
sets, or the fuzzy sets are pre-assigned, with the data being late associated to these sets. We employed the second
methodology. Many tests were made varying the overlap ratio of the fuzzy sets in order to obtain the best results
(1.15% of mean absolute error).
[Teixeira et al 98, 99] present some results about the study of the partially recurrent neural networks applied in
hourly electric load forecasting with real data. In the present problem of load forecasting there is little information
from the output layer (it has only one output unit for the value of load which will be predicted). Therefore Elman
networks was chosen (instead of Jordan networks). Elman networks (EN’s) were applied to electric load forecasting,
using global and local models and were trained both by Backpropagation and Backpropagation through time
algorithms. For the local models the Neural Gas network was used as the classifier. Many networks architectures
were analyzed for the Global Models (Feedforward Networks and Elman Networks) and for the Local Models
(Neural Gas + Feedforward Networks, Neural Gas + Elman Networks and Recursive Neural Gas + Elman
Networks). The tests conducted showed better results for the local models than for the global models. The local
model architecture that performed better are the architectures that integrates Neural Gas or Recursive Neural Gas
with Elman’s neural network. The MAPE performance of these architectures were much better than the results
obtained by the other models investigated.
[Zebelum et al 97] developed the neural networks of software called NeuroPrev, which is used by 32 Brazilian
power electric companies for forecasting monthly electric load. The NeuroPrev system features an Artificial Neural
Network (ANN) modeled to perform load forecasting. The ANN employs the Backpropagation Algorithm for
network training and uses binary codification for the month as auxiliary ANN inputs. To evaluate training
performance, two statistics are provided graphically: the Mean Squared Error of the training process, and the
forecasting error for the last observed load data, which are not included in the training process.
[Ribeiro et al 97, 98] have also investigated the performance of ANN in Very Short Term (VST) load forecasting,
with prediction leading times of 10 minutes. The VST load forecasting system has been tested with real load data
from CEMIG Power Electric Co., Brazil. The systems involves: pre-processing, training and forecasting. Preprocessing the load series consists of two steps: preparing load data values for training and forecasting; and
interpolating missing values. The Backpropagation Algorithm has been used for training. The performance of the
VST load forecasting system has been assessed using a multi-step procedure, in which forecasted load values are fed
back as neural network inputs. A mean absolute percentage error of 1,17% has been achieved when predicting load
144 steps ahead (24 hours).
[Tito et al 99] have applied Bayesian Neural Networks to electric load forecasting. The Bayesian methods used were
the Gaussian Approximation and the Markov Chain Monte Carlo. Bayesian Neural Networks offer important
advantages over the backpropagation model such as the confidence interval which can be assigned to the predictions
generated by the network. The results of the Bayesian neural net were favorably compared with some statistical
techniques such as Box & Jenkins and Holt-Winterss and also with backpropagation neural networks.
3.3 Hybrid Statistics/Neural Models for Time Series
[Velloso 99] have developed a new class of non linear additive hybrid model for time series forecasting. In these
models the varying coefficients are modeled by a neural network and, both the conditional mean and variance are
modeled explicitly. This work demonstrated that the proposed models are universal approximators and can model
explicitly conditional variance. Gradient calculus and algorithms for weight estimation were also developed based on the
main estimation methods: least mean squares and maximum likelihood. These models have been tested with synthetic
and real series. The results obtained were superior or, at least, equivalent when compared with other methods.
[Soto 99] reviews the main algorithms and investigates the performance of Timed Neural Networks: TDNN (Time
Delay Neural Net), FIR (Finite-Response Impulse Response) Neural Net; Jordan and Elman Neural Nets. These
algorithms were implemented and assessed in two classes of applications: time series and voice filtering.
3.4 Overflow Control of Electric Generator Reservoir through Genetic Algorithm
[Ribeiro et al 98] have also assessed the performance of Genetic Algorithms (GA) in Overflow Control of Electric
Generator Reservoir. In this work we used GA to calculate the minimum volume of water to assign to a reservoir in order
to attend energy demand, avoiding overflow. This is basically a constrained problem which required a chromosome
decoder designed specifically to guarantee valid solutions.
3.5 A Theory Refinement Approach for the Design of Distributed Object Oriented Databases
Distributed Design involves making decisions on the fragmentation and placement of data across the sites of a
computer network [Oszu & Valduriez 91]. The first phase of the Distributed Design task in a top-down approach is the
fragmentation phase, which clusters in fragments the information accessed simultaneously by applications. Since
Distributed Design is a very complex task in the context of the OO data model [Ezeife & Barker 94, 95] [Karlapalem et
al 94] [Lima and Matoso 96] [Maier et al 94] [Savonnet et al 96], [Baião & Mattoso 97, 98] have developed in their
previous work an analysis algorithm to assist distributed designers in the fragmentation phase of OO databases. The
analysis algorithm decides the most adequate fragmentation technique to be applied in each class of the database schema,
based on some heuristics.
[Baião et al 98] proposed a theory refinement approach to the DDOODB problem. The objective of the work was to
uncover some previously unknown issues to be considered in the distributed design process. They represent the analysis
algorithm as a set of rules that will be used as background knowledge to obtain a new set of rules by applying the
Inductive Logic Programming (ILP) technique [Lavrac and Dzeroski 94], through the use of the FORTE system
implementation [Richards and Mooney 95]. This new set of rules will represent a revised analysis algorithm that will
propose fragmentation schemas with improved performance.
Some experiments have been done and have resulted in fragmentation schemas offering a high degree of parallelism
together with an important reduction of irrelevant data.
Although they have addressed the problem of class fragmentation in the DDOODB context, an important future work
is the use of the same inductive learning approach in other phases of the Distributed Design (such as the allocation
phase), as well as in the Database Design itself, possibly using another data models (relational or deductive). Also, the
resulting fragmentation schemas obtained from the revised algorithm may be applied to fragment the database that will
be used in the work of [Provost & Henessy 96], which proposes the use of distributed databases in order to scale-up data
mining algorithms.
3.6 Data Mining
[Baião et al 98] discussed some issues in Knowledge Discovery in Database.[Pereira 99] investigated the use of
Genetic Algorithms in KDD. The objective was to model, implement and assess the performance of a Genetic Algorithm
system, designed to evolve association rules from databases. Modelling the GA consisted in define the chromosome
representation and decoding, initialisation strategies, genetic operators, and evaluation functions. The evaluation
functions point to the best rules according to their accuracy and coverage. There have been tested 11 different evaluations
functions, some of them new, in various benchmark and real data mining applications. The tool system, called
RulerEvolver, which combines various resources to manipulate and mine in databases, presented competitive results,
when compared with other related works in data mining, using statistics and neural networks systems.
[Pereira 99] have also assessed the performance of Neural Network models in data mining. He employed various
models: backpropagation, Radio Basis Function and Kohonen, to classify database registers. In these applications, the
neural nets learned the characteristics of distinct clusters and were able to classify new registers correctly. These models
have been applied in real, marketing commercial databases.
3.7 A GIS Expert System for Land Evaluation
[Nehme et al 98] aims at developing a tool for Agricultural Planning based on an expert system for Land Evaluation.
The social and economical consequences of unsuitable land use is well known although, in some countries, the natural
potential is not considered causing the misuse of land potential and, as a result, economical and environmental
impairment.
A land use planning is essential during economical development. Without correct interventions in the distribution of
the development sectors, based on land planning, regional economic development problems will probably occur.
In many countries, economical development occurs without land use planning. This causes development problems
such as: imbalance growth, income disparity, decreases of water quality, to name some.
A Decision Support System for Land Evaluation will be used as a first step for land use planning. In this way, five
limiting factors can be initially considered: soil fertility, water excess/deficiency, erosion susceptibility and the required
degree of soil tillage. Besides those environmental aspects, some economical issues should be also considered,
expressing communities development degree, market tendency and so on.
According to FAO [1995 / Chapter 10- Agenda 21]: an integrated physical and land use planning and management is
an eminently practical way to move towards more effective and efficient use of the land and its natural resources. That
11
integration should take place at two levels, considering on the one hand all environmental, social and economic factors
and on the other all environmental and resources components together.
This integration can be obtained through the utilization of landscape ecological methods, through the definition of
landscape units able to represent this integrative approach. Landscape units will be used as elementary portions providing
information for the analytical process. Landscape units are geographical entities containing attributes which allow a
differentiation between them having a simultaneous dynamic connection.
This system encompasses two dimensions: the ecological, which reflects the limitations and potentialities of
sustainable use of natural resources, and the economical, which expresses the development of the communities that live
in the region or zone whilst exploiting it.
Geographic Information Systems (GIS) have been used for many applications as a tool for spatial analysis. Thus, a
GIS will be used to support the spatial aspects of the above mentioned knowledge-based system for land evaluation. This
information system will help decision making and can be a powerful tool for land use planning and environmental
management.
The Decision Support System and the methodology presented have been validated in Paty de Alferes region, Rio de
Janeiro, Brazil with good results in comparison with the traditional land evaluation methods and land adequacy analysis
performed by a soil scientists group of CNPS/Embrapa (National Soil Research Center of Brazilian Agricultural
Research Corporation). The next step is to implement fuzzy algorithms to the Expert System in order to make it
possible to measure costs and establish risk analysis associated with the decision making. For this purpose we intend to
use the methodology proposed by BOGRADI [1].
The fuzzy reliability measurement can be used to compare alternatives decisions when the consequence of failure is
considered.
3.8 Robot Control and Games
In this project we have used Genetic Algorithms, Neural Networks and Fuzzy Logic in the design of machine
learning systems and have applied them to robot control and games. In [Thomaz et al 99], a Genetic Algorithm (GA) is
used for designing behavioral controllers for a robot. A new GA path-planning approach employs the evolution of a
chromosome attitudes structure to control a simulated mobile robot, called Khepera. These attitudes define the basic
robot actions to reach a goal location, performing straight motion and avoiding obstacles. The GA fitness function,
employed to teach robot’s movements, was engineered to achieve this type of behavior in spite of any changes in
Khepera’s goals and environment. The results obtained demonstrate the controller’s adaptability, displaying nearoptimal paths in different configurations of the environment. The same robot simulator was used to evaluate a fuzzy
controller. [Brasil 99] implemented the reinforcement learning method, called TD(), to learn a game.
3.9 Neural Networks for Image Classification
Traditional image classification (or object recognition) systems are passive devices. Given an input image,
identification or recognition is obtained by processing the entire image. Typically, this fact implies that the whole image
must pre-processed, an extremelly costly procedure. This state of affairs should be contrasted with human vision which,
instead of passively dealing with "input images", involves the continuous interaction with the outside world through socalled saccadic eye movements (Yarbus, 1967). The central role of eye movements in human vision has inspired a range
of computational vision models to explore such properties, producing a number of "active vision" proposals (Aloimonos
et al., 1988; Ballard,1991; Tistarelli & Sandini, 1992). The present work utilizes basic principles of active vision
systems in the task of image interpretation and object recognition.
The main objective of the present component of the project were the following.
1- Develop image representation methods based on the properties of the primate visual system. In particular, design
filters inspired in the concentric receptive field organization found in the retina, as well as the oriented structure found
in the visual cortex (Neumann & Pessoa, 1994; Pessoa et al., 1995; Grossberg & Pessoa, 1988; Exel & Pessoa, 1988).
2- Determine the efficacy of this representation in the problems of image classification and object recognition. In
particular, investigate the performance of the methods developed in data bases extensively used for testing and
benchmarking.
3- Based on the representation chosen, develop new image classification and object recognition methods with
reduced computational costs. In particular, develop multi-scale pyramid and space-variant representations for the active
processing of images.
The results obtained were the following:
1- Development of an active vision system for object recognition. In this system, the image in not treated as a whole,
but its regions are visited sequentially. Each region considered provides new image pixels that are accrued to the input
feature vector that is considered by a Fuzzy ART neural network for image coding. The present development introduced
new attentional strategies, i.e., strategies of how to sequentially visit image regions such that they are optimally chosen
for recognition (i.e., they help disambiguate the recognition process the most). See Exel and Pessoa (1998); Exel
(1999).
2- The active vision object recognition system was extended to deal with multiple-object scenes, or images. In this
case the input to the system was not an image with only one object (e.g., a face image), but a an image containing a host
of objects in arbitrary positions (e.g., a collection of faces). The active vision philosophy adopted in (1) above was
maintained for the scene exploration system. In this system, objects belonging to the image (e.g., a person) are marked
(or tagged) by mechanisms that determine "interesting regions" or regions that contain information that should be
investigated in detail. These canditate regions are by their turn treated as a single object and investigated in detail by the
single-object system developed in (1). See Pessoa et al. (1998a,b).
3.10 Evolutionary Methodologies Applied to the Design of Electronic Circuits
[Zebelum et al 99] presented a major investigation in a novel research area called Evolutionary Electronics (EE). EE
extends the concepts of Genetic Algorithms to the evolution of electronic circuits. The main idea behind EE is that each
possible electronic circuit can be represented as an individual or a chromosome of an evolutionary process, which
performs standard genetic operations over the circuits, such as selection, crossover and mutation. This research work
comprises genetic algorithms modeling, the proposal of novel evolutionary methodologies and a large number of case
studies: Variable length representation (VLR) Evolutionary Algorithms; Memory based genetic algorithms; New
crossover and mutation operators for VLR; Search space partitioning methods; Multi-Objective optimization
methodology baesd on Neural Networks; Electronic filter design; Synthesis of Low Power operational amplifiers;
Analog electronic circuits design; Design and optimization of logic gates; Micro power CMOS analog cells; Synthesis
and optimization of combinational and sequential digital circuits; Switched Capacitors circuits design.
4. Conclusion
The ICOM project have developed and implemented several computational intelligent systems and applied them in
real world problems. We have organized the First Brazilian School on Machine Learninng and Knowledge Discovery in
Databases (MLKKDD’98) (www.cos.ufrj.br/~mlkdd) and two workshops (see [França 97]). We have formed 19 MSc
and 5 PhD students and several researchers that worked on the project. We have published 100 papers: 1 book, 7 book
chapters, 11 papers in international journals, 47 in international conferences and 34 in Brazilian conferences.
Acknowledgments
This project was funded by the Brazilian Research Council CNPq/ProTeM-CC. Some authors are partially financially
supported by CNPq research productivity grants. We also would like to thank Ernesto Burattini, Saso Dzeroski, Alex de
Freitas, Massimo de Gregori, Steffen Holldobler, Raymond Mooney, Foster Provost, Jude Shavlik and Ah-Hwee Tan.
References
Aloimonos, J., Bandopadhay, A., & Weiss, I. (1988). Active vision. International Journal of Computer Vision, 1, 333-356.
Aiello, A., E. Burattini and G. Tamburrini, 1995a. Purely Neural, Rule-Based Diagnostic Systems. I. Production Rules. International
Journal of Intelligent Systems, 10, 735-749.
Aiello, A., E. Burattini and G. Tamburrini,1995b. Purely Neural, Rule-Based Diagnostic Systems. II. Uncertain Reasoning.
International Journal of Intelligent Systems, 10, 751-769.
R. Andrews, J. Diederich and A. B. Tickle; “A Survey and Critique of Techniques for Extracting Rules from Trained Artificial
Neural Networks”; Knowledge-based Systems; Vol. 8, no 6; 1995.
K. R. Apt and N. Bol; “Logic Programming and Negation: A Survey”, Journal of Logic Programming 19-20, pp.9-71, 1994.
BAIÃO, F., MATTOSO, M., 1998, “A Mixed Fragmentation Algorithm for Distributed Object Oriented Databases”, Proceedings of
the International Conference on Computing and Information, Winnipeg, Canada, June
Ballard, D. (1991). Animate vision. Artificial Intelligence, 48, 57-86.
V. C. Barbosa; “Massively Parallel Models of Computation”; Ellis Horwood Series in Artificial Intelligence; 1993.
R. Blasig; “GDS: Gradient Descent Generation of Symbolic Classification Rules”; Advances in Neural Information Processing
System, Vol. 6; 1994.
Buckley, J. J., and E. Eslami, "Fuzzy neural nets: capabilities," in W. Pedrycz (Ed.),Fuzzy Modelling: Paradigms and Practice,
Kluwer, Boston, MA, 167-183, 1996.
Buckley, J. J., and Y. Hayashi,"Fuzzy neural networks: a survey," Fuzzy Sets andSystems 66 (1994), 1-13.
Buckley, J. J., and Y. Hayashi, "Neural nets for fuzzy systems, "Fuzzy Sets and Systems 71 (1995), 265-276.
Caianiello E., 1989. A theory of neural networks. In: I. Aleksander (ed), Neural Computing Architectures: The Design of Brain-like
Machines. North Oxford Academic Publishers, 8-25.
*Carvalho L A V., Ferreira, N C, Fiszman, A., A Neurocomputational Model for Autism, Submitted to Psychophysiology
13
*Carvalho, L A V, Melo, A C D, A Neurocomputational Model for Chronic Stress, Submitted to Mathematical and Computer
Modeling.
N. Chaves, P.Martins e C. Fernando. Um Estudo de Paralelismo em Redes Backpropagation. II Simpósio Brasileiro de Redes Neurais, 1995.
L. Coetzee. Parallel Approaches to Training Feedforward Neural Nets. Faculty of Engineering, University of Pretoria, 1996.
*Rodrigo M. L. A. Costa, Estudo sobre o Desempenho de Algoritmos Genéticos (A Study on the Performance and Convergence of
Genetic Algorithms), Departamento de Engenharia Elétrica, PUC-Rio, March 1997.
De Raedt, L., Lavrac, N. and Dzeroski, S., Multiple Predicate Learning, in: Proceedings of the 13th International Joint Conference on
Artificial Intelligence, Morgan Kaufman, 1993, pp. 1037-1042.
EZEIFE, C., BARKER, K., 1994, Vertical Class Fragmentation in a Distributed Object Based System, Technical Report 94-03,
Department of Computer Science, University of Manitoba, Canada
EZEIFE, C., BARKER, K., 1995, "A Comprehensive Approach to Horizontal Class Fragmentation in a Distributed Object Based
System”, Distributed and Parallel Databases, vol. 3, n° 3, pp. 247-272
M. Fischer e J. Dongarra. Another Architecture: PVM on Windows 95/NT, 1994.
*França, F.M.G. (Editor), Proceedings of the Workshop in Computational Intelligence: Projects ICOM e IPAC, Protem III-CC,
CNPq, Programa de Engenharia de Sistemas e Computação - COPPE/UFRJ, Technical Report ES-450/97, september 1997.
M. Gelfond and V. Lifschitz; “The Stable Model Semantics for Logic Programming”; 5 th Logic Programming Symposium, MIT
Press; 1988.
M. Gelfond and V. Lifschitz; “Classical Negation in Logic Programs and Disjunctive Databases”; New Generation Computing, Vol.
9, Springer-Verlag; 1991.
Gupta, M. M., and D. H. Rao, "On the principles of fuzzy neural networks, "Fuzzy Sets and Systems 61 (1994), 1-18.
S. Haykin. Neural Networks. A Comprehensive Foundation. Macmilian College Publishing Company, 1991
K. Hwang. Advanced Computer Archtecture Parallelism-Scalability-Programmability. Mc Graw Hill, 199
J. Hertz, A. Krogh and R. G. Palmer; “Introduction to the Theory of Neural Computation”; Santa Fe Institute, Studies in the Science
of Complexity, Ed. Addison-Wesley Publishing Company; 1991.
M. Hilario; “An Overview of Strategies for Neurosymbolic Integration”; Connectionist-Symbolic Integration: from Unified to
Hybrid Approaches - IJCAI 95; 1995.
S. Holldobler; “Automated Inferencing and Connectionist Models”; Postdoctoral Thesis, Intellektik, Informatik, TH Darmstadt;
1993.
S. Holldobler and Y. Kalinke; “Toward a New Massively Parallel Computational Model for Logic Programming”; Workshop on
Combining Symbolic and Connectionist Processing, ECAI 94; 1994.
M. I. Jordan; “Attractor Dynamics and Parallelisms in a Connectionist Sequential Machine”; Eighth Annual Conference of the
Cognitive Science Society, pp. 531-546; 1986.
Kandel, A., and G. Langholz, Hybrid Architectures for Intelligent Systems, CRC Press, Boca Raton, FL, 1992.
KARLAPALEM, K., NAVATHE, S., MORSI, M., 1994, “Issues in Distribution Design of Object-Oriented Databases”. In: Özsu, M.
et. al (eds), Distributed Object Management, Morgan Kaufman Publishers References
Keller, J. M., R. Krishnapuram, and F. C. Rhee, "Evidence aggregation networks for fuzzy logic inference," IEEE Trans. on Neural
Networks 3 (1992), 761-769.
B. Kruatrachue e T. Lewis. Gran size Determination for Parallel Processing. IEEE Software, New York, 1998.
N. Lavrac and S. Dzeroski; “Inductive Logic Programming: Techniques and Applications”; Ellis Horwood Series in Artificial
Intelligence; 1994.
P. Leray, P. Gallinari e E. Didelet. A Neural Network Modular Architecture for Network Traffic Management. CESA, Lille, 1994.
LIMA, F. MATTOSO, M., 1996, “Performance Evaluation of Distribution in OODBMS: a Case Study with O2” In: Proc IX Int.
Conf on Parallel & Distributed Computing Systems, France, pp.720-726
LIMA, P. M. V., 1992. Logical Abduction and Prediction of Unit Clauses in Symmetric Hopfield Networks. In: I. Aleksander and J.
Taylor (eds.), Artificial Neural Networks (ICANN-92), Elsevier Science Publishers, 1, 721-725.
LIMA, P. M. V., 1994. Resolution-Based Inference with Artificial Neural Networks. presented at the Workshop on Logic and
Reasoning with Neural Networks at the International Conference on Logic Programming’94 (ICLP-94), Santa Margherita.
Workshop. Proceedings can be retrieved at ftp.site: ftp.di.unipi.it, dir: pub/Papers/perso/ICLP-WS.
Liporace, F. S., R. J. Machado, and V. C. Barbosa, Equalization of the training set for back-propagation networks applied to
classification systems," Proc. of the First Brazilian Congress on Neural Networks, 99-103, 1994.
J. W. Lloyd; “Foundations of Logic Programming”; Springer - Verlag; 1987.
Machado, R. J., and A. F. Rocha, "A method for incremental learning in fuzzy neural networks," Proc. of the First Brazilian Symp.
on Neural Networks, 105-110, 1994.
Machado, R. J., and A. F. Rocha, "Integrating symbolic and neural methods for building intelligent systems," Proc. of the Int. Symp.
on Integrating Knowledge and Neural Heuristics, 283-292, 1994.
Machado, R. J., and A. F. Rocha, "Inference, inquiry and explanation in expert systems by means of fuzzy neural networks," Proc. of
the Second IEEE Int. Conf. on Fuzzy Systems, 351-356, 1993.
Mahoney, J. and Mooney, R.1993. Combining Connectionist and Symbolic Learning Methods to refine Certainty-factors Rule-based
Systems. Connection Science 5 (special issue on architectures for integrating neural and symbolic processing) pp.339-364.
MAIER, D., GRAEFE, G., SHAPIRO, L. et al., 1994, “Issues in Distributed Object Assembly”. In: Özsu, M., Dayal, U., Valduriez,
P. (eds), Distributed Object Management, Morgan Kaufmann Publishers
J. McCarthy, “Epistemological challenges for connectionism”, Behavior and Brain Sciences, vol. 11, n. 1; pp. 44; 1988.
M. Minsky; “Logical versus Analogical, Symbolic versus Connectionist, Neat versus Scruffy”; AI Magazine, Vol. 12, n o 2; 1991.
MITCHELL, T., 1997, Machine Learning, McGraw-Hill Companies, Inc
S. Muggleton and L. Raedt; “Inductive Logic Programming: Theory and Methods”; The Journal of Logic Programming; 1994.
14
Neumann, H., and Pessoa, L. (1994). A simple cell model with multiple spatial frequency selectivity and linear/non-linear response
properties. Proceedings of the World Congress on Neural Networks
ÖZSU, M., VALDURIEZ, P., 1991, Principles of Distributed Database Systems, New Jersey, Prentice-Hall
P. Pacheco. Parallel Programming with MPI. Morgan Kaufmamm, 1996.
M. Pazzani and D. Kibler; “The Utility of Knowledge in Inductive Learning”; Machine Learning, Vol. 9, pp. 57-94; 1992.
Pedrycz, W., "Fuzzy neural networks and neurocomputations," Fuzzy Sets and Systems 56 (1993), 1-28.
Pedrycz, W., and A. F. Rocha, "Fuzzy-set based models of neurons and knowledge-based networks," IEEE Trans. on Fuzzy Systems
1 (1993), 254-266.
G. F. Pfister. In Search of Clusters. Prentice Hall, 1998.
G. Pinkas; “Energy Minimization and the Satisfiability of Propositional Calculus”; Neural Computation, Vol. 3, pp.282-291; 1991.
G. Pinkas; “Logical Inference in Symmetric Connectionist Networks”; Doctoral Thesis, Sever Institute of Technology, Washington
University; 1992.
PROVOST, F., HENNESSY, D., 1996, “Scaling-Up: Distributed Machine Learning with Cooperation”, In: Proceedings of the
Thirteenth National Conference on Artificial Intelligence (AAAI '96), Portland, Oregon
PVM 3 Users Guide and Reference Manual. Oak Laboratories. Tennessee USA, 1994.
T. Resk. Distributed Simulation of Synchronous Neural Networks. Invited Article, Neural Network, Vol 7, No 1, USA., 1994.
T. Resk. Mapping and Parallel Simulation of Synchronous Neural Network on Multiprocessors. Procedings of Euromicro Conference. Barcelona,
1993.
RICHARDS, B. L., MOONEY, R. J., 1995, "Refinement of First-Order Horn-Clause Domain Theories", Machine Learning, vol. 19,
n° 2, pp. 95-131
SANTOS COSTA, V., R. BIANCHINI and I. C. DUTRA, 1996. Evaliuating the impact of coherence protocols on parallel logic
programming systems.Proceedings of the 5th EUROMICRO Workshop on Parallel and Distributed Processing, 376-381.
J. Shavlik; “Combining Symbolic and Neural Learning”; Machine Leaning, Vol. 14, no 2, pp. 321-331; 1994.
*Flávio Joaquim de Souza, Marley M. R. Vellasco, Marco A. C. Pacheco, “Hierarchical Neuro-Fuzzy Quadtree Models”, to be
submitted to the Fuzzy Sets and Systems journal.
Tan, Ah-Hwee 1997. Cascade ARTMAP: Integrating Neural Computation and Symbolic Knowledge Processing. IEEE Transactions
on Neural Networks, Vol. 8, No. 2 (March 1997), pp. 237-250.
S. B. Thrun, J. Bala, E. Bloedorn, I. Bratko, B. Cestnik, J. Cheng, K. De Jong, S. Dzeroski, S. E. Fahlman, D. Fisher, R. Haumann,
K. Kaufman, S. Keller, I. Kononenko, J. Kreuziger, R. S. Michalski, T. Mitchell, P. Pachowicz, Y. Reich, H. Vafaie, K. Van de
Welde, W. Wenzel, J. Wnek and J. Zhang; “The MONK’s Problem: A Performance Comparison of Different Learning
Algorithms”; Technical Report, Carnegie Mellon University; 1991.
Tistarelli, M., & Sandini, G. (1992). Dynamic aspects in active vision. Computer Vision, Graphics, and Image Processing, 56, 108129.
Tito, E., Zaverucha, G., Vellasco, M., Pacheco, M.1999. Applying Bayesian Neural Networks to Electric Load Forecasting.
Submitted.
G. Towell and J. W. Shavlik; “The Extraction of Refined Rules From Knowledge Based Neural Networks”; Machine Learning, Vol.
131; 1993.
G. Towell and J. W. Shavlik; “Knowledge-Based Artificial Neural Networks”; Artificial Intelligence, Vol. 70; 1994.
(WCNN-94), San Diego, Vol. IV, IV-290-298.
YANG, R., V. SANTOS COSTA,and D. H. D. WARREN, 1991. The Andorra I Engine: A parallel implementation of the Basic
Andorra model. Proceedings of the Eighth International Conference on Logic Programmi ng (ICLP'91), 58-67.
Yarbus, (1967). Eye movements and vision. New York: Plenum.
SAVONNET, M., TERRASSE, M., YÉTONGNON, K., 1996, “Using Structural Schema Information as Heuristics for Horizontal
Fragmentation of Object Classes in Distributed OODB”, In: Proc IX Int. Conf on Parallel & Distributed Computing Systems,
France, pp. 732-737
*R.S. Zebulum, M.A.C. Pacheco, M.M.B.R. Vellasco, A Novel Multi-Objective Optimisation Methodology Applied to Synthesis of
CMOS Operational Amplifiers, submitted to the Journal of Solid-State Devices and Circuits, Microelectronics Society –
SBMICRO 1999.
*R.S. Zebulum, M.A.C. Pacheco, M.M.B.R. Vellasco, Variable Length Representation in Evolutionary Electronics, submitted to the
Evolutionary Computation Journal, MIT Press, 1999.
Publications From The ICOM Project
Books : 01
1.
Braga, A. P., Carvalho, A . P. L. e Ludermir, T. B.(1998) Fundamentos de Redes Neurais. Livro publicado na XII Escola de
Computação. Rio de Janeiro, 13-17 de julho de 1998. Revised version published by Livros Técnicos e Científicos (LTC), 1999.
Papers in International Journals: 11
2.
C.R.H. Barbosa, A.C. Bruno, M.M.B.R. Vellasco, M.A.C. Pacheco, J.P. Wikswo Jr., A. P. Ewing, C.S. Camerini, Automation
of SQUID Nondestructive Evaluation of Steel Plates by Neural Networks, IEEE Transactions on Applied Superconductivity,
IEEE Inc. 1999.
15
3.
Carvalho L. A. V., "Modelling Distributed Concept Representation in Hopfield Neural Networks", Mathematical and
Computer Modelling, accepted for publication.
4. Garcez, A. S., Zaverucha, G. 1999. A Connectionist Inductive Learning and Logic Programming System. Accepted in Applied
Intelligence Journal, Kluwer, special volume on Neural Networks and Structured Knowledge.
5. Grossberg, S., & Pessoa, L. (1998). Texture segregation, surface representation, and figure-ground separation. Vision Research,
38, 2657-2684.
6. Ludermir, T.B., Carvalho, A.P.L, Braga, A.P. e de Souto, M. (1998) Weightless Neural Models: a review of current and past
works. Neural Computing Surveys, vol 2, p 41-61. ISBN:1093-7609.
7. Ricardo J. Machado e Armando F. da Rocha, Inference, Inquiry, Evidence Censorship, and Explanation in Connectionist Expert
Systems, IEEE Transactions on Fuzzy Systems, Vol. 5, N. 03, pp. 443-459, agosto de 1997.
8. Ricardo J. Machado, Valmir C. Barbosa, and Paulo A. Neves, "Learning in the combinatorial neural model," IEEE Trans. on
Neural Networks, 9 (1998), 831-847.
9. Nobre, C.N., Braga, A.P., Carvalho, A.P.L. e Ludermir, T.B. (1999) Rule extraction: A comparison between classic and
conexionist methods. To appear in the International Journal of Neural Systems.
10. Pessoa, L., Mingolla, E., and Neumann, H. (1995). A contrast- and luminance-driven multiscale network model of brightness
perception. Vision Research, 35, 2201-2223.
11. Pinto de Melo, C., Santos, F.L., Souza, B.B., Santos, M.S. e Ludermir, T.B. (1998) Polypyrrole Based Aroma Sensor.
12. F. Protti and G. Zaverucha, "Recognizing Classes of Logic Programs", Journal of the IGPL, Vol 5, Number 6, November 1997,
pp. 913-915.
Book Chapters:07
13. F.
Baiao, M. Mattoso and G. Zaverucha, " A Knowledge-Based Perspective of the Distributed Design of Object Oriented
databases", Ed. Nelson F. Ebecken, Data Mining, pp. 383-399, WIT Press, England, 1998.
14.
Hall Barbosa, A.C. Bruno, Marley Vellasco, Marco Pacheco e C.S. Camerini, Automation of
Electric Current Injection NDE by Neural Networks, Editors D. Thompson e D.E. Himenti, in
Review of Progress in Quantitative Nondestructive Evaluation, Vol. 16, pp. 789-795, Plenum
Press, NY, EUA, 1997
15. Braga, A.P., Carvalho, A.P.L. e Ludermir, T.B. (1999) Radial Basis Functions: Theory and Aplications. In Nonlinear Modelling
and Forescasting of High Frequency Financial and Economic Time Series, Soofi, A.S. and Cao, L. editors, Kluver.
16. Carvalho, A.P.L., Braga, A.P., e Ludermir, T.B (1999) Input-output modeling of credit approval data-set using neural data
mining set. In Nonlinear Modelling and Forescasting of High Frequency Financial and Economic Time Series, Soofi, A.S. and
Cao, L. editors, Kluver.
17. De Souto, M.C.P., Ludermir, T.B. e De Oliveira, W.R. (1998) Synthesis of Probabilistic Automata in pRAM Neural Networksvol II ( com 1198 páginas), editado por Niklasson, L., Bodén, M., e Ziemke, T., pp. 603-608. Springer Verlag.
18. Ludermir, T.B., Braga, A.P., Nobre, C.N e Carvalho, A.P.L. (1998) Extracting Rules from Neural Networks: A Data Mining
Approach. No livro Data Mining editado por Nelson Ebecken. WIT Press, Computational Mechanics Publication.
Southhampton, England. Setembro de 1998, p. 303-314.
19. Santos, M.S., Ludermir, T.B., Santos, F.L., Pinto de Melo, C. e Gomes, J.E. (1998), Artificial Nose and Data Analysis using
Multi Layer Perceptron. No livro Data Mining editado por Nelson Ebecken. WIT Press, Computational Mechanics Publication.
Southhampton, England. Setembro de 1998, p. 251-264.
Papers in International Conferences: 47
20. Baião, F., Mattoso, M. and Zaverucha, G. 1998. Towards an Inductive Design of Distributed Object Oriented Databases. Proc.
of the Third IFCIS Conference on Cooperative Information Systems (COOPIS’98), IEEE Computer Society, New York, USA,
August 20-22, pp.188-197.
21. F. Baiao, M. Mattoso and G. Zaverucha, 1998"Issues in Knowledge Discovery in Databases, Proc. World Multiconference on
Systemics, Cybernetics and Informatics / Fourth International Conference on On Information Systems, Analysis and Synthesis,
Vol. 2 pp. 164-169. (SCI’98), Orlando, Florida, USA, July.
22. Barros, M.M., Souza, G.M. e Ludermir, T.B. (1998) Features extraction on boolean artificial neural networks. 3rd Internacional
Conference on Computational Intelligence and Neuroscience. Carolina do Norte, USA, 23-28 de outubro de 1998, p. 99-102,
Association For Intelligent Machinery.
23. Barros, M.M., Ludermir, T.B. e Valadares, J.F.L. (1999) Rule Extraction from boolean artificial neural networks. Aceito na
International Joint Conference on Neural Networks 1999, Washington (July 1999). INNS Press.
24. R. Basilio, G. Zaverucha and A. Garcez, 1998 " Inducing Relational Concepts with Neural Networks Via the LINUS System",
Proc. Fifth International Conference on Neural Information Processing (ICONIP’98), Vol. 3 pp. 1507-1510, 21-23 October,
Kitakyushu, Japan.
25. Luiz Biondi Neto, Marco Aurélio Pacheco, Marley M.B. R. Vellasco, Emmanuel P. Lopes Passos e Luis Chiganer, Hybrid
System for Detection and Diagnosis of Fault in Electrical Networks, XII ISCIS - The Twelfth International Symposium on
Computer and Information Sciences, Antalya, Turquia, 27-29 October 1997.
26. Luiz Biondi Neto, Marco Aurélio Pacheco, Marley M.B. R. Vellasco, Emmanuel P. Lopes Passos e Luis Chiganer, Hybrid
System for Detection and Diagnosis of Fault in Electrical Networks, XII ISCIS - The Twelfth International Symposium on
Computer and Information Sciences, Antalya, Turquia, 27-29 de outubro de 1997.
27. de Oliveira, W. R. (1996). Program Schemes over Continuous Algebras.Computability and Complexity in Analysis’ 96, Trier,
Alemanha, 1996.
16
28. de Oliveira, W. R. e de Barros, R. S. M. (1997). The Real Numbers in Z. Proceedings of the 2 nd BCS-FACS Norther in Formal
Methods Workshop, Ikley 1997, D.J. Duke and A.S. Evans (Eds). Eletronic Workshops in Computing:
http://rwic.springer.co.uk/Springer
29. De Souto, M. C. P., Ludermir, T. B. e De Oliveira, W. R. (1998) Synthesis of Probabilistic Automata in pRAM Neural
Networks. Nos Anais do ICANN98. Suécia, 2-4 de setembro de 1998.
30. Exel, S., & Pessoa, L. (1998). Attentive visual recognition. Proceedings of the International Conference on Pattern
Recognition, August 16-20th, Brisbane, Australia.
31. Fogel, L., Zaverucha, G. 1998. Normal Programs and Multiple Predicate Learning. International Conference on Inductive Logic
Programming (ILP’98), Lecture Notes in Artificial Intelligence (LNAI), Springer Verlag. Madison - Wisconsin, USA, 22-24
July 1998.
32. Garcez, A.S., Zaverucha, G, Silva, V.N.A.L 1997. Applying the Connectionist Inductive Learning and Logic Programming
System to Power Systems Diagnosis. Proceedings of the IEEE International Conference on Neural Networks (ICNN-97), 9-12
June, Houston, Texas, USA, Vol.1 pp.121-126 .
33. Garcez, A.S., Zaverucha, G, Carvalho, L.A. 1997. Logic Programming and Inductive Learning in Artificial Neural Networks.
Knowledge Representation in Neural Networks, pp. 33-46, Logos-Verlag Berlin, ISBN 3-931216-77-2.
34. Garcez, A.S., Zaverucha, G, Carvalho, L.A. 1996. Logical Inference and Inductive Learning in Artificial Neural Networks. XII
European Conference of Artificial Intelligence, ECAI96, Workshop on Neural Network and Structural Knowledge, August 1216, Budapest, Hungary, pp. 41-49.
35. Anchizes do E.L. Gonçalves Filho, Marley M.B.R. Vellasco e Marco A.C. Pacheco, Neural
Networks and Invariant Moments for Noisy Images Recognition, International Conference on
Engineering Applications of Neural Networks (EANN'98), Gibraltar, 10-12 June 1998.
36. N. Hallack, G. Zaverucha and V. Barbosa, 1998 " Towards a Hybrid Model of First-Order Theory Refinement", presented in
Neural Information Processing Systems (NIPS)/ Workshop on Hybrid Neural Symbolic Integration , 4-5 December,
Breckenridge, Colorado, USA. To appear as LNAI, Springer Verlag, 1999.
37. Kilkerry Neto, A., Zaverucha, G, Carvalho, L.A. 1999. An Implementation of a Theorem Prover in Symmetric Neural
Networks. Accepted in International Joint Conference on Neural Networks 1999, Washington (July 1999). INNS Press.
38. Lima, P. M. V.,"Towards Neural Computation of REFORMed Programs", Proceedings of the Workshop on High Performance
Logic Programming Systems, European Summer School on Logic, Languages and Implementation (ESSLLI'96), Prague, 1996,
August.
39. Ludermir, T.B. e de Oliveira, W.R (1998) Extractin Rules from Boolean Neural Networks. Nos anais do ICONIP98.
Kitakyushu-Japão, 21-23 de outubro de 1998, pp.1666-69, IOS Press.
40. R Menezes, G. Zaverucha and V. Barbosa, 1998 " A Penalty-Function Approach to Rule extraction from Knowledge-Based
Neural Networks", in Proc. Fifth International Conference on Neural Information Processing (ICONIP’98), Vol. 3 pp. 14971500, 21-23 October, Kitakyushu, Japan.
41. Nehme, C. C., Carvalho, L. A. V., Mendes, S. B. T., “A Temporal Model to Storage Spatiotemporal Pattern Memory in
Shunting Cooperative Competitive Network", in SpatioTemporal Models in Biological and Artificial Systems, Eds. Silva, F L,
Principe J C, Almeida L B, IOS Press, Washington D.C., Jan. 1998
42. Nehme, C., C., Fuks, S. and Meirelles, M. 1988 A GIS Expert System for Land Evaluation. GisPlanet´98, Lisbon, Portugal.
43. Guilherme F. Ribeiro, Victor N. A. L. da Silva, Plutarcho Lourenço, Ricardo Linden – Comparative Studies of Short-Term Load
Forecasting Using Hybrid Neural Networks - International Symposium on Forecasting – Barbados
44. Luiz Sabino Ribeiro Neto, Marley M. Vellasco, Marco Aurélio Pacheco e Ricardo Zebulum, Very Short Term Load Forecasting
System Using Neural Networks, The Seventeenth Annual International Symposium on Forecasting, pp. 51, Barbados, 19-21 de
Junho de 1997.
45. Pessoa, L., Exel, S., Roque, A., & Leitao, A.P. (1998a). Attentive visual recognition for scene exploration. Proceedings of the
International Conference on Neural Information Processing, Kitakyushu, Japan, October 21-23, pp. 1291-1294.
46. Pessoa, L., Exel, S., Roque, A., & Leitao, A.P. (1998b). Active scene exploration vision system. Proceedings of the
International Conference on Neural Networks and Brain, Beijing, China, October, 27-30, pp. 543-546.
47. Pinto de Melo, C. , Santos, F. L., Souza, B.B., Santos, M. S. e Ludermir, T. B. (1998) Polypyrrole Based Aroma Sensor. Nos
anais da International Conference of Synthetic Metals, França, junho de 1998.
48. F. Protti and G. Zaverucha, 1998 "On the Relations Between Acceptable Programs and Stratifiable Classes", Proc. of the 14th
Brazilian Symposium on Artificial Intelligence (SBIA-98), Porto Alegre, Brazil, November, LNAI 1515, Springer Verlag, pp.
141-150.
49. Santos, M. S., Ludermir, T. B., Santos, F.L., Pinto de Melo, C. e Gomes, J.E. (1998) Artificial Nose and Data Analusis using
Multi Layer Perceptron. Nos anais da International Conference of Data Mining. Computational Mechanics Publication.
Southhampton, England. Setembro de 1998.
50. Santos Costa, V. and Bianchini, R. and Dutra, I. C., "Evaluating the Impact of Coherence Protocols on Parallel Logic
Programming Systems", pp. 376--381, 1997, Proceedings of the 5th EUROMICRO Workshop on Parallel and Distributed
Processing, Also available as technical report ES-389/96, COPPE/Systems Engineering, May, 1996.
51. Santos Costa, V. and Bianchini and Dutra, I. C., "Parallel Logic Programming Systems on Scalable Multiprocessors",
Proceedings of the 2nd International Symposium on Parallel Symbolic Computation, PASCO'97, 1997, July, pp. 58--67.
52. Victor N. A. L. Da Silva, Ricardo S. Zebulum 1996. "An Integration of Neural Networks and Fuzzy Logic for Power Systems
Diagnosis " Proceedings from Intelligent Systems Applications to Power Systems, Janeiro 96, Orlando, Florida, USA .
53. Victor N. A. L. Da Silva, Guilherme F. Ribeiro, Celia Regina S. H. Lourenco 1997. "Sistema Hibrido Inteligente para Diagnose
no Sistema Eletrico" Anais do VII Encontro Regional Latino-americano da CIGRE, Maio de 1997, Puerto Iguazu, Argentina.
17
54. M.A. Teixeira, G. Zaverucha, Silva, V. N. and Ribeiro, G.F. 1999. “Evaluation and Comparison of Different Architectures
Using Elman Networks Applied to Electric Load Forecasting”, in the Proc. of the IEEE International Conference on Intelligent
Systems Applications to Power Systems (ISAP’99), 4-8 April, pp. 3-7.
55. M.A. Teixeira, G. Zaverucha, Silva, V. N. and Ribeiro, G.F. 1999. Recurrent Neural Gas. Accepted in International Joint
Conference on Neural Networks 1999, Washington (July 1999). INNS Press.
56. Carlos E. Thomaz, Marco Aurélio C. Pacheco, Marley Vellasco, “Mobile Robot Path Planning Using Genetic Algorithms”,
IWANN99: 5th International Work-Conference on Artificial and Natural Neural Networks, Invited Section: Optimization
Techniques Based on Genetic Algorithms, Alicante, Spain, June 2-4 1999.
57. Rozenthal, M., Engelhardt, E., Carvalho, L.AV., Laks, J., The Neuropsychological Features of Schizophrenia Using Neural
Networks: Correlation with Psychopathology World Regional Congress of Biological Psychiatry, Brazil, October 1998.
58. Valença, M.J.S. e Ludermir, T.B. (1999) Neural networks with genetic algorithm for input selection. Aceito na International
Joint Conference on Neural Networks 1999, Washington (July 1999). INNS Press.
59. Valença, M.J.S. e Ludermir, T.B. (1999) Self-organization sigmoidal blocks networks. Aceito na International Joint Conference
on Neural Networks 1999, Washington (July 1999). INNS Press.
60. Valença, M.J.S. e Ludermir, T.B. (1999) Multiplicative-Addtive Neural Networks with Active Neurons. Aceito na International
Joint Conference on Neural Networks 1999, Washington (July 1999). INNS Press.
61. Ricardo Salem Zebulum, Marley Vellasco, Marco Aurélio Pacheco e Reinaldo C. Souza, NeuroPrev: A Neural-Net Tool for
Electrical Load Forecasting, The Seventeenth Annual International Symposium on Forecasting, pp. 50, Barbados, 19-21 de
Junho de 1997.
62. Ricardo S. Zebulum, M.A.C. Pacheco e Marley M.B.R. Vellasco, Comparison of Different Evolutionary Methodologies
Applied to Electronic Filter Design, International Conference on Evolutionary Computation (ICEC'98), IEEE World Congress
on Computational Intelligence, pp. 434-439, Anchorage, Alaska, 4-9 May1998.
63. Ricardo S. Zebulum, M.A.C. Pacheco e Marley M.B.R. Vellasco, Evolutionary Design of Logic Gates, 5th International
Conference on Artificial Intelligence in Design, Lisboa, Portugal, July 1998.
64. Ricardo S. Zebulum, M.A.C. Pacheco e Marley M.B.R. Vellasco, A Multi-Objective Optimisation Methodology Applied to the
Synthesis of Low-Power Operational Amplifiers, International Conference on Microelectronics and Packaging (ICMP'98) XIII SBMicro, Curitiba, August 1998.
65. Ricardo S. Zebulum, M.A.C. Pacheco e Marley M.B.R. Vellasco, Analog Circuits Evolution in Extrinsic and Intrinsic Modes,
2nd International Conference on Evolvable Systems – From Biology to Hardware (ICES'98), Lausanne, Suiça, September 1998.
66. Ricardo S. Zebulum, M.A.C. Pacheco, Marley M.B.R. Vellasco, Evolutionary Systems Applied to the Synthesis of a CPU
Controller, The Second Asia-Pacific Conference on Simulated Evolution and Learning, (SEAL’98), v. 3, Canberra, Australia,
24-27 November 1998.
Papers in Brazilian Conferences: 34
67. Luiz Biondi Neto, Marco Aurélio C. Pacheco, Marley Maria B.R. Vellasco, Emmanuel P.L. Passos e Luiz Chiganer, Sistema
Híbrido de Apoio à Decisão para Detecção e Diagnóstico de Falhas em Redes Elétricas, Anais do III Simpósio Brasileiro de
Redes Neurais, pp. 197-204, Recife, 12-14 November 1996.
68. Luiz Biondi Neto, Marley M.B. R. Vellasco, Marco Aurélio Pacheco, Emmanuel P. Lopes Passos e Luis Chiganer, Sistema
Híbrido de Apoio à Decisão para Detecção e Diagnóstico de Falhas em Redes Elétricas, XIV SNPTEE - Seminário Nacional
de Produção e Transmissão de Energia Elétrica, Belém, PA, 26 - 30 October 1997.
69. Barros, M.M., Souza, G.M. e Ludermir, T.B. (1998) Extração de Regras em uma Rede Neural Booleana utilizando o modelo
RAM. V Simpósio Brasileiro de Redes Neurais. Belo Horizonte, 10-12 de dezembro de 1998, volume II, p. 121-125, SBC.
70. Camargo, P. R. C., Perlingeiro, C. A. G., Carvalho, L. A. V., “Redes Neuronais no Dimensionamento e na Simulação Estática
de Processos Químicos'', XI Congresso Brasileiro de Engenhria Química, Rio de Janeiro, Setembro 1996.
71. de Oliveira, W. R. (1997). Continuity -algebras. Relations to Computer Science Workshop in Foundations of Computational
Mathematics Conference, IMPA, Rio de Janeiro, 05/01 a 11/01/1997.
72. Ferreira, V.M.G. and França, F.M.G., “Weightless implementations of weighted neural networks”, Anais do IV Simpósio
Brasileiro de Redes Neurais, Goiânia, GO, 53-54, dezembro de 1997.
73. Fogel, L., Zaverucha, G. 1997. Aplicando Invenção de Predicados no Aprendizado de Múltiplos Predicados. I Encontro
Nacional de Inteligencia Artificial (ENIA’97), SBC97, 5-6 de agosto, Brasilia, DF, pp. 25-30.
74. França, F.M.G., “A hybrid natural language understanding system driven by a single neural-based distributed dynamics”, Anais
do III Congresso Brasileiro de Redes Neurais, Florianópolis, SC, 250-255, julho de 1997.
75. França, F.M.G. (Editor), Anais do Workshop em Inteligência Computacional: Projetos ICOM e IPAC, Protem III-CC, CNPq,
Programa de Engenharia de Sistemas e Computação - COPPE/UFRJ, ES-450/97, setembro de 1997.
76. Garcez, A.S., Zaverucha, G, Silva, V.N.A.L.. 1996. Programação Lógica Estendida e Aprendizado Indutivo em Redes Neurais:
Uma Aplicação em Sistemas de Potencia III Simpósio Brasileiro de Redes Neurais, 12-14 Nov, Recife, pp. 205-212.
77. Kilkerry Neto, A., Zaverucha, G, Carvalho, L.A. 1998. Implementação de um Provador de Teoremas em Redes Neurais
Simétricas. Anais do V Simpósio Brasileiro de Redes Neurais (SBRN’98), 9-11 Dezembro, Belo Horizonte, pp. 115-120.
78. Lima Neto, F. B., Santos, M. S. e Ludermir, T. B. (1997) Estudo Comparativo de Desempenho das Topologias de Redes
Neurais: MLP e Cascade-Correlation em Problemas Reais de Classificação. III Congresso Brasileiro de Redes Neurais.
Florianópolis, Julho de 1997, pp. 493-498.
79. Ludermir, T. B., de Oliveira, W. R. e Santos, A Q. (1996) Um Analisador Sintático Neural para o Pascal. III Simpósio
Brasileiro de Redes Neurais. Recife, 12 a 14 de novembro de 1996 pp.302.
80. Ludermir, T. B., de Oliveira, W. R. e Santos, A. Q. (1997) Implementação Neural de um Analisador Sintático. III Congresso
Brasileiro de Redes Neurais. Florianópolis, Julho de 1997, pp. 244-249.
18
81. Ludermir, T. B. e De Oliveira, W.R. (1997) Extração de Regras de Redes Booleanas com Realimentação. IV Simpósio
Brasileiro de Redes Neurais. Goiânia, 3-5 de dezembro de 1997, pp. 112-114.
82. Ludermir, T.B. e De Oliveira, W.R. (1997) Suporte a Decisão para colheita de Cana-de-Açúcar. IV Simpósio Brasileiro de
Redes Neurais. Goiânia, 3-5 de dezembro de 1997, pp. 27-31.
83. Ludermir, T.B. (1998). Extracting rules from feedforward Boolean neural networks. V Simpósio Brasileiro de Redes Neurais.
Belo Horizonte, 10-12 de dezembro de 1998, volume I, p. 61-66, IEEE Computer Society.
84. Medeiros, C. A., Ludermir, T. B. e de Oliveira, W. R. (1997) Avaliação do Aprendizado de Caracteres em Redes Neurais sem
pesos. Nos anais do III Simpósio Brasileiro de Automação Inteligente , Vitória, pp. 388-393.
85. Menezes, R. and Zaverucha, G. 1997. Modificando o Treinamento do Sistema CIL2P para facilitar a extração de Programas em
Lógica. I Encontro Nacional de Inteligencia Artificial, (ENIA’97), SBC97, 5-6 de agosto, Brasilia, DF, pp. 19-24.
86. Nehme C. C., Carvalho, L. A. V., Mendes S. B. T., "The Gamma-Art Architecture Applied on Passive Sonar Classification
Problem", III Encontro de Tecnologia em Acústica Submarina, Instituto de Pesquisas da Marinha do Brasil, Rio de Janeiro, Set.
1997.
87. Nobre, C.N., Braga, A.P, Carvalho, A.P.L. e Ludermir, T.B. (1998). Extração de Conhecimento: Uma comparação entre os
métodos clássico e conexionista. V Simpósio Brasileiro de Redes Neurais. Belo Horizonte, 10-12 de dezembro de 1998, volume
II, p. 126-131, SBC.
88. Rozenthal, M., Carvalho, L.AV., Engelhardt E., Laks, J., Schizophrenia: Correlation between neuropsychology and
symptomatology with neural networks, Brazilian Psychiatry Conference, Brazil, October 1997.
89. Luiz Sabino Neto, Marley M.B.R. Vellasco e Marco A.C. Pacheco, Previsão de Carga em Curtíssimo Prazo Utilizando Redes
Neurais, VI Symposium of Specialists in Electric Operational and Expansion Planning (SEPOPE 98), pp. CDROM, Salvador,
BA, May 1998.
90. Luiz Sabino Neto, Marco A.C. Pacheco, Marley M.B.R. Vellasco e Karla Figueiredo, Controle de Cheias por Algoritmos
Genéticos (Overflow Control of Electric Generator Reservoir through Genetic Algorithms), Simpósio Internacional sobre
Gestão de Recursos Hídricos, Gramado, RS, October 1998.
91. Santos, M.S. e Ludermir, T.B. (1997) Construção de um Nariz Artificial Usando Redes Neurais. Iv Simpósio Brasileiro de
Redes Neurais. Goiânia, 3-5 de dezembro de 1997, pp. 71-73.
92. Santos, M.S., Ludermir, T.B., Santos, F.L., Pinto de Melo, C. e Gomes, J.E. (1998). Usando um Nariz Artificial para
Reconhecer Safras de vinho. V Simpósio Brasileiro de Redes Neurais. Belo Horizonte, 10-12 de dezembro de 1998, volume II,
p. 253-258, SBC.
93. Santos, F.L., Melo, C.P., Souza, J.E.G., Santos, M.S. e Ludermir, T.B. (1998) Nariz Eletrônico à base de Polopirrol Condutor,
Anais do XVI Encontro de Físiscos do Norte e Nordeste, 18 a 24 de outubro de 1998, Sociedade Brasileira de Física.
94. Victor N. A. L. da Silva, Guilherme F. Ribeiro, Celia R. S. H. Lourenço 1988. Application of Artificial Intelligence in Eletric
Power Systems Diagnosis - VI SEPOPE – Salvador – Bahia – Junho de 1998.
95. Souza, J.E.G., Neto B.B., Santos, F.L., Melo, C.P., Santos, M.S. e Ludermir, T.B. (1998) Desenvolvimento de sensores de
aroma baseados em filmes de polipirrol. II Simpósio Nacional de Instrumentação Agropecuária, 2 a 4 de dezembro de 1998.
96. Teixeira, M.A., Zaverucha, G, Silva, V.N., Ribeiro, G.F. 1998. Previsão de Cargas Elétricas Usando Redes de Elman. Anais do
V Simpósio Brasileiro de Redes Neurais (SBRN’98), 9-11 Dezembro, Belo Horizonte, pp. 161-164.
97. Valença, M.J.S. e Ludermir, T.B. (1998). Self-Organizing Modeling in Forecasting Daily River Flows. V Simpósio Brasileiro
de Redes Neurais. Belo Horizonte, 10-12 de dezembro de 1998, volume I, p. 210-214, IEEE Computer Society.
98. Valença, M.J.S. e Ludermir, T.B. (1998). Previsão de Demanda Máxima Mensal Utilizando um Modelo Auto-Organizável. V
Simpósio Brasileiro de Redes Neurais. Belo Horizonte, 10-12 de dezembro de 1998, volume II, p. 311-314, SBC.
99. Zaverucha, G. 1997. Extensões do CIL2P - Um Sistema Neural Para Programação em Lógica e Aprendizado Indutivo. I
Encontro Nacional de Inteligencia Artificial(ENIA 97), Congresso SBC, 5-6 de agosto, Brasilia, DF, pp. 10-18.
100. Ricardo S. Zebulum, Marco A.C. Pacheco e Marley M.B.R. Vellasco, Synthesis of CMOS Operational Amplifiers Through
Genetic Algorithms, XI Brazilian Symposium on Integrated Circuit (SBCCI’98), pp. 125-128, Buzios, Rio de Janeiro, 30
September to 3 October 1998.
19
Download