The Impact of Aristotelian Thinking on Information Retrieval Steve Halladay ABSTRACT Many consider Aristotle, along with Plato, to be the greatest western philosophers. Aristotelian thinking has had far reaching effects that influence today’s science and technology. This paper considers these influences and describes how Aristotelian thinking has enabled the creation of technologies including those used for Information Retrieval. At the same time, the broad application of Aristotelian thinking is also fundamentally limiting the technology’s capabilities. Exploring the effects of these philosophical forces on Information Technology may help researchers to create even more powerful systems. INTRODUCTION Aristotle was one of history’s foremost philosophers who left a legacy that continues to affect each of our daily lives. His appreciation for the role of empirical observation sets him apart as one of the framers of today’s scientific method. Although he lived over 2300 years ago, we still see the effects of his work in our modern sciences. One example of an Aristotelian contribution is in the area of ontological taxonomies for knowledge representation. Aristotle’s notion was that he could classify everything within a taxonomical hierarchy. The implications of this classification are twofold; first, there is a relationship between all things that can be described by a hierarchy, and second, once similar things are grouped into a class, they may be treated in a similar way. The latter point is illustrated in Aristotle’s famous syllogism: Socrates is a man. All men are mortal. Therefore, Socrates is mortal. In spite of Aristotle’s wonderful intellectual contributions, some of Aristotle’s thinking led him to inaccurate conclusions. For example, while Aristotle was aware that the earth was round, a concept that was generally ahead of it’s time; he was also under the misconception that the earth was the center of the universe. Another well known fallacy that captured Aristotle was the belief that objects of differing mass fall at different rates. This latter misconception is rather surprising given Aristotle’s penchant for observation. The point of this discussion is that while Aristotle made some amazing intellectual contributions, he was human and fallible. Therefore, we should think critically about the path we are on; a path initiated by Aristotle. We should appreciate the strengths and weaknesses of our thought processes as well as the useful and hindering applications of these processes. EFFECTS OF SIMPLIFICATION The result of Aristotle’s classification was the use of simplification or abstraction to help people deal with otherwise unwieldy systems. For example, all animals of the class cow can be milked, all animals of the class bird can fly, and all animals of the class fish can swim. Therefore, when someone describes an animal as a cow, bird or fish, we can have certain understandings of the animal’s capabilities and expectations about how we might interact with the animal. Initially classification mechanisms simplified thought and communications. This simplification occurs because classification alleviates the need to describe all objects’ attributes. If an object is identified with a class, then, by definition, the object has the attributes associated with that class. The use of classification concepts yielded some very concrete results during the industrial revolution. The advent of mass production was an extension of the concept of classification. Products were manufactured from classes of objects with attributes so similar, the objects became interchangeable. For example, Henry Ford mass produced cars in such a way that any Model A engine would fit into any Model A chassis. In fact, the more similar the objects of a class, the better the manufacturing process works. The idea of creating manufacturing objects with high degrees of similarity eventually led to the notion of specification. A specification is really a detailed description of the objects that are considered to be in the same classification or set. All objects that “meet specification” are considered to be valid members of the set and are therefore perfectly acceptable to the manufacturing process. The concept of specification has a related supporting concept called tolerance. Tolerance refers to the amount of acceptable variation of attributes within the set. As tolerance becomes “tight” the degree of acceptable variation diminishes. Tight tolerance decreases the degree of variability within the manufacturing process and can reduce the costs of the manufacturing process. However, creating objects with tight tolerance may increase the costs of producing the object. Many of the manufacturing enhancement processes, as introduced by Deming for example, focus on tightening tolerance to reduce variability within the manufacturing process. The benefits of classification are not limited to the industrial revolution. The computer industry employs the concepts of classification, specification and tolerance to create intellectual objects that are similar, or have the same expected behavior. In the domain of computer science these objects, embodied as software, may appear very different but have a behavior that meets a specification. Software designs that employ these concepts are robust because they can be extended by creating additional objects within the set of the specification which may have increased capability. For example, an object that performs a computationally intensive calculation may be replaced by an object of the same class that allows the calculation to be performed on a remote, higher speed processor. The significant point of this section is that classification helps us to simplify our world conceptually which enables significant technological advances. EFFECTS OF CLASSIFICATION Albert Einstein is quoted a saying “Things should be made as simple as possible – but no simpler.” Sometimes there is too much of a good thing. This may be the case with respect to the simplifying effects of classification. It is understandable to think that since the notions of classification have served us so well; the answer to all our problems is to increase the use of classification. This is specifically what many conventional information retrieval systems do. Many conventional information retrieval systems start by creating a hierarchical classification of concepts of interest. Next, these systems identify which words indicate the membership in specific sets within the hierarchy. By using these words, the retrieval systems assign documents to different sets within the hierarch. Requests for information are handled in a similar manner by using the words in the request to determine the set of interest within the hierarchy. Once the system identifies the set of interest, retrieval is a simple matter of returning all documents within the set. However, information retrieval systems have been plagued for many years with precision/recall tradeoff problems. It seems that these systems can either retrieve all the relevant documents with along with many non-relevant documents (i.e., high recall, low precision), or these systems can retrieve a few, but not all, documents that are mostly relevant (i.e., high precision, low recall). But these systems have not yet been able to retrieve only the relevant documents. For those steeped in Aristotelian thought patterns, the natural solution seems to be to create more classification sets, more levels in the hierarchy and identify more attributes to distinguish between documents. This approach is bound to yield some pay-offs, but the creation of the sets is laborious and time consuming. As a result, trying to solve this problem using augmented classification will eventually yield diminishing returns. An alternative perspective to consider is that classification is part of the problem instead of the solution. As the information revolution matures, the simplifying Aristotelian mechanisms may actually be masking many of the intricate relationships necessary for higher quality retrieval. In order to support the theory that classification mechanisms are actually inhibiting information retrieval, we must explain how this is possible. First, one might consider the effects of set boundaries. Those of us with a mathematical and scientific background are so familiar with sets that we accept sets as a fact of life. We view the world in such a way that objects are either members of a set or they are not. But closer examination can challenge our perspective. For example, consider the set of all chairs. Chairs are in the set and tables are not. What is a chair? Chairs have four legs, a seat and a back. What about objects with four legs and a seat but no back? We refer to these as stools. Are stools in the class of chairs? Similarly, what about objects with three legs a seat and a back? Are these objects still chairs? Perhaps an even more difficult judgment might be to consider a rock in a forest upon which one chooses to sit. Is such an object considered a chair? And so it is that as we challenge various combinations of attributes that define the set, we find that it is not always clear where the set boundaries lie. If determining the classification for a concrete object like a chair is ambiguous, one can only imagine how much more ambiguous the classification of non-concrete objects, like text, might be. Another concern with regards to classifying text is determining which part of the text to classify. For example, a paper such as this might have several sections. Each section discusses a different topic. Each section consists of multiple paragraphs that, in turn have unique topics and differing levels of conceptual abstraction. Carried further, each paragraph has several sentences and each sentence may have multiple phases, each expressing a different concept. Now, in order to correctly classify the document, one must decide if the document should be represented as a single point within a classification set or sets, or as multiple points within multiple sets. Either choice will have detrimental affects of both precision and recall. Finally consider the classical taxonomy which consists of a hierarchy. Hierarchies are essentially sets of sets. Humans have an affinity to taxonomies because taxonomies seem to have a specific place for everything. To find an object within a hierarchy, one only need start at the root and make successive branching choices until one arrives at the desired object location. The problem with hierarchies is that they are only as helpful as one weakest choice. When moving from ht root of the hierarch to its leaves, it only requires one poor judgment to get on the wrong branch and not find the desired object. For this exact reason, hierarchies do not scale well. In order to increase the capacity of a hierarchy, one must increase the breadth or depth of the hierarchy. Increasing the breadth makes each decision more difficult and, therefore, more errorprone. Likewise, increasing a hierarchy’s depth will increase the number of decisions that must be made in order to find the desired object and thereby increase the opportunity for error. Viewed in this light, the application of hierarchies in information retrieval is like creating a shell game of shell games. CONCLUSIONS Considering the effects of simplification as well as the affects of classification, one is left wondering if the information retrieval industry is doomed to languish in the doldrums somewhere between precision and recall. This assessment is correct only if information retrieval researchers choose to continue to think the same way they have to this point. Aristotle was a great thinker. Aristotle had some thoughts that were very helpful and some that were not. But perhaps the greatest aspect of Aristotle’s thinking lies not in the resulting thoughts themselves, but more importantly in the thinking process he employed. Aristotle had a knack for critical thinking. He observed phenomena and then constructed conceptual models that were consistent with his observations. These models had an honest conceptual integrity that was often independent of social pressures of preconditioning. If information researchers are to move forward, perhaps the most fertile results will occur not by thinking Aristotle’s thoughts, but by thinking more like Aristotle.