SYLLABUS 1. OOPS PARADIGM Programming language, Object-Oriented Programming, Object-Oriented Languages, Object-based programming languages, Object-oriented programming languages. 2. INTRODUCTION TO C++ Basic concept of oops-Objects, Classes, Encapsulation, Data Abstraction, Inheritance, Polymorphism, Dynamic Binding, Message Passing, Brief History of C++ 3. Data Types & Variables Structure of a C++ program., Comments, Variables,Identifiers,Data types. Declaration of variables, Initialization of variables, Scope of variables, Constants 4. Operator and control structures Types of Operators. Priority of Operators. Control 5. Array and pointer Arrays. Initializing arrays, Strings, Pointers. Pointers and arrays ,Dynamic Memory. 6. structures and union Structures.,User defined data types. 7. Functions Functions . Default values in arguments 8. classes and objects Introduction to class, Class Definition, Classes and Objects, Access specifiers – Private, Public and Protected. Member functions of the class. 9. Constructor and destructor Constructors, Overloading Constructors, Destructor 10. Function Overloading Function overloading, Precautions to be taken while overloading functions. Static Class Members, Static Member Functions, Friend Functions 11. Operator Overloading Introduction to Operator Overloading. ,Operator Overloading Fundamentals. Implementing the operator functions. 12. Inheritance Reusability.,Inheritance concept-singleinheritance.Using the derived class,Constructor and destructor in derived class, Object initialization and conversion., Nested classes (Container classes).\,Multilevel inheritance., Multiple inheritance., Hybrid Inheritance. Virtual base class. 13. Abstract and virtual function Abstract class, Virtual function. Pure virtual function 14. Templates and exception handling Templates. Exception handling, Advanced 15. File Input Output Input/Output with files. Open a file, closing a file REFERENCES: 1 1. Herbert Schildt, “C++ the Complete Reference “, III edition, TMH 1999 2. Balagurusamy, Entrepreneurial “Object Oriented programming with C++”, TMH 3. Barkakatin “objects oriented programming in C++” PHI 1995 2 SYALLABUS Introduction to OOPS,Data types and variables, Operators and control structures,Array and pointer,Structures and union,Functions,Classes and Objects Constructor and Destructor Functions,Function Overloading,Operator Overloading,Inheritance,Abstract and virtual function,Templates and exception handling,File handling 3 TABLE OF CONTENTS Unit1 Introduction to OOPS Programming language Object-oriented programming paradigm Object-Oriented Programming Object-Oriented Languages Object-based programming languages Object-oriented programming languages. Basic concept of oops Objects Classes Encapsulation Data Abstraction Inheritance Polymorphism Dynamic Binding Message Passing Fits of OOP Application of OOPS Brief History of C++ Unit 2-Data Types & Variables 2.1 Structure of a C++ program. 2.2 Comments 2.3 Variables 2.4 Identifiers 2.5 Data types. 2.6 Declaration of variables 2.7 Initialization of variables 2.8 Scope of variables 2.9 Constants Unit3-Operator and control structures 3.1 Types of Operators. 3.2 Priority of Operators. 3.3 Communication through console. 3.3.1 Output 3.3.2 input 3.4 Control Structures. 3.4.1 Conditional structure 3.4.2 Repetitive structures or loops 3.4.3 Bifurcation of control and jumps 3.4.4 The selective Structure: switch Unit4-Array and pointer 4.1 Arrays. 4 4.1.1 Initializing arrays 4.1.2 Access to the values of an Array 4.1.3 Multidimensional Arrays 4.1.4 Arrays as parameters 4.2 Strings 4.2.1 1Initialization of strings 4.2.2 Assigning values to strings 4.2.3 Converting strings to other types 4.2.4 Functions to manipulate strings 4.3 Pointers. 4.4 Pointers and arrays 4.5 Dynamic Memory. Unit 5-structures and union 5.1 Structures. 5.1.1 Poniters to structures 5.1.2 Nesting structures 5.2 User defined data types. 5.2.1 Typedef 5.2.2 Union 5.2.3 Num Unit-6-Functions 6.1 Functions . 6.2 Default values in arguments 6.3 Void Functions 6.4 Call by value and reference 6.5 Passing Reference to Functions. 6.6 Returning References from Functions 6.7 Inline function 6.8 Recursive function 6.9 Prototyping function Unit7-classes and objects 7.1 Introduction to class. 7.2 Class Definition 7.3 Classes and Objects 7.4 Access specifiers – Private, Public and Protected. 7.5 Member functions of the class. 7.6 Passing and returning objects. 7.7 Pointers to objects. 7.8 Array of objects. 7.9 The special ‘this’ pointer 7.10 self test Unit8 : Constructor and destructor 8.1 Constructors 8.1.1 Syntax rules for writing constructor functions 8.1.2 Different ways of calling contructor 8.2 Overloading Constructors 8.3 Destructor 8.4 Self test 5 Unit9- Function Overloading 9.1 Function overloading 9.2 Precautions to be taken while overloading functions. 9.3 Static Class Members 9.4 Static Member Functions 9.5 Friend Functions 9.6 Friend for Overloading Operators 9.7 Granting friendship to another class 9.8 Granting friendship to a member function of another class Unit10-. Operator Overloading 10.1 Introduction to Operator Overloading. 10.2 Operator Overloading Fundamentals. 10.3 Implementing the operator functions. 10.4 Rules for overloading the operators. 10.5 Pointer oddities (assignment) and Operator Overloading. 10.6 Overloading the Extraction and Insertion Operators 10.7 Conversion functions. 10.7.1 Conversion from basic to user-defined variable. 10.7.2 Conversion from User-Defined to Basic data type 10.7.3 Conversion Between Objects of Different Classes 10.7.4 Conversion function in the Destination Class 10.8 Table for Type Conversions 10.9 Self Test Unit 11. Inheritance 11.1 Reusability. 11.2 Inheritance concept- single inheritance. 11.2.1 Private derivation 11.2.2 Public derivation 11.2.3 The Protected Access 11.2.4 Summary of derivation 11.2.5 Table of derivation and access specifiers 11.3 Using the derived class 11.4 Constructor and destructor in derived class. 11.5 Object initialization and conversion. 11.6 Nested classes (Container classes). 11.7 Multilevel inheritance. 11.8 Multiple inheritance. 11.9 Hybrid Inheritance. 11.10 Virtual base class. Unit 12- Abstract and virtual function 12.1 Abstract class. 12.2 Virtual function. 12.3 Pure virtual function 12.4 self test Unit-13 Templates and exception handling 13.1 Templates. 13.1.1 Function template 13.1.2 Class templates 13.1.3 Template specialization 6 13.1.4 Parameter values for templates 13.1.5 Templates and multiple -file project 13.2 Exception handling 13.2.1 Exception not caught 13.2.2 Standard exception 13.3 Advanced class type-casting. 13.3.1 Reinterpret cast 13.3.2 Static cast 13.3.3 Dynamic cast 13.3.4 Tonst_cast 13.3.5 Typeid 13.4 Preprocessor directives. Unit 14- File Input Output 14.1 Input/Output with files. 14.2 Open a file 14.3 Closing a file 14.4 Methods of Input and Output Classes 14.5 Text mode files 14.6 State flags 14.7 Binary files 14.8 Buffers and Synchronization 14.9 I/O Manipulation 7 UNIT 1 INTRODUCTION TO OOPS Contents 1.1 Programming language 1.2 Object-oriented programming paradigm 1.2.1 Object-Oriented Programming 1.2.2 Object-Oriented Languages 1.2.3 Object-based programming languages 1.2.4 Object-oriented programming languages. 1.3 Basic concept of oops 1.3.1 Objects 1.3.2 Classes 1.3.3 Encapsulation 1.3.4 Data Abstraction 1.3.5 Inheritance 1.3.6 Polymorphism 1.3.7 Dynamic Binding 1.3.8 Message Passing 1.4 Benefits of OOP 1.5 Application of OOPS 1.6 Brief History of C++ 1.1 Programming Languages So, what exactly is a programming language? As a loose definition, a programming language is a tool used by a programmer to give the computer very specific instructions in order to serve some purpose for the user. A program is like a recipe. It outlines exactly the steps needed to create something or perform a certain task. Computers do exactly what they are told, no more, no less. When writing a program, a programmer must outline every possible step and scenario that could occur. The first programming languages that emerged, were assembly languages. These languages are exactly the instruction set of a specific processor. These languages are very low-level and hard to understand. For example, say we wanted to add two numbers, 3 and 4 and get a result: in C++: in assembly: Int a = 3 + 4; ldl 3, R1 ldl 4, R2 addl R1, R2, R3 8 The version in C++ is easier to understand and simpler to write. Programmers write their code in a high level language and then use a compiler to translate their code into an assembly language and then into a machine language that will run on the machine they are using. Programs consist of algorithms. An algorithm is just a well-outlined method for completing a task. • ask the user for the first number • ask the user for the second number • add the two numbers • display the result on the screen This high-level abstraction is not actual code. However, it does express the ideas of a program, and is called pseudo-code. Often, programmers will design their programs in pseudo-code, and then use this to write their actual code. So, why is there more than one programming language? It may seem that a standard language should be agreed on, since all languages are translated using a compiler anyways. However, languages are often designed with a specific use in mind, and some are better than others for dealing with certain problems. So if a programmer is capable of writing a compiler (which is a very complex piece of software) then they can design and create a language. The most important thing to remember about programming languages is that they are only an abstraction! Programming languages were created so developers could express their ideas on a higher level than a computer can understand. Once a user has a good concept of how computers work, and has learned a few computer languages, it becomes much easier to pick up new languages. A programming language is a tool used by programmers in order to specifically outline a series of steps that a computer is to take in a certain instance. High-level programming languages allow a programmer to express ideas on an abstract level, and forces the compiler to worry about the lowlevel implementation details. This allows for faster development of applications, since applications are easier to write. There are even fourth generation languages emerging as viable programming languages. Recall that machine code is considered first generation, assembly languages are second generation, compiled languages are third generation. Fourth generation languages are actually code-generating environments, such as Microsoft's Visual Basic. These fourth generation languages allow programmers to express their ideas visually, and the environment then writes the code to implement these ideas. 1.2 OBJECT-ORIENED PROGRAMMING PARADIGNM The major motivating factor in the invention of object-oriented approach is to remove some of the flaws encountered in the procedural approach. OOP treats data as a critical element in the program development and does not allow it to flow freely around the system. It ties data more closely to the functions that operate on it, and protects it from accidental modification from outside functions. OOP allows decomposition of a problem into a number of entities called object and then builds data and functions around these objects. The organization of data and functions in object-oriented programs is shown in fig. The data of an object can access the functions of other objects. 9 Object A Object B Data Data Functions Functions Object C Functions Data Some of the striking features of object-oriented programming are: • Emphasis is on data rather than procedure. • Programs are divided into what are known as objects • Data structures are designed such that they characterize the objects. • Functions that operate on the data of an object are tied together in the data structure. • Data is hidden and cannot be accessed by external functions. • Objects may communicate with each other through functions. • New data and functions can be easily added whenever necessary. • Follows bottom-up approach in program design. Object-oriented programming is the most recent concept among programming paradigms and still means different things to different people. It is therefore important to have a working definition of object-oriented programming before we proceed further. We define “object-oriented programming as an approach that provides a way of modularizing programs by creating partitioned memory area for both data and functions that can be used as template for creating copies of such modules on demand.” Thus, an object is considered to be a partitioned area of computer memory that stores data and set of operations that can access that data. Since the memory partitions are independent, the objects can be used in a variety of different programs without modifications. 1.2.1 Object-Oriented Programming Since object –oriented programming (OOP) drove the creation of ++, it is necessary to understand its foundational principles. OOP is a powerful way to approach the job of programming. 10 Programming methodologies have changed grammatically since the invention of the computer, primarily to accommodate the increasing complexity of programs. For example, when computers were first invented, programming was done by toggling in the binary machine instructions using the computer’s front panel. As long as programs grew, assembly language was invented so that a programmer could deal with larger, increasingly complex programs, using symbolic representations of the machine instructions. As program continued to grow, high-level languages were introduced that gave the programmer more tools with which to handle complexity. The first widespread language was, of course, FORTRN. Although FORTRON was a very impressive first step, it is hardly a language that encourages clear, easy-to-understand programs. The 1960s gave birth to structured programming. This is the method encouraged by languages such as and Pascal. The use of structured languages made it possible to write moderately complex programs fairly easily. Structured languages are characterized by their support for stand-alone subroutines, local variables, rich control constructs, and their lack of reliance upon the GOTO. Although structured languages are a powerful tool, even they reach their limit when a project becomes too large. Consider this: At each milestone in the development of programming, techniques and tools were created to allow the programmer to deal with increasingly greater complexity. Each step of the way, the new approach took the best elements of the previous methods and moved forward. Prior to the invention of OOP, many projects were nearing (or exceeding) the point where the structured approach no longer worked. Object-oriente4d methods were created to help programmers break through these barriers. Object-oriented programming took the best ideas of structured programming and combined them with several new concepts. The result was a different way of organizing a program. In the most general sense, a program can be organized in of two ways: around its code (what is happening) or around its data (who is being affected). Using only structured programming techniques, programs are typically organized around code. This approach can be thought of as “code acting on data”. For example, a program written in a structured language such as is defined by its functions, any of which may operate on any type of data used by the program. Object-oriented programs work the other way around. They are organized around data, with the key principle being “data controlling access to code.” In an object-oriented language, you define the data and the routines that are permitted to act on that data. Thus, a data type defines precisely what sort of operations can be applied to that data. To support the principles of object-oriented programming, all OOP languages have three traits in common: encapsulation, polymorphism, and inheritance. 1.2.2 Object-Oriented Languages Object-oriented programming is not the right way of any particular language. Like structured programming, OOP concepts can be implemented using languages such as C and Pascal . However, programming becomes clumsy and may generate confusion when the programs grow large. A language that is specially designed to support the OOP concepts makes it easier to implement them. 11 The languages should support several of the OOP concepts to claim that they are object-oriented. Depending upon the features they support, they can be classified into the following two categories: 1.2.3 Object-based programming languages, Object-based programming is the style of programming that primarily supports encapsulation and object identity. Major features that are required for object-based programming are: • Data encapsulation • Data hiding and access mechanism • Automatic initialization and clear-up of objects • Operator overloading Languages that support programming with objects are said to be object-based programming languages. They do not support inheritance and dynamic binding. Ad is a typical object-based programming language. 1.2.4 Object-oriented programming languages. Object-oriented programming incorporates all of object-based programming features along with two additional features, namely, inheritance and dynamic binding. Object-oriented programming can therefore be characterized by the following statement: Object based features + inheritance + dynamic binding Languages that support these features include C++, Smalltalk, Object Pascal and Java . There are a large number of object-oriented programming languages. Table lists some popular general purpose OOP languages and their characteristics. Characteristi cs Simul a * Smalltal k * Objectiv e c C++ Ada ** Object Pascal Turbo Pasca l Ffecl * Java * Binding (early or late) Both Late Both Both Arly Late Early Earl y Both Poor Poor Poor Diffi cult No No Pro mise d ----- ----- Polymorphis m Data Hiding Concurrency Inheritance Multiple Inheritance No No No Garbage Collection No 12 No Persistence No Promise d No Genericity NO No No NO Object Libraies Like 3 GL No No No No Som e supp ort No Not muc h As seen from Table, all languages provide for polymorphism and data hiding. However , many of them do not provide facilities for concurrency, persistence and generosity. Eiffel, Ad and C++ provide generic facility which is an important construct for supporting reuse. However, persistence(a process of storing objects) is not full supported by any of them. In Smalltalk, though the entire current execution state can be saved to disk, yet the individual objects cannot be saved to an external file. Commercially, C++ is only 10 years old, Smalltalk and Objective C13 years old, and Java only 5 years old. Although Similar has existed for more than two decades, it has spent most of its life I n a re search environment. The field is so new, however ,that it should not be judged too harshly. Use of a particular language depends on characteristics and requirements of an application, organizational impact of the choice, and reuse of the existing programs. C++ has now become the most successful , practical, general purpose OOP language, and is widely used in industry today. 1.3 Basic concept of Object-oriented programming It is necessary to understand some of the concepts used extensively in object-oriented programming. These include: 1.3.1. Objects: Objects are the basic run-time entities in an object-oriented system. They may represent a person, a place, a bank account, a table of data or any item that the program has to handle. They may also represent user-defined data such as vectors, time and lists. Programming problem is analyzed in term of objects and the nature of communication between them. Program objects should be chosen such that they match closely with real-world objects. Objects take up space in memory and have an associated address like a record in Pascal, or a structure in . When a program is executed, the objects interact by sending messages to one another. Foe example, if” customer” and “account” are two objects in a program, then the customer object may send a message to the account object requesting for the bank balance. Ach object contains data and code to manipulate the data. Objects can interact without having to know details of each other’s data or code. It is sufficient to know the type of message accepted, and the type of response return by the objects. Although different authors represent them differently, fig. below shows two notations that are popularly used in object-oriented analysis and design. 13 STUDENT Object:STUDEN T Total DATA Name Date-of-birth Marks Average Display FUNCTIONS Total Average Display 1.3.2. Classes: We just mentioned that objects contained that objects contain data, and code to manipulate that data. The entire set of data and code of an object can be made a user-defined data type with the help of a class. In fact, objects are variables of that class type. Once a class has been defined , we can create any number of objects belonging to that class. Each object is associated with the data of type class with which they are created. A class is thus a collection of objects of similar type. Foe example, mango, apple and orange are member of the class fruit. Classes are user-defined data types and behave like the built-in types of a programming language. The syntax is used to create an object is no different than the syntax used to create an integer object in C. If fruit has been defined as a class, then the statement: Fruit mango; Will create an object mango belonging to the class fruit. 1.3.3 Encapsulation: The wrapping of data and functions into a single unit Data encapsulation is the most striking feature of a outside world, and those functions, which are wrapped provide the interface between the object’s data and the direct access by the program is called data hiding . (called class) is known as encapsulation. class. The data is not accessible to the in the class, can access it. These function program. This insulation of the data from 1.3.4 Data Abstraction: Abstraction refers to the act of representing essential features with out including the background details or explanations. Classes use the concept of abstraction and are defined as a list of abstract attributes such as size, weight and cost and functions to operate on these attributes. Sometimes, these are called data members because they hold information. The functions that operate on these data are called methods or member functions. 1.3.5 Inheritance 14 Inheritance is the process by which objects of one class acquire the properties of objects of another class. It supports the concept of hierarchical classification. For example, the bird ‘robin’ is a part of the class ‘flying bird’ which is again a part of the class ‘bird’. The principle behind this sort of division is that each derived class shares common characteristics with the class from which it is derived as illustrated in figure. In OOP, the concept of inheritance provides the idea of reusability. This means that we can add additional features to an existing class without modifying it. This is possible by deriving a new class from the existing one. The new class will have the combined features of both of the classes. The real appeal and the power of the inheritance mechanism is that it a allows the programmer to reuse a class that is almost, but not exactly, what he wants, and to tailor the class in such a way that it does not introduce any Bird Attributes Feathers Lay eggs Flying Bird Non-flying Bird Attributes ………… ………… Attributes ……………… ……………….. Robin Attributes ………… ………… ………… Swallow Attributes ………… ………… . Penguin Attributes ………. ………. Kiwi Attributes ……….. ………… Undesirable side-effects into the rest of the classes. Note that each sub-class defines only those features that are unique to it. Without the use of classification, each class would have to explicitly include all its features. 1.3.6 Polymorphism: Polymorphism is another important OOP concept. Polymorphism, a Greek term, means the ability to take more than one form. An operation may exhibit different behaviors in different instances. The behavior depends upon the type of data used in the operation. For example, consider the 15 operation of addition. For two numbers, the operation will generate a sum. If the operands are strings, then the operation would produce a third string by concatenation. The process of making an operator to exhibit different behaviors in different instances is known as operator overloading. Figure below illustrates that a single function name can be used to handle different number and different types of arguments. This is something similar to a particular word having several different meanings depending on the context. Using a single function name to perform different types of tasks is known as function overloading. Shape Draw () Circle object Box object Triangle object Draw(circle) Draw(box) Draw(triangle) Polymorphism plays an in important role in allowing objects having different internal structures to share the same external interface. This means that a general class of operations may be accessed in the same manner even though specific actions associated with each operation may differ. Polymorphism is extensively used in implementing inheritance. 1.3.7 Dynamic Binding: Binding refers to the linking of a procedure call to the code to be executed in response to the call. Dynamic binding (also known as late binding) means that the code associated with a given procedure call is not known until the time of the call at run-time. It is associated with polymorphism and inheritance. A function call associated with a polymorphism reference depends on the dynamic type of that reference. Consider the procedure “draw” in the above figure. By inheritance, every object will have this procedure. Its algorithm is, however, unique to each object and so the draw procedure will be refined in each class that defines the object. At run-time, the code matching the object under current reference will be called. 1.3.8 Message Passing: An object-oriented program consists of a set of objects that communicate with each other. The process of programming in an object-oriented language, therefore, involves the following basic steps: 1. Creating classes that define objects and their behavior, 16 2. Creating objects from class definitions, and 3. Establishing communication among objects. Objects communicate with one another by sending and receiving information much the same way as people pass message to one another. The concept of message passing makes it easier to talk about building systems that directly model or simulate their real-world counterparts. A message for an object as a request for execution of a procedure, and therefore will invoke a function (procedure) in the receiving object that generates the desired result. Message passing involves specifying the name of the object, the name of the function(message) and the information to be sent. Example: Employee. salary(name); object information message Objects have a life cycle. They can be created and destroyed. Communication with an object is feasible as long as it is alive. 1.4 Benefits of OOP OOP offers several benefits to both the program designer and the user. Object-orientation contributes to the solution of many problems associated with the development and quality of software products. A new technology promises greater programmer productivity, better quality of software and lesser maintenance cost. The principal advantages are: • Through inheritance, we can eliminate redundant code and extend the use of existing • Classes. • We can build programs from the standard working modules that communicate with one another, rather than having to start writing the code from scratch. This leads to saving of development time and higher productivity. • The principle of data hiding helps the programmer to build secure programs that cannot be invaded by code in other parts of the program. • It is possible to have multiple instances of an object to co-exist without any interference. • It is possible to map objects I the problem domain to those in the program. • It is easy to partition the work in a project based on objects. • The data-centered design approach enables us to capture more details of a model in implemental form • Object-oriented systems can be easily upgraded from small to large systems. 17 • Message passing techniques for communication between objects makes the interface descriptions with external systems much simpler. • Software complexity can be easily managed. While it is possible to incorporate all these features in an object-oriented system, their importance depends on the type of the project and the preference of the programmer. There are a number of issues that need to be tackled to reap some of the benefits stated above. For instance, object 18 libraries must be available for reuse. The technology is still developing and current products may be superseded quickly. Strict controls and protocols need to be developed if reuse is not to be compromised. Developing software that is easy to use makes it hard to build. It is hoped that the object-oriented programming tools would help manage this problem. 1.5 Application of OOPS OOP has become one of the programming buzzwords today. There appears to be a great deal of excitement and interest among software engineers in using OOPS . Applications of OOP are beginning to gain importance in many areas. The most popular application of object –oriented programming, up to now, has been in the area of user interface design such as windows. Hundreds of windowing systems have been developed, using the OOP techniques. Real –business systems are often much more complex and contain many more objects with complicated attributes and methods. OOP is useful in these types of applications because can simplify a complex problem. The promising areas for application of OOP includes. • Real-time systems • Simulation and modeling • Object-oriented databases • Hypertext, hypermedia and expert text • AI and expert systems • Neural networks and parallel programming • Decision support and office automation systems • CIM/CAM/CAD systems The object-oriented paradigm sprang from the language, has matured into design, and has recently moved into analysis. It is believed that the richness of OOP environment will enable the software industry to improve not only the quality of software systems but also its productivity. Object-oriented technology is certainly going to change the way the software engineers think, analyze, design and implement future systems. 1.6 Brief History of C++ The C++ Programming Language is basically an extension of the C Programming Language. The C Programming language was developed from 1969-1973 at Bell labs, at the same time the UNIX operating system was being developed there. C was a direct descendant of the language B, which was developed by Ken Thompson as a systems programming language for the fledgling UNIX operating system. B, in turn, descended from the language BCPL which was designed in the 1960s by Martin Richards while at MIT. In 1971 Dennis Ritchie at Bell Labs extended the B language (by adding types) into what he called NB, for "New B". Ritchie credits some of his changes to language constructs found in Algol68, although he states "although it [the type scheme], perhaps, did not emerge in a form that Algol's 19 adherents would approve of" After restructuring the language and rewriting the compiler for B, Ritchie gave his new language a name: "C". In 1983, with various versions of C floating around the computer world, ANSI established a committee that eventually published a standard for C in 1989. In 1983 Bjarne Stroustrup at Bell Labs created C++. C++ was designed for the UNIX system environment, it represents an enhancement of the C programming language and enables programmers to improve the quality of code produced, thus making reusable code easier to write. Why Program in C++? So what is so special about C++? Why should you use C++ to develop your applications? First, C++ is not the best language to use in every instance. C++ is a great choice in most instances, but some special circumstances would be better suited to another language. There are a few major advantages to using C++: 1. C++ allows expression of abstract ideas C++ is a third generation language that allows a programmer to express their ideas at a high level as compared to assembly languages. 2. C++ still allows a programmer to keep low-level control Even though C++ is a third generation language, it has some of the "feel" of an assembly language. It allows a programmmer to get down into the low-level workings and tune as necessary. C++ allows programmers strict control over memory management. 3. C++ has national standards (ANSI) C++ is a language with national standards. This is good for many reasons. Code written in C++ that conforms to the national standards can be easily integrated with preexisting code. Also, this allows programmers to reuse certain common libraries, so certain common functions do not need to be written more than once, and these functions behave the same anywhere they are used. 4. C++ is reusable and object-oriented C++ is an object-oriented language. This makes programming conceptually easier (once the object paradigm has been learned) and allows easy reuse of code, or parts of code through inheritance. 5. C++ is widely used and taught C++ is a very widely used programming language. Because of this, there are many tools available for C++ programming, and there is a broad base of programmers contributing to the C++ "community". 20 UNIT 2 DATA TYPES & VARIABLES Contents 2.1 Structure of a C++ program. 2.2 Comments 2.3 Variables 2.4 Identifiers 2.5 Data types. 2.6 Declaration of variables 2.7 Initialization of variables 2.8 Scope of variables 2.9 Constants 2.1 Structure of a C++ program. Probably the best way to start learning a programming language is with a program. So here is our first program: // my first program in C++ #include <iostream.h> int main () { cout << "Hello World!"; return 0; } The above code shows the source code for our first program, which we can name, for example, hiworld.cpp. The way to edit and compile a program depends on the compiler you are using. Depending on whether it has a Development Interface or not and on its version. Consult section compilers and the manual or help included with your compiler if you have doubts on how to compile a C++ console program. The previous program is the first program that most programming apprentices write, and its result is the printing on screen of the "Hello World!" sentence. It is one of the simpler programs that can be written in C++, but it already includes the basic components that every C++ program has. We are going to take a look at them one by one: // my first program in C++ This is a comment line. All the lines beginning with two slash signs (//) are considered comments and do not have any effect on the behavior of the program. They can be used by the programmer 21 to include short explanations or observations within the source itself. In this case, the line is a brief description of what our program does. #include <iostream.h> Sentences that begin with a pound sign (#) are directives for the preprocessor. They are not executable code lines but indications for the compiler. In this case the sentence #include <iostream.h> tells the compiler's preprocessor to include the iostream standard header file. This specific file includes the declarations of the basic standard input-output library in C++, and it is included because its functionality is used later in the program. int main ( ) This line corresponds to the beginning of the main function declaration. The main function is the point by where all C++ programs begin their execution. It is independent of whether it is at the beginning, at the end or in the middle of the code - its content is always the first to be executed when a program starts. In addition, for that same reason, it is essential that all C++ programs have a main function. main is followed by a pair of parenthesis () because it is a function. In C++ all functions are followed by a pair of parenthesis () that, optionally, can include arguments within them. The content of the main function immediately follows its formal declaration and it is enclosed between curly brackets ({}), as in our example. cout << "Hello World"; This instruction does the most important thing in this program. cout is the standard output stream in C++ (usually the screen), and the full sentence inserts a sequence of characters (in this case "Hello World") into this output stream (the screen). cout is declared in the iostream.h header file, so in order to be able to use it that file must be included. Notice that the sentence ends with a semicolon character (;). This character signifies the end of the instruction and must be included after every instruction in any C++ program (one of the most common errors of C++ programmers is indeed to forget to include a semicolon ; at the end of each instruction). return 0; The return instruction causes the main() function finish and return the code that the instruction is followed by, in this case 0. This it is most usual way to terminate a program that has not found any errors during its execution. As you will see in coming examples, all C++ programs end with a sentence similar to this. Therefore, you may have noticed that not all the lines of this program did an action. There were lines containing only comments (those beginning by //), lines with instructions for the compiler's preprocessor (those beginning by #), then there were lines that initiated the declaration of a function (in this case, the main function) and, finally lines with instructions (like the call to cout <<), these last ones were all included within the block delimited by the curly brackets ({}) of the main function. 22 The program has been structured in different lines in order to be more readable, but it is not compulsory to do so. For example, instead of int main ( ) { cout << " Hello World "; return 0; } we could have written: int main ( ) { cout << " Hello World "; return 0; } in just one line and this would have had exactly the same meaning. In C++ the separation between instructions is specified with an ending semicolon (;) after each one. The division of code in different lines serves only to make it more legible and schematic for humans that may read it. Here is a program with some more instructions: // my second program in C++ Hello World! I'm a C++ program #include <iostream.h> int main () { cout << “Hello World! "; cout << "I'm a C++ program"; return 0; } In this case we used the cout << method twice in two different instructions. Once again, the separation in different lines of the code has just been done to give greater readability to the program, since main could have been perfectly defined thus: int main () { cout << " Hello World! "; cout << " I'm to C++ program "; return 0; } We were also free to divide the code into more lines if we considered it convenient: int main () { cout << "Hello World!"; cout << "I'm a C++ program"; return 0; } 23 And the result would have been exactly the same than in the previous examples. Preprocessor directives (those that begin by #) are out of this rule since they are not true instructions. They are lines read and discarded by the preprocessor and do not produce any code. These must be specified in their own line and do not require the include a semicolon (;) at the end. 2.2 Comments Comments are pieces of source code discarded from the code by the compiler. They do nothing. Their purpose is only to allow the programmer to insert notes or descriptions embedded within the source code. C++ supports two ways to insert comments: // line comment /* block comment */ The first of them, the line comment, discards everything from where the pair of slash signs (//) is found up to the end of that same line. The second one, the block comment, discards everything between the /* characters and the next appearance of the */ characters, with the possibility of including several lines. We are going to add comments to our second program: /* my second program in C++ with more comments */ Hello World! I'm a C++ program #include <iostream.h> int main () { cout << "Hello World! "; // says Hello World! cout << "I'm a C++ program"; // says I'm a C++ program return 0; } If you include comments within the source code of your programs without using the comment characters combinations //, /* or */, the compiler will take them as if they were C++ instructions and, most likely causing one or several error messages. 2.3 Variables The usefulness of the "Hello World" programs shown in the previous section are something more than questionable. We had to write several lines of code, compile them, and then execute the resulting program just to obtain a sentence on the screen as result. It is true that it would have 24 been much faster to simply write the output sentence by ourselves, but programming is not limited only to printing texts on screen. In order to go a little further on and to become able to write programs that perform useful tasks that really save us work we need to introduce the concept of the variable. Let's think that I ask you to retain the number 5 in your mental memory, and then I ask you to also memorize the number 2. You have just stored two values in your memory. Now, if I ask you to add 1 to the first number I said, you should be retaining the numbers 6 (that is 5+1) and 2 in your memory. Values that we could now subtract and obtain 4 as the result. All this process that you have made is a simile of what a computer can do with two variables. This same process can be expressed in C++ with the following instruction set: a = 5; b = 2; a = a + 1; result = a - b; Obviously this is a very simple example since we have only used two small integer values, but consider that your computer can store millions of numbers like these at the same time and conduct sophisticated mathematical operations with them. Therefore, we can define a variable as a portion of memory to store a determined value. Each variable needs an identifier that distinguishes it from the others, for example, in the previous code the variable identifiers were a, b and result, but we could have called the variables any names we wanted to invent, as long as they were valid identifiers. 2.4. Identifiers A valid identifier is a sequence of one or more letters, digits or underline symbols ( _ ). The length of an identifier is not limited, although for some compilers only the 32 first characters of an identifier are significant (the rest are not considered). Neither spaces nor marked letters can be part of an identifier. Only letters, digits and underline characters are valid. In addition, variable identifiers should always begin with a letter. They can also begin with an underline character ( _ ), but this is usually reserved for external links. In no case they can begin with a digit. Another rule that you have to consider when inventing your own identifiers is that they cannot match any key word of the C++ language nor your compiler's specific ones since they could be confused with these. For example, the following expressions are always considered key words according to the ANSI-C++ standard and therefore they must not be used as identifiers: asm, auto, bool, break, case, catch, char, class, const, const_cast, continue, default, delete, do, double, dynamic_cast, else, enum, explicit, extern, false, float, for, friend, goto, if, inline, int, long, mutable, namespace, new, operator, private, protected, public, register, reinterpret_cast, return, short, signed, sizeof, static, static_cast, struct, switch, template, this, throw, true, try, typedef, typeid, typename, union, unsigned, using, virtual, void, volatile, wchar_t Additionally, alternative representations for some operators do not have to be used as identifiers since they are reserved words under some circumstances: 25 and, and_eq, bitand, bitor, compl, not, not_eq, or, or_eq, xor, xor_eq Your compiler may also include some more specific reserved keywords. For example, many compilers which generate 16 bit code (like some compilers for DOS) also include far, huge and near as key words. Very important: The C++ language is "case sensitive", that means that an identifier written in capital letters is not equivalent to another one with the same name but written in small letters. Thus, for example the variable RESULT is not the same as the variable result nor the variable Result. 2.5.Data types When programming, we store the variables in our computer's memory, but the computer must know what we want to store in them since storing a simple number, a letter or a large number is not going to occupy the same space in memory. Our computer's memory is organized in bytes. A byte is the minimum amount of memory that we can manage. A byte can store a relatively small amount of data, usually an integer between 0 and 255 or one single character. But in addition, the computer can manipulate more complex data types that come from grouping several bytes, such as long numbers or numbers with decimals. Next you have a list of the existing fundamental data types in C++, as well as the range of values that can be represented with each one of them: DATA TYPES Name Bytes* Description Range* char 1 character or integer 8 bits length. signed: -128 to 127 unsigned: 0 to 255 short 2 integer 16 bits length. signed: -32768 to 32767 unsigned: 0 to 65535 long 4 integer 32 bits length. signed:-2147483648 to 2147483647 unsigned: 0 to 4294967295 Int * Integer. Its length traditionally depends on the length of the system's Word type, thus in MSDOS it is 16 bits long, whereas in 32 bit systems (like Windows See short, long 9x/2000/NT and systems that work under protected mode in x86 systems) it is 32 bits long (4 bytes). float 4 floating point number. 3.4e + / - 38 (7 digits) double 8 double precision floating point number. 1.7e + / - 308 (15 digits) long double 10 long double precision floating point number. 1.2e + / - 4932 (19 digits) 26 bool 1 wchar_t 2 Boolean value. It can take one of two values: true or false NOTE: this is a type recently added by the ANSI-C++ true or false standard. Not all compilers support it. Consult section bool type for compatibility information. Wide character. It is designed as a type to store international characters of a two-byte character set. NOTE: this is a wide characters type recently added by the ANSI-C++ standard. Not all compilers support it. * Values of columns Bytes and Range may vary depending on your system. The values included here are the most commonly accepted and used by almost all compilers. In addition to these fundamental data types there also exist the pointers and the void parameter type specification, that we will see later. 2.6. Declaration of variables In order to use a variable in C++, we must first declare it specifying which of the data types above we want it to be. The syntax to declare a new variable is to write the data type specifier that we want (like int, short, float...) followed by a valid variable identifier. For example: int a; float mynumber; Are valid declarations of variables. The first one declares a variable of type int with the identifier a. The second one declares a variable of type float with the identifier mynumber. Once declared, variables a and mynumber can be used within the rest of their scope in the program. If you need to declare several variables of the same type and you want to save some writing work you can declare all of them in the same line separating the identifiers with commas. For example: int a, b, c; 27 declares three variables (a, b and c) of type int , and has exactly the same meaning as if we had written: int a; int b; int c; Integer data types (char, short, long and int) can be signed or unsigned according to the range of numbers that we need to represent. Thus to specify an integer data type we do it by putting the keyword signed or unsigned before the data type itself. For example: unsigned short NumberOfSons; signed int MyAccountBalance; By default, if we do not specify signed or unsigned it will be assumed that the type is signed, therefore in the second declaration we could have written: int MyAccountBalance; with exactly the same meaning and since this is the most usual way, few source codes include the keyword signed as part of a compound type name. The only exception to this rule is the char type that exists by itself and it is considered a different type than signed char and unsigned char. Finally, signed and unsigned may also be used as a simple types, meaning the same as signed int and unsigned int respectively. The following two declarations are equivalent: unsigned MyBirthYear; unsigned int MyBirthYear; To see what variable declaration looks like in action in a program, we are going to show the C++ code of the example about your mental memory proposed at the beginning of this section: // operating with variables 4 #include <iostream.h> int main () { // declaring variables: int a, b; int result; // process: a = 5; b = 2; a = a + 1; result = a - b; // print out the result: cout << result; 28 // terminate the program: return 0; } Do not worry if something about the variable declarations looks a bit strange to you. You will see the rest in detail in coming sections. 2.7. Initialization of variables When declaring a local variable, its value is undetermined by default. But you may want a variable to store a concrete value the moment that it is declared. In order to do that, you have to append an equal sign followed by the value wanted to the variable declaration: type identifier = initial_value ; For example, if we want to declare an int variable called a that contains the value 0 at the moment in which it is declared, we could write: int a = 0; Additionally to this way of initializating variables (known as c-like), C++ has added a new way to initialize a variable: by enclosing the initial value between parenthesis (): type identifier (initial_value) ; For example: int a (0); Both ways are valid and equivalent in C++. 2.8. Scope of variables All the variables that we are going to use must have been previously declared. An important difference between the C and C++ languages, is that in C++ we can declare variables anywhere in the source code, even between two executable sentences, and not only at the beginning of a block of instructions, like happens in C. Anyway, it is recommended under some circumstances to follow the indications of the C language when declaring variables, since it can be useful when debugging a program to have all the declarations grouped together. Therefore, the traditional C-like way to declare variables is to include their declaration at the beginning of each function (for local variables) or directly in the body of the program outside any function (for global variables). 29 Global variables can be referred to anywhere in the code, within any function, whenever it is after its declaration. The scope of the local variables is limited to the code level in which they are declared. If they are declared at the beginning of a function (like in main) their scope is the whole main function. In the example above, this means that if another function existed in addition to main(), the local variables declared in main could not be used in the other function and vice versa. In C++, the scope of a local variable is given by the block in which it is declared (a block is a group of instructions grouped together within curly brackets {} signs). If it is declared within a function it will be a variable with function scope, if it is declared in a loop its scope will be only the loop, etc... In addition to local and global scopes there exists external scope, that causes a variable to be visible not only in the same source file but in all other files that will be linked together. 2.9. Constants: Literals. A constant is any expression that has a fixed value. They can be divided in Integer Numbers, Floating-Point Numbers, Characters and Strings. • Integer Numbers 1776 707 -273 They are numerical constants that identify integer decimal numbers. Notice that to express a numerical constant we do not need to write quotes (") nor any special character. There is no doubt that it is a constant: whenever we write 1776 in a program we will be referring to the value 1776. In addition to decimal numbers (those that all of us already know) C++ allows the use as literal constants of octal numbers (base 8) and hexadecimal numbers (base 16). If we want to express an octal number we must precede it with a 0 character (zero character). And to express a hexadecimal number we have to precede it with the characters 0x (zero, x). For example, the following literal constants are all equivalent to each other: // decimal 75 // octal 0113 0x4b // hexadecimal All of them represent the same number: 75 (seventy five) expressed as a radix-10 number, octal and hexdecimal, respectively. 30 • Floating Point Numbers:-They express numbers with decimals and/or exponents. They can include a decimal point, an e character (that expresses "by ten at the Xth height", where X is the following integer value) or both. 3.14159 // 3.14159 6.02e23 // 6.02 x 1023 1.6e-19 // 1.6 x 10-19 3.0 // 3.0 These are four valid numbers with decimals expressed in C++. The first number is PI, the second one is the number of Avogadro, the third is the electric charge of an electron (an extremely small number) -all of them approximated- and the last one is the number 3 expressed as a floating point numeric literal. • Characters and strings:-There also exist non-numerical constants, like: 'z' 'p' "Hello world" "How do you do?" The first two expressions represent single characters, and the following two represent strings of several characters. Notice that to represent a single character we enclose it between single quotes (') and to express a string of more than one character we enclose them between double quotes ("). When writing both single characters and strings of characters in a constant way, it is necessary to put the quotation marks to distinguish them from possible variable identifiers or reserved words. Notice this: x 'x' x refers to variable x, whereas 'x' refers to the character constant 'x'. Character constants and string constants have certain peculiarities, like the escape codes. These are special characters that cannot be expressed otherwise in the source code of a program, like newline (\n) or tab (\t). All of them are preceded by an inverted slash (\). Here you have a list of such escape codes: \n Newline \r carriage return \t Tabulation \v vertical tabulation \b Backspace \f page feed \a alert (beep) \' single quotes (') \" double quotes (") \? question (?) \\ inverted slash (\) For example: 31 '\n' '\t' "Left \t Right" "one\ntwo\nthree" Additionally, you can express any character by its numerical ASCII code by writing an inverted slash bar character (\) followed by the ASCII code expressed as an octal (radix-8) or hexadecimal (radix-16) number. In the first case (octal) the number must immediately follow the inverted slash (for example \23 or \40), in the second case (hexacedimal), you must put an x character before the number (for example \x20 or \x4A). Constants of string of characters can be extended by more than a single code line if each code line ends with an inverted slash (\): "string expressed in \ two lines" You can also concatenate several string constants separating them by one or several blankspaces, tabulators, newline or any other valid blank character: "we form" "a single" "string" "of characters" • Defined constants (#define) You can define your own names for constants that you use quite often without having to resort to variables, simply by using the #define preprocessor directive. This is its format: #define identifier value For example: #define PI 3.14159265 #define NEWLINE '\n' #define WIDTH 100 They define three new constants. Once they are declared, you are able to use them in the rest of the code as any if they were any other constant, for example: circle = 2 * PI * r; cout << NEWLINE; In fact the only thing that the compiler does when it finds #define directives is to replace literally any occurrence of the them (in the previous example, PI, NEWLINE or WIDTH) by the code to which they have been defined (3.14159265, '\n' and 100, respectively). For this reason, #define constants are considered macro constants. The #define directive is not a code instruction, it is a directive for the preprocessor, therefore it assumes the whole line as the directive and does not require a semicolon (;) at the end of it. If you include a semicolon character (;) at the end, it will also be added when the preprocessor will substitute any occurence of the defined constant within the body of the program. Declared constants (const) 32 With the const prefix you can declare constants with a specific type exactly as you would do with a variable: const int width = 100; const char tab = '\t'; const zip = 12440; In case that the type was not specified (as in the last example) the compiler assumes that it is type int. 33 UNIT 3 OPERATOR AND CONTROL STRUCTURES Contents 3.1 3.2 3.3 3.4 Types of Operators. Priority of Operators. Communication through console. 3.3.1 Output 3.3.2 input Control Structures. 3.4.1 Conditional structure 3.4.2 Repetitive structures or loops 3.4.3 Bifurcation of control and jumps 3.4.4 The selective Structure: switch Introduction Once we know of the existence of variables and constants we can begin to operate with them. For that purpose, C++ provides the operators, which in this language are a set of keywords and signs that are not part of the alphabet but are available in all keyboards. It is important to know them since they are the basis of the C++ language. 3.1 Different types of operators • Assignation (=). The assignation operator serves to assign a value to a variable. a = 5; Assigns the integer value 5 to variable a. The part at the left of the = operator is known as lvalue (left value) and the right one as rvalue (right value). lvalue must always be a variable whereas the right side can be either a constant, a variable, the result of an operation or any combination of them. It is necessary to emphasize that the assignation operation always takes place from right to left and never at the inverse. a = b; 34 assigns to variable a (lvalue) the value that contains variable b (rvalue) independently of the value that was stored in a at that moment. Consider also that we are only assigning the value of b to a and that a later change of b would not affect the new value of a. For example, if we take int a, b; a = 10; b = 4; a = b; b = 7; this code (with the evolution of the variables' content in green color): // a:? b:? // a:10 b:? // a:10 b:4 // a:4 b:4 // a:4 b:7 Will give us the result that the value contained in a is 4 and the one contained in b is 7. The final modification of b has not affected a, although before we have declared a = b; (right-to-left rule). A property that C++ has over other programming languages is that the assignation operation can be used as the rvalue (or part of an rvalue) for another assignation. For example: a = 2 + (b = 5); is equivalent to: b = 5; a = 2 + b; That means: first assign 5 to variable b and then assign to a the value 2 plus the result of the previous assignation of b (that is 5), leaving a with a final value of 7. Thus, the following expression is also valid in C++: a = b = c = 5; Assigns 5 to the three variables a, b and c. • Arithmetic operators ( +, -, *, /, % ) The five arithmetical operations supported by the language are: + addition - subtraction * multiplication / division % module Operations of addition, subtraction, multiplication and division should not suppose an understanding challenge for you since they literally correspond with their respective mathematical operators. The only one that may not be known by you is the module, specified with the percentage sign (%). Module is the operation that gives the remainder of a division of two integer values. For example, 35 if we write a = 11 % 3;, the variable a will contain 2 as the result since 2 is the remainder from dividing 11 between 3. • Compound assignation operators (+=, -=, *=, /=, %=, >>=, <<=, &=, ^=, |=) A feature of assignation in C++ that contributes to its fame of sparing language when writing are the compound assignation operators (+=, -=, *= and /= among others), which allow to modify the value of a variable with one of the basic operators: value += increase; is equivalent to value = value + increase; a -= 5; is equivalent to a = a - 5; a /= b; is equivalent to a = a / b; price *= units + 1; is equivalent to price = price * (units + 1); and the same for all other operations. • Increase and decrease. Another example of saving language when writing code are the increase operator (++) and the decrease operator (--). They increase or reduce by 1 the value stored in a variable. They are equivalent to +=1 and to -=1, respectively. Thus: a++; a+=1; a=a+1; are all equivalent in its functionality: the three increase by 1 the value of a. Its existence is because in the first C compilers the three previous expressions produced different executable code according to which one was used. Nowadays this type of code optimization is generally done automatically by the compiler. A characteristic of this operator is that it can be used both as a prefix or as a suffix. That means it can be written before the variable identifier (++a) or after (a++). Although in simple expressions like a++ or ++a they have exactly the same meaning, in other operations in which the result of the increase or decrease operation is evaluated as another expression they may have an important difference in their meaning: In case that the increase operator is used as a prefix (++a) the value is increased before the expression is evaluated and therefore the increased value is considered in the expression; in case that it is used as a suffix (a++) the value stored in a is increased after being evaluated and therefore the value stored before the increase operation is evaluated in the expression. Notice the difference: Example 1 B=3; A=++B; // A is 4, B is 4 Example 2 B=3; A=B++; // A is 3, B is 4 36 In Example 1, B is increased before its value is copied to A. While in Example 2, the value of B is copied to A and B is later increased. • Relational operators ( ==, !=, >, <, >=, <= ) In order to evaluate a comparison between two expressions we can use the Relational operators. As specified by the ANSI-C++ standard, the result of a relational operation is a bool value that can only be true or false, according to the result of the comparison. We may want to compare two expressions, for example, to know if they are equal or if one is greater than the other. Here is a list of the relational operators that can be performed in C++: == Equal != Different > Greater than < Less than >= Greater or equal than <= Less or equal than Here you have some examples: (7 == 5) would return false. (5 > 4) would return true. (3 != 2) would return true. (6 >= 6) would return true. (5 < 5) would return false. of course, instead of using only numberic constants, we can use any valid expression, including variables. Suppose that a=2, b=3 and c=6, (a == 5) would return false. (a*b >= c) would return true since (2*3 >= 6) is it. (b+4 > a*c) would return false since (3+4 > 2*6) is it. ((b=2) == a) would return true. Be aware. Operator = (one equal sign) is not the same as operator == (two equal signs), the first is an assignation operator (assigns the right side of the expression to the variable in the left) and the other (==) is a relational operator of equality that compares whether both expressions in the two sides of the operator are equal to each other. Thus, in the last expression ((b=2) == a), we first assigned the value 2 to b and then we compared it to a, that also stores value 2, so the result of the operation is true. In many compilers previous to the publication of the ANSI-C++ standard, as well as in the C language, the relational operations did not return a bool value true or false, rather they returned an int as result with a value of 0 in order to represent "false" and a value different from 0 (generally 1) to represent "true". For more information, or if your compiler does not support the bool type, consult the section bool type. 37 • Logic operators ( !, &&, || ). Operator ! is equivalent to boolean operation NOT, it has only one operand, located at its right, and the only thing that it does is to invert the value of it, producing false if its operand is true and true if its operand is false. It is like saying that it returns the opposite result of evaluating its operand. For example: !(5 == 5) returns false because the expression at its right (5 == 5) would be true. !(6 <= 4) returns true because (6 <= 4) would be false. !true returns false. !false returns true. Logic operators && and || are used when evaluating two expressions to obtain a single result. They correspond with boolean logic operations AND and OR respectively. The result of them depends on the relation between its two operands: First Second result result Operand Operand a && b a || b a b true true true true true false false true false true false true false false false false For example: ( (5 == 5) && (3 > 6) ) returns false ( true && false ). ( (5 == 5) || (3 > 6)) returns true ( true || false ). • Conditional operator ( ? ). The conditional operator evaluates an expression and returns a different value according to the evaluated expression, depending on whether it is true or false. Its format is: condition ? result1 : result2 if condition is true the expression will return result1, if not it will return result2. 7==5 ? 4 : 3 returns 3 since 7 is not equal to 5. 7==5+2 ? 4 : 3 returns 4 since 7 is equal to 5+2. 5>3 ? a : b returns a, since 5 is greater than 3. a>b ? a : b returns the greater one, a or b. Bitwise Operators ( &, |, ^, ~, <<, >> ). 38 Bitwise operators modify variables considering the bits that represent the values that they store, that means, their binary representation. op asm Description & AND Logical AND | OR Logical OR ^ XOR Logical exclusive OR ~ NOT Complement to one (bit inversion) << SHL Shift Left >> SHR Shift Right For more information about binary numbers and bitwise operations, consult Boolean logic. • Explicit type casting operators Type casting operators allows you to convert a datum of a given type to another. There are several ways to do this in C++, the most popular one, compatible with the C language is to precede the expression to be converted by the new type enclosed between parenthesis (): int i; float f = 3.14; i = (int) f; The previous code converts the float number 3.14 to an integer value (3). Here, the type casting operator was (int). Another way to do the same thing in C++ is using the constructor form: preceding the expression to be converted by the type and enclosing the expression between parentheses: i = int ( f ); Both ways of type casting are valid in C++. And additionally ANSI-C++ added new type casting operators more specific for object oriented programming. sizeof() This operator accepts one parameter, that can be either a variable type or a variable itself and returns the size in bytes of that type or object: a = sizeof (char); This will return 1 to a because char is a one byte long type. The value returned by sizeof is a constant, so it is always determined before program execution. Other operators Later in the tutorial we will see a few more operators, like the ones referring to pointers or the specifics for object-oriented programming. Each one is treated in its respective section. 39 3.2 Priority of operators When making complex expressions with several operands, we may have some doubts about which operand is evaluated first and which later. For example, in this expression: a=5+7%2 we may doubt if it really means: a = 5 + (7 % 2) with result 6, or a = (5 + 7) % 2 with result 0 The correct answer is the first of the two expressions, with a result of 6. There is an established order with the priority of each operator, and not only the arithmetic ones (those whose preference we may already know from mathematics) but for all the operators which can appear in C++. From greatest to lowest priority, the priority order is as follows: Priority Operator Description Associativity 1 :: Scope Left 2 () [ ] -> . sizeof Left ++ -- increment/decrement ~ Complement to one (bitwise) ! Unary NOT &* Reference and Dereference (pointers) (type) Type casting +- Unary less sign 4 */% arithmetical operations Left 5 +- arithmetical operations Left 6 << >> bit shifting (bitwise) Left 7 < <= > >= Relational operators Left 8 == != Relational operators Left 9 &^| Bitwise operators Left 10 && || Logic operators Left 11 ?: Conditional Right 12 = += -= *= /= %= Assignation >>= <<= &= ^= |= Right 13 , Left 3 Right Comma, Separator Associativity defines -in the case that there are several operators of the same priority level- which one must be evaluated first, the rightmost one or the leftmost one. 40 All these precedence levels for operators can be manipulated or become more legible using parenthesis signs ( and ), as in this example: a = 5 + 7 % 2; might be written as: a = 5 + (7 % 2); or a = (5 + 7) % 2; According to the operation that we wanted to perform. So if you want to write a complicated expression and you are not sure of the precedence levels, always include parenthesis. It will probably also be more legible code. 3.3 Communication Through Console The console is the basic interface of computers, normally it is the set composed of the keyboard and the screen. The keyboard is generally the standard input device and the screen the standard Output device. In the iostream C++ library, standard input and output operations for a program are supported by two data streams: cin for input and cout for output. Additionally, cerr and clog have also been implemented - these are two output streams specially designed to show error messages. They can be redirected to the standard output or to a log file. Therefore cout (the standard output stream) is normally directed to the screen and cin (the standard input stream) is normally assigned to the keyboard. By handling these two streams you will be able to interact with the user in your programs since you will be able to show messages on the screen and receive his/her input from the keyboard. 3.3.1 Output (cout) The cout stream is used in conjunction with the overloaded operator << (a pair of "less than" signs). cout << "Output sentence"; // prints Output sentence on screen // prints number 120 on screen cout << 120; // prints the content of variable x on screen cout << x; The << operator is known as insertion operator since it inserts the data that follows it into the stream that precedes it. In the examples above it inserted the constant string Output sentence, the numerical constant 120 and the variable x into the output stream cout. Notice that the first of the two sentences is enclosed between double quotes (") because it is a string of characters. Whenever we want to use constant strings of characters we must enclose them between double quotes (") so that they can be clearly distinguished from variables. For example, these two sentences are very different: cout << "Hello"; // prints Hello on screen 41 cout << Hello; // prints the content of Hello variable on screen The insertion operator (<<) may be used more than once in a same sentence: cout << "Hello, " << "I am " << "a C++ sentence"; This last sentence would print the message Hello, I am a C++ sentence on the screen. The utility of repeating the insertion operator (<<) is demonstrated when we want to print out a combination of variables and constants or more than one variable: cout << "Hello, I am " << age << " years old and my zipcode is " << zipcode; If we suppose that variable age contains the number 24 and the variable zipcode contains 90064 the output of the previous sentence would be: Hello, I am 24 years old and my zipcode is 90064 It is important to notice that cout does not add a line break after its output unless we explicitly indicate it, therefore, the following sentences: cout << "This is a sentence."; cout << "This is another sentence."; will be shown followed in screen: This is a sentence.This is another sentence. even if we have written them in two different calls to cout. So, in order to perform a line break on output we must explicitly order it by inserting a new-line character, that in C++ can be written as \n: cout << "First sentence.\n "; cout << "Second sentence.\nThird sentence."; produces the following output: First sentence. Second sentence. Third sentence. Additionally, to add a new-line, you may also use the endl manipulator. For example: cout << "First sentence." << endl; cout << "Second sentence." << endl; would print out: First sentence. Second sentence. 42 The endl manipulator has a special behavior when it is used with buffered streams: they are flushed. But anyway cout is unbuffered by default. You may use either the \n escape character or the endl manipulator in order to specify a line jump to cout. Notice the differences of use shown earlier. 3.3.2 Input (cin). Handling the standard input in C++ is done by applying the overloaded operator of extraction (>>) on the cin stream. This must be followed by the variable that will store the data that is going to be read. For example: int age; cin >> age; Declares the variable age as an int and then waits for an input from cin (keyborad) in order to store it in this integer variable. cin can only process the input from the keyboard once the RETURN key has been pressed. Therefore, even if you request a single character cin will not process the input until the user presses RETURN once the character has been introduced. You must always consider the type of the variable that you are using as a container with cin extraction. If you request an integer you will get an integer, if you request a character you will get a character and if you request a string of characters you will get a string of characters. // i/o example #include <iostream.h> Please enter an integer value: 702 The value you entered is 702 and its double is 1404. int main () { int i; cout << "Please enter an integer value: "; cin >> i; cout << "The value you entered is " << i; cout << " and its double is " << i*2 << ".\n"; return 0; } The user of a program may be one of the reasons that provoke errors even in the simplest programs that use cin (like the one we have just seen). Since if you request an integer value and the user introduces a name (which is a string of characters), the result may cause your program to misoperate since it is not what we were expecting from the user. So when you use the data input provided by cin you will have to trust that the user of your program will be totally cooperative and that he will not introduce his name when an interger value is requested. Farther ahead, when we will see how to use strings of characters we will see possible solutions for the errors that can be caused by this type of user input. 43 You can also use cin to request more than one datum input from the user: cin >> a >> b; is equivalent to: cin >> a; cin >> b; In both cases the user must give two data, one for variable a and another for variable b that may be separated by any valid blank separator: a space, a tab character or a newline. 3.4 Control Structures A program is usually not limited to a linear sequence of instructions. During its process it may bifurcate, repeat code or take decisions. For that purpose, C++ provides control structures that serve to specify what has to be done to perform our program. With the introduction of control sequences we are going to have to introduce a new concept: the block of instructions. A block of instructions is a group of instructions separated by semicolons (;) but grouped in a block delimited by curly bracket signs: { and }. Most of the control structures that we will see in this section allow a generic statement as a parameter, this refers to either a single instruction or a block of instructions, as we want. If we want the statement to be a single instruction we do not need to enclose it between curly-brackets ({}). If we want the statement to be more than a single instruction we must enclose them between curly brackets ({}) forming a block of instructions. 3.4.1 Conditional structure: if and else It is used to execute an instruction or block of instructions only if a condition is fulfilled. Its form is: if (condition) statement where condition is the expression that is being evaluated. If this condition is true, statement is executed. If it is false, statement is ignored (not executed) and the program continues on the next instruction after the conditional structure. For example, the following code fragment prints out x is 100 only if the value stored in variable x is indeed 100: if (x == 100) cout << "x is 100"; If we want more than a single instruction to be executed in case that condition is true we can specify a block of instructions using curly brackets { }: if (x == 100) { 44 cout << "x is "; cout << x; } We can additionally specify what we want that happens if the condition is not fulfilled by using the keyword else. Its form used in conjunction with if is: if (condition) statement1 else statement2 For example: if (x == 100) cout << "x is 100"; else cout << "x is not 100"; prints out on the screen x is 100 if indeed x is worth 100, but if it is not -and only if not- it prints out x is not 100. The if + else structures can be concatenated with the intention of verifying a range of values. The following example shows its use telling if the present value stored in x is positive, negative or none of the previous, that is to say, equal to zero. if (x > 0) cout << "x is positive"; else if (x < 0) cout << "x is negative"; else cout << "x is 0"; Remember that in case we want more than a single instruction to be executed, we must group them in a block of instructions by using curly brackets { }. 3.4.2 Repetitive structures or loops Loops have as objective to repeat a statement a certain number of times or while a condition is fulfilled. The while loop. Its format is: while (expression) statement And its function is simply to repeat statement while expression is true. For example, we are going to make a program to count down using a while loop: // custom countdown using while #include <iostream.h> int main () { int n; Enter the starting number > 8 8, 7, 6, 5, 4, 3, 2, 1, FIRE! 45 cout << "Enter the starting number > "; cin >> n; while (n>0) { cout << n << ", "; --n; } cout << "FIRE!"; return 0; } When the program starts the user is prompted to insert a starting number for the countdown. Then the while loop begins, if the value entered by the user fulfills the condition n>0 (that n be greater than 0 ), the block of instructions that follows will execute an indefinite number of times while the condition (n>0) remains true. All the process in the program above can be interpreted according to the following script: beginning in main: • 1. User assigns a value to n. • 2. The while instruction checks if (n>0). At this point there are two possibilities: o true: execute statement (step 3,) o false: jump statement. The program follows in step 5.. • 3. Execute statement: cout << n << ", "; --n; (prints out n on screen and decreases n by 1). • 4. End of block. Return Automatically to step 2. • 5. Continue the program after the block: print out FIRE! and end of program. We must consider that the loop has to end at some point, therefore, within the block of instructions (loop's statement) we must provide some method that forces condition to become false at some moment, otherwise the loop will continue looping forever. In this case we have included -n; that causes the condition to become false after some loop repetitions: when n becomes 0, that is where our countdown ends. Of course this is such a simple action for our computer that the whole countdown is performed instantly without practical delay between numbers. The do-while loop. Format: do statement while (condition); Its functionality is exactly the same as the while loop except that condition in the do-while is evaluated after the execution of statement instead of before, granting at least one execution of statement even if condition is never fulfilled. For example, the following program echoes any number you enter until you enter 0. // number echoer #include <iostream.h> Enter number (0 to end): 12345 You entered: 12345 46 int main () { unsigned long n; do { cout << "Enter number (0 to end): "; cin >> n; cout << "You entered: " << n << "\n"; } while (n != 0); return 0; } Enter number (0 to end): 160277 You entered: 160277 Enter number (0 to end): 0 You entered: 0 The do-while loop is usually used when the condition that has to determine its end is determined within the loop statement, like in the previous case, where the user input within the block of intructions is what determines the end of the loop. If you never enter the 0 value in the previous example the loop will never end. The for loop. Its format is: for (initialization; condition; increase) statement; And its main function is to repeat statement while condition remains true, like the while loop. But in addition, for provides places to specify an initialization instruction and an increase instruction. So this loop is specially designed to perform a repetitive action with a counter. It works the following way: 1. Initialization is executed. Generally it is an initial value setting for a counter varible. This is executed only once. 2. Condition is checked, if it is true the loop continues, otherwise the loop finishes and statement is skipped. 3. Statement is executed. As usual, it can be either a single instruction or a block of instructions enclosed within curly brackets { }. 4. Finally, whatever is specified in the increase field is executed and the loop gets back to step 2. Here is an example of countdown using a for loop. // countdown using a for loop #include <iostream.h> int main () { for (int n=10; n>0; n--) { cout << n << ", "; } cout << "FIRE!"; return 0; } 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, FIRE! 47 The initialization and increase fields are optional. They can be avoided but not the semicolon signs among them. For example we could write: for (;n<10;) if we want to specify no initialization and no increase; or for (;n<10;n++) if we want to include an increase field but not an initialization. Optionally, using the comma operator (,) we can specify more than one instruction in any of the fields included in a for loop, like in initialization, for example. The comma operator (,) is an instruction separator, it serves to separate more than one instruction where only one instruction is generally expected. For example, suppose that we wanted to intialize more than one variable in our loop: for ( n=0, i=100 ; n!=i ; n++, i-- ) { // whatever here... } This loop will execute 50 times if neither n nor i are modified within the loop: n starts with 0 and i with 100, the condition is (n!=i) (that n be not equal to i). Beacuse n is increased by one and i decreased by one, the loop's condition will become false after the 50th loop, when both n and i will be equal to 50. 3.4.3 Bifurcation of control and jumps The break instruction. Using break we can leave a loop even if the condition for its end is not fulfilled. It can be used to end an infinite loop, or to force it to end before its natural end. For example, we are going to stop the count down before it naturally finishes (an engine failure maybe): // break loop example #include <iostream.h> int main () { int n; for (n=10; n>0; n--) { cout << n << ", "; if (n==3) { cout << "countdown aborted!"; break; } } return 0; } 10, 9, 8, 7, 6, 5, 4, 3, countdown aborted! The continue instruction. 48 The continue instruction causes the program to skip the rest of the loop in the present iteration as if the end of the statement block would have been reached, causing it to jump to the following iteration. For example, we are going to skip the number 5 in our countdown: // break loop example #include <iostream.h> int main () { for (int n=10; n>0; n--) { if (n==5) continue; cout << n << ", "; } cout << "FIRE!"; return 0; } 10, 9, 8, 7, 6, 4, 3, 2, 1, FIRE! The goto instruction. It allows making an absolute jump to another point in the program. You should use this feature carefully since its execution ignores any type of nesting limitation. The destination point is identified by a label, which is then used as an argument for the goto instruction. A label is made of a valid identifier followed by a colon (:). This instruction does not have a concrete utility in structured or object oriented programming aside from those that low-level programming fans may find for it. For example, here is our countdown loop using goto: // goto loop example #include <iostream.h> int main () { int n=10; loop: cout << n << ", "; n--; if (n>0) goto loop; cout << "FIRE!"; return 0; } 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, FIRE! The exit function. exit is a function defined in cstdlib (stdlib.h) library. The purpose of exit is to terminate the running program with an specific exit code. Its prototype is: void exit (int exit code); The exit code is used by some operating systems and may be used by calling programs. By convention, an exit code of 0 means that the program finished normally and any other value means an error happened. 49 4.4.4 The selective Structure: switch. The syntax of the switch instruction is a bit peculiar. Its objective is to check several possible constant values for an expression, something similar to what we did at the beginning of this section with the linking of several if and else if sentences. Its form is the following: switch (expression) { case constant1: block of instructions 1 break; case constant2: block of instructions 2 break; . . . default: default block of instructions } It works in the following way: switch evaluates expression and checks if it is equivalent to constant1, if it is, it executes block of instructions 1 until it finds the break keyword, then the program will jump to the end of the switch selective structure. If expression was not equal to constant1 it will check if expression is equivalent to constant2. If it is, it will execute block of instructions 2 until it finds the break keyword. Finally, if the value of expression has not matched any of the previously specified constants (you may specify as many case sentences as values you want to check), the program will execute the instructions included in the default: section, if this one exists, since it is optional. Both of the following code fragments are equivalent: if-else equivalent switch example Switch (x) { case 1: cout << "x is 1"; break; case 2: cout << "x is 2"; break; default: cout << "value of x unknown"; } if (x == 1) { cout << "x is 1"; } else if (x == 2) { cout << "x is 2"; } else { cout << "value of x unknown"; } I have commented before that the syntax of the switch instruction is a bit peculiar. Notice the inclusion of the break instructions at the end of each block. This is necessary because if, for example, we did not include it after block of instructions 1 the program would not jump to the end of the switch selective block (}) and it would continue executing the rest of the blocks of instructions until the first appearance of the break instruction or the end of the switch selective block. This makes it unnecessary to include curly brackets { } in each of the cases, and it can also be useful to execute the same block of instructions for different possible values for the expression evaluated. For example: 50 Switch (x) { case 1: case 2: case 3: cout << "x is 1, 2 or 3"; break; default: cout << "x is not 1, 2 nor 3"; } Notice that switch can only be used to compare an expression with different constants. Therefore we cannot put variables (case (n*2):) or ranges (case (1..3):) because they are not valid constants. 51 UNIT 4 ARRAY AND POINTER Contents 4.1 Arrays. 4.1.1 Initializing arrays 4.1.2 Access to the values of an Array 4.1.3 Multidimensional Arrays 4.1.4 Arrays as parameters 4.2 Strings 4.2.1 1Initialization of strings 4.2.2 Assigning values to strings 4.2.3 Converting strings to other types 4.2.4 Functions to manipulate strings 4.3 Pointers. 4.4 Pointers and arrays 4.5 Dynamic Memory. 4.1. Arrays Arrays are a series of elements (variables) of the same type placed consecutively in memory that can be individually referenced by adding an index to a unique name. That means that, for example, we can store 5 values of type int without having to declare 5 different variables each with a different identifier. Instead, using an array we can store 5 different values of the same type, int for example, with a unique identifier. For example, an array to contain 5 integer values of type int called billy could be represented this way: where each blank panel represents an element of the array, that in this case are integer values of type int. These are numbered from 0 to 4 since in arrays the first index is always 0, independently of its length . Like any other variable, an array must be declared before it is used. A typical declaration for an array in C++ is: type name [elements]; where type is a valid object type (int, float...), name is a valid variable identifier and the elements field, that is enclosed within brackets [], specifies how many of these elements the array contains. Therefore, to declare billy as shown above it is as simple as the following sentence: int billy [5]; 52 NOTE: The elements field within brackets [] when declaring an array must be a constant value, since arrays are blocks of static memory of a given size and the compiler must be able to determine exactly how much memory it must assign to the array before any instruction is considered. 4.1.1 Initializing arrays. When declaring an array of local scope (within a function), if we do not specify otherwise, it will not be initialized, so its content is undetermined until we store some values in it. If we declare a global array (outside any function) its content will be initialized with all its elements filled with zeros. Thus, if in the global scope we declare: int billy [5]; every element of billy will be set initialy to 0: But additionally, when we declare an Array, we have the possibility to assign initial values to each one of its elements using curly brackets { }. For example: int billy [5] = { 16, 2, 77, 40, 12071 }; this declaration would have created an array like the following one: The number of elements in the array that we initialized within curly brackets { } must match the length in elements that we declared for the array enclosed within square brackets [ ]. For example, in the example of the billy array we have declared that it had 5 elements and in the list of initial values within curly brackets { } we have set 5 different values, one for each element. Because this can be considered useless repetition, C++ includes the possibility of leaving the brackets empty [ ] and the size of the Array will be defined by the number of values included between curly brackets { }: int billy [] = { 16, 2, 77, 40, 12071 }; 4.1.2 Access to the values of an Array In any point of the program in which the array is visible we can access individually anyone of its values for reading or modifying as if it was a normal variable. The format is the following: name[index] 53 Following the previous examples in which billy had 5 elements and each of those elements was of type int, the name which we can use to refer to each element is the following: For example, to store the value 75 in the third element of billy a suitable sentence would be: billy[2] = 75; and, for example, to pass the value of the third element of billy to the variable a, we could write: a = billy[2]; Therefore, for all purposes, the expression billy[2] is like any other variable of type int. Notice that the third element of billy is specified billy[2], since first is billy[0], the second is billy[1], and therefore, third is billy[2]. By this same reason, its last element is billy[4]. Since if we wrote billy[5], we would be acceding to the sixth element of billy and therefore exceeding the size of the array. In C++ it is perfectly valid to exceed the valid range of indices for an Array, which can create problems since they do not cause compilation errors but they can cause unexpected results or serious errors during execution. The reason why this is allowed will be seen farther ahead when we begin to use pointers. At this point it is important to be able to clearly distinguish between the two uses that brackets [ ] have related to arrays. They perform two differt tasks: one is to set the size of arrays when declaring them; and second is to specify indices for a concrete array element when referring to it. We must simply take care not to confuse these two possible uses of brackets [ ] with arrays: int billy[5]; // declaration of a new Array (begins with a type name) // access to an element of the Array. billy[2] = 75; Other valid operations with arrays: billy[0] = a; billy[a] = 75; b = billy [a+2]; billy[billy[a]] = billy[2] + 5; // arrays example #include <iostream.h> 12206 int billy [] = {16, 2, 77, 40, 12071}; int n, result=0; int main () { for ( n=0 ; n<5 ; n++ ) { result += billy[n]; } 54 cout << result; return 0; } 4.1.3 Multidimensional Arrays Multidimensional arrays can be described as arrays of arrays. For example, a bidimensional array can be imagined as a bidimensional table of a uniform concrete data type. jimmy represents a bidimensional array of 3 per 5 values of type int. The way to declare this array would be: int jimmy [3][5]; and, for example, the way to reference the second element vertically and fourth horizontally in an expression would be: jimmy[1][3] (remember that array indices always begin by 0). Multidimensional arrays are not limited to two indices (two dimensions). They can contain as many indices as needed, although it is rare to have to represent more than 3 dimensions. Just consider the amount of memory that an array with many indices may need. For example: char century [100][365][24][60][60]; assigns a char for each second contained in a century, that is more than 3 billion chars! This would consume about 3000 megabytes of RAM memory if we could declare it. Multidimensional arrays are nothing more than an abstraction, since we can obtain the same results with a simple array just by putting a factor between its indices: int jimmy [3][5]; is equivalent to int jimmy [15]; (3 * 5 = 15) 55 With the only difference that the compiler remembers for us the depth of each imaginary dimension. Serve as example these two pieces of code, with exactly the same result, one using bidimensional arrays and the other using only simple arrays: // multidimensional array #include <iostream.h> // pseudo-multidimensional array #include <iostream.h> #define WIDTH 5 #define HEIGHT 3 #define WIDTH 5 #define HEIGHT 3 int jimmy [HEIGHT][WIDTH]; int n,m; int jimmy [HEIGHT * WIDTH]; int n,m; int main () { for (n=0;n<HEIGHT;n++) for (m=0;m<WIDTH;m++) { jimmy[n][m]=(n+1)*(m+1); } return 0; } int main () { for (n=0;n<HEIGHT;n++) for (m=0;m<WIDTH;m++) { jimmy[n * WIDTH + m]=(n+1)*(m+1); } return 0; } none of the programs above produce any output on the screen, but both assign values to the memory block called jimmy in the following way: We have used defined constants (#define) to simplify possible future modifications of the program, for example, in case that we decided to enlarge the array to a height of 4 instead of 3 it could be done by changing the line: #define HEIGHT 3 to #define HEIGHT 4 with no need to make any other modifications to the program. 4.1.4 Arrays as parameters At some moment we may need to pass an array to a function as a parameter. In C++ is not possible to pass by value a complete block of memory as a parameter to a function, even if it is ordered as an array, but it is allowed to pass its address. This has almost the same practical effect and it is a much faster and more efficient operation. 56 In order to admit arrays as parameters the only thing that we must do when declaring the function is to specify in the argument the base type for the array, an identifier and a pair of void brackets []. For example, the following function: void procedure (int arg[]) admits a parameter of type "Array of int" called arg. In order to pass to this function an array declared as: int myarray [40]; it would be enough to write a call like this: procedure (myarray); Here you have a complete example: // arrays as parameters #include <iostream.h> 5 10 15 2 4 6 8 10 void printarray (int arg[], int length) { for (int n=0; n<length; n++) cout << arg[n] << " "; cout << "\n"; } int main () { int firstarray[] = {5, 10, 15}; int secondarray[] = {2, 4, 6, 8, 10}; printarray (firstarray,3); printarray (secondarray,5); return 0; } As you can see, the first argument (int arg[]) admits any array of type int, wathever its length is. For that reason we have included a second parameter that tells the function the length of each array that we pass to it as the first parameter. This allows the for loop that prints out the array to know the range to check in the passed array. In a function declaration is also possible to include multidimensional arrays. The format for a tridimensional array is: base_type[][depth][depth] for example, a function with a multidimensional array as argument could be: void procedure (int myarray[][3][4]) notice that the first brackets [] are void and the following ones are not. This must always be thus because the compiler must be able to determine within the function which is the depth of each additional dimension. 57 Arrays, both simple or multidimensional, passed as function parameters are a quite common source of errors for less experienced programmers. 4.2 Strings In all programs seen until now, we have used only numerical variables, used to express numbers exclusively. But in addition to numerical variables there also exist strings of characters, that allow us to represent successions of characters, like words, sentences, names, texts, et cetera. Until now we have only used them as constants, but we have never considered variables able to contain them. In C++ there is no specific elemental variable type to store strings of characters. In order to fulfill this feature we can use arrays of type char, which are successions of char elements. Remember that this data type (char) is the one used to store a single character, for that reason arrays of them are generally used to make strings of single characters. For example, the following array (or string of characters): char jenny [20]; can store a string up to 20 characters long. You may imagine it thus: This maximum size of 20 characters is not required to always be fully used. For example, jenny could store at some moment in a program either the string of characters "Hello" or the string "Merry christmas". Therefore, since the array of characters can store shorter strings than its total length, a convention has been reached to end the valid content of a string with a null character, whose constant can be written 0 or '\0'. We could represent jenny (an array of 20 elements of type char) storing the strings of characters "Hello" and "Merry Christmas" in the following way: Notice how after the valid content a null character ('\0') it is included in order to indicate the end of the string. The panels in gray color represent indeterminate values. 4.2.1 Initialization of strings Because strings of characters are ordinary arrays they fulfill all their same rules. For example, if we want to initialize a string of characters with predetermined values we can do it just like any other array: char mystring[] = { 'H', 'e', 'l', 'l', 'o', '\0' }; In this case we would have declared a string of characters (array) of 6 elements of type char initialized with the characters that compose Hello plus a null character '\0'. 58 Nevertheless, strings of characters have an additional way to initialize their values: using constant strings. In the expressions we have used in examples in previous chapters constants that represented entire strings of characters have already appeared several times. These are specified enclosed between double quotes ("), for example: "the result is: " is a constant string that we have probably used on some occasion. Unlike single quotes (') which specify single character constants, double quotes (") are constants that specify a succession of characters. Strings enclosed between double quotes always have a null character ('\0') automatically appended at the end. Therefore we could initialize the string mystring with values by either of these two ways: char mystring [] = { 'H', 'e', 'l', 'l', 'o', '\0' }; char mystring [] = "Hello"; In both cases the array or string of characters mystring is declared with a size of 6 characters (elements of type char): the 5 characters that compose Hello plus a final null character ('\0') which specifies the end of the string and that, in the second case, when using double quotes (") it is automatically appended. Before going further, notice that the assignation of multiple constants like double-quoted constants (") to arrays are only valid when initializing the array, that is, at the moment when declared. Expressions within the code like: mystring = "Hello"; mystring[] = "Hello"; are not valid for arrays, like neither would be: mystring = { 'H', 'e', 'l', 'l', 'o', '\0' }; So remember: We can "assign" a multiple constant to an Array only at the moment of initializing it. The reason will be more comprehensible when you know a bit more about pointers, since then it will be clarified that an array is simply a constant pointer pointing to an allocated block of memory. And because of this constantnes, the array itself can not be assigned any value, but we can assing values to each of the elements of the array. The moment of initializing an Array it is a special case, since it is not an assignation, although the same equal sign (=) is used. Anyway, always have the rule previously underlined present. 4.2.2 Assigning values to strings Since the lvalue of an assignation can only be an element of an array and not the entire array, it would be valid to assign a string of characters to an array of char using a method like this: mystring[0] mystring[1] mystring[2] mystring[3] = = = = 'H'; 'e'; 'l'; 'l'; 59 mystring[4] = 'o'; mystring[5] = '\0'; But as you may think, this does not seem to be a very practical method. Generally for assigning values to an array, and more specifically to a string of characters, a series of functions like strcpy are used. strcpy (string copy) is defined in the cstring (string.h) library and can be called the following way: strcpy (string1, string2); This does copy the content of string2 into string1. string2 can be either an array, a pointer, or a constant string, so the following line would be a valid way to assign the constant string "Hello" to mystring: strcpy (mystring, "Hello"); For example: // setting value to string #include <iostream.h> #include <string.h> J. Soulie int main () { char szMyName [20]; strcpy (szMyName,"J. Soulie"); cout << szMyName; return 0; } Notice that we needed to include <string.h> header in order to be able to use function strcpy. Although we can always write a simple function like the following setstring with the same operation as cstring's strcpy: J. Soulie // setting value to string #include <iostream.h> void setstring (char szOut [], char szIn []) { int n=0; do { szOut[n] = szIn[n]; } while (szIn[n++] != '\0'); } int main () { char szMyName [20]; setstring (szMyName,"J. Soulie"); cout << szMyName; return 0; } 60 Another frequently used method to assign values to an array is by directly using the input stream (cin). In this case the value of the string is assigned by the user during program execution. When cin is used with strings of characters it is usually used with its getline method, that can be called following this prototype: cin.getline ( char buffer[], int length, char delimiter = ' \n'); where buffer is the address of where to store the input (like an array, for example), length is the maximum length of the buffer (the size of the array) and delimiter is the character used to determine the end of the user input, which by default - if we do not include that parameter - will be the newline character ('\n'). The following example repeats whatever you type on your keyboard. It is quite simple but serves as an example of how you can use cin.getline with strings: // cin with strings #include <iostream.h> int main () { char mybuffer [100]; cout << "What's your name? "; cin.getline (mybuffer,100); cout << "Hello " << mybuffer << ".\n"; cout << "Which is your favourite team? "; cin.getline (mybuffer,100); cout << "I like " << mybuffer << " too.\n"; return 0; } What's your name? Juan Hello Juan. Which is your favourite team? Inter Milan I like Inter Milan too. Notice how in both calls to cin.getline we used the same string identifier (mybuffer). What the program does in the second call is simply step on the previous content of buffer with the new one that is introduced. If you remember the section about communication through the console, you will remember that we used the extraction operator (>>) to receive data directly from the standard input. This method can also be used instead of cin.getline with strings of characters. For example, in our program, when we requested an input from the user we could have written: cin >> mybuffer; This would work, but this method has the following limitations that cin.getline has not: • It can only receive single words (no complete sentences) since this method uses as a delimiter any occurrence of a blank character, including spaces, tabulators, newlines and carriage returns. • It is not allowed to specify a size for the buffer. That makes your program unstable in case the user input is longer than the array that will host it. For these reasons it is recommended that whenever you require strings of characters coming from cin you use cin.getline instead of cin >>. 61 4.2.3 Converting strings to other types Due to that a string may contain representations of other data types like numbers, it might be useful to translate that content to a variable of a numeric type. For example, a string may contain "1977", but this is a sequence of 5 chars not so easily convertable to a single integer data type. The cstdlib (stdlib.h) library provides three useful functions for this purpose: • atoi: converts string to int type. • atol: converts string to long type. • atof: converts string to float type. All of these functions admit one parameter and return a value of the requested type (int, long or float). These functions combined with getline method of cin are a more reliable way to get the user input when requesting a number than the classic cin>> method: Enter price: 2.75 Enter quantity: 21 Total price: 57.75 // cin and ato* functions #include <iostream.h> #include <stdlib.h> int main () { char mybuffer [100]; float price; int quantity; cout << "Enter price: "; cin.getline (mybuffer,100); price = atof (mybuffer); cout << "Enter quantity: "; cin.getline (mybuffer,100); quantity = atoi (mybuffer); cout << "Total price: " << price*quantity; return 0; } 4.2.4 Functions to manipulate strings The cstring library (string.h) defines many functions to perform manipulation operations with Clike strings (like already explained strcpy). Here you have a brief look at the most usual: strcat: char* strcat (char* dest, const char* src); Appends src string at the end of dest string. Returns dest. strcmp: int strcmp (const char* string1, const char* string2); Compares strings string1 and string2. Returns 0 is both strings are equal. strcpy: char* strcpy (char* dest, const char* src); Copies the content of src to dest. Returns dest. strlen: size_t strlen (const char* string); 62 Returns the length of string. NOTE: char* is the same as char[] 4.3 Pointers We have already seen how variables are memory cells that we can access by an identifier. But these variables are stored in concrete places of the computer memory. For our programs, the computer memory is only a succession of 1 byte cells (the minimum size for a datum), each one with a unique address. A good simile for the computer memory can be a street in a city. On a street all houses are consecutively numbered with an unique identifier so if we talk about 27th of Sesame Street we will be able to find that place without trouble, since there must be only one house with that number and, in addition, we know that the house will be between houses 26 and 28. In the same way in which houses in a street are numbered, the operating system organizes the memory with unique and consecutive numbers, so if we talk about location 1776 in the memory, we know that there is only one location with that address and also that is between addresses 1775 and 1777. Address (dereference) operator (&). At the moment in which we declare a variable it must be stored in a concrete location in this succession of cells (the memory). We generally do not decide where the variable is to be placed fortunately that is something automatically done by the compiler and the operating system at runtime, but once the operating system has assigned an address there are some cases in which we may be interested in knowing where the variable is stored. This can be done by preceding the variable identifier by an ampersand sign (&), which literally means "address of". For example: ted = &andy; would assign to variable ted the address of variable andy, since when preceding the name of the variable andy with the ampersand (&) character we are no longer talking about the content of the variable, but about its address in memory. We are going to suppose that andy has been placed in the memory address 1776 and that we write the following: andy = 25; fred = andy; ted = &andy; the result is shown in the following diagram: 63 We have assigned to fred the content of variable andy as we have done in many other occasions in previous sections of this tutorial, but to ted we have assigned the address in memory where the operating system stores the value of andy, that we have imagined was 1776 (it can be any address, I have just invented this one). The reason is that in the allocation of ted we have preceded andy with an ampersand (&) character. The variable that stores the address of another variable (like ted in the previous example) is what we call a pointer. In C++ pointers have certain virtues and they are used very often. Farther ahead we will see how this type of variable is declared. Reference operator (*) Using a pointer we can directly access the value stored in the variable pointed by it just by preceding the pointer identifier with the reference operator asterisk (*), that can be literally translated to "value pointed by". Therefore, following with the values of the previous example, if we write: beth = *ted; (that we could read as: "beth equal to value pointed by ted") beth would take the value 25, since ted is 1776, and the value pointed by 1776 is 25. You must clearly differenciate that ted stores 1776, but *ted (with an asterisk * before) refers to the value stored in the address 1776, that is 25. Notice the difference of including or not including the reference asterisk (I have included an explanatory commentary of how each expression could be read): beth = ted; // beth equal to ted ( 1776 ) beth = *ted; // beth equal to value pointed by ted ( 25 ) Operator of address or dereference (&) It is used as a variable prefix and can be translated as "address of", thus: &variable1 can be read as "address of variable1" Operator of reference (*) It indicates that what has to be evaluated is the content pointed by the expression considered as an address. It can be translated by "value pointed by". 64 * mypointer can be read as "value pointed by mypointer". At this point, and following with the same example initiated above where: andy = 25; ted = &andy; you should be able to clearly see that all the following expressions are true: andy == 25 &andy == 1776 ted == 1776 *ted == 25 The first expression is quite clear considering that its assignation was andy=25;. The second one uses the address (or derefence) operator (&) that returns the address of the variable andy, that we imagined to be 1776. The third one is quite obvious since the second was true and the assignation of ted was ted = &andy;. The fourth expression uses the reference operator (*) that, as we have just seen, is equivalent to the value contained in the address pointed by ted, that is 25. So, after all that, you may also infer that while the address pointed by ted remains unchanged the following expression will also be true: *ted == andy Declaring variables of type pointer Due to the ability of a pointer to directly reference the value that it point to, it becomes necessary to specify which data type a pointer points to when declaring it. It is not the same to point to a char as it is to point to an int or a float type. Therefore, the declaration of pointers follows this form: type * pointer_name; where type is the type of data pointed, not the type of the pointer itself. For example: int * number; char * character; float * greatnumber; They are three declarations of pointers. Each one points to a different data type, but the three are pointers and in fact the three occupy the same amount of space in memory (the size of a pointer depends on the operating system), but the data to which they point do not occupy the same amount of space nor are of the same type, one is int, another one is char and the other one float. I emphasize that the asterisk (*) that we use when declaring a pointer means only that it is a pointer, and should not be confused with the reference operator that we have seen a bit earlier which is also written with an asterisk (*). They are simply two different tasks represented with the same sign. // my first pointer #include <iostream.h> value1==10 / value2==20 int main () { int value1 = 5, value2 = 15; 65 int * mypointer; mypointer = &value1; *mypointer = 10; mypointer = &value2; *mypointer = 20; cout << "value1==" << value1 << "/ value2==" << value2; return 0; } Notice how the values of value1 and value2 have changed indirectly. First we have assigned to mypointer the address of value1 using the deference ampersand sign (&). Then we have assigned 10 to the value pointed by mypointer, which is pointing to the address of value1, so we have modified value1 indirectly. In order that you can see that a pointer may take several different values during the same program we have repeated the process with value2 and the same pointer. Here is an example a bit more complicated: value1==10 / value2==20 // more pointers #include <iostream.h> int main () { int value1 = 5, value2 = 15; int *p1, *p2; p1 = &value1; p2 = &value2; *p1 = 10; *p2 = *p1; p1 = p2; *p1 = 20; // p1 = address of value1 // p2 = address of value2 // value pointed by p1 = 10 // value pointed by p2 = value pointed by p1 // p1 = p2 (value of pointer copied) // value pointed by p1 = 20 cout << "value1==" << value1 << "/ value2==" << value2; return 0; } I have included as comments on each line how the code can be read: ampersand (&) as "address of" and asterisk (*) as "value pointed by". Notice that there are expressions with pointers p1 and p2 with and without the asterisk. The meaning of using or not using a reference asterisk is very different: An asterisk (*) followed by the pointer refers to the place pointed by the pointer, whereas a pointer without an asterisk (*) refers to the value of the pointer itself, that is, the address of where it is pointing. Another thing that can call your attention is the line: int *p1, *p2; 66 That declares the two pointers of the previous example putting an asterisk (*) for each pointer. The reason is that the type for all the declarations of the same line is int (and not int*). The explanation is because of the level of precedence of the reference operator asterisk (*) that is the same as the declaration of types, therefore, because they are associative operators from the right, the asterisks are evaluated first than the type. We have talked about this in, although it is enough that you know clearly that -unless you include parenthesis- you will have to put an asterisk (*) before each pointer that you declare. 4.4 Pointers and arrays The concept of array is very much bound to the one of pointer. In fact, the identifier of an array is equivalent to the address of its first element, like a pointer is equivalent to the address of the first element that it points to, so in fact they are the same thing. For example, supposing these two declarations: int numbers [20]; int * p; the following allocation would be valid: p = numbers; At this point p and numbers are equivalent and they have the same properties, the only difference is that we could assign another value to the pointer p whereas numbers will always point to the first of the 20 integer numbers of type int with which it was defined. So, unlike p, that is an ordinary variable pointer, numbers is a constant pointer (indeed an array name is a constant pointer). Therefore, although the previous expression was valid, the following allocation is not: numbers = p; because numbers is an array (constant pointer), and no values can be assigned to constant identifiers. Due to the character of variables all the expressions that include pointers in the following example are perfectly valid: // more pointers #include <iostream.h> 10, 20, 30, 40, 50, int main () { int numbers[5]; int * p; p = numbers; *p = 10; p++; *p = 20; p = &numbers[2]; *p = 30; p = numbers + 3; *p = 40; p = numbers; *(p+4) = 50; for (int n=0; n<5; n++) cout << numbers[n] << ", "; 67 return 0; } In chapter "Arrays" we used bracket signs [] several times in order to specify the index of the element of the Array to which we wanted to refer. Well, the bracket signs operator [] are known as offset operators and they are equivalent to adding the number within brackets to the address of a pointer. For example, both following expressions: a[5] = 0; // a [offset of 5] = 0 *(a+5) = 0; // pointed by (a+5) = 0 are equivalent and valid either if a is a pointer or if it is an array. Pointer initialization When declaring pointers we may want to explicitly specify to which variable we want them to point, int number; int *tommy = &number; this is equivalent to: int number; int *tommy; tommy = &number; When a pointer assignation takes place we are always assigning the address where it points to, never the value pointed. You must consider that at the moment of declaring a pointer, the asterisk (*) indicates only that it is a pointer, it in no case indicates the reference operator (*). Remember, they are two different operators, although they are written with the same sign. Thus, we must take care not to confuse the previous with: int number; int *tommy; *tommy = &number; That anyway would not have much sense in this case. As in the case of arrays, the compiler allows the special case that we want to initialize the content at which the pointer points with constants at the same moment as declaring the variable pointer: char * terry = "hello"; In this case static storage is reserved for containing "hello" and a pointer to the first char of this memory block (that corresponds to 'h') is assigned to terry. If we imagine that "hello" is stored at addresses 1702 and following, the previous declaration could be outlined thus: 68 It is important to indicate that terry contains the value 1702 and not 'h' nor "hello", although 1702 points to these characters. The pointer terry points to a string of characters and can be used exactly as if it was an Array (remember that an array is just a constant pointer). For example, if our temper changed and we wanted to replace the 'o' by a '!' sign in the content pointed by terry, we could do it by any of the following two ways: terry[4] = '!'; *(terry+4) = '!'; Remember that to write terry[4] is just the same as to write *(terry+4), although the most usual expression is the first one. With either of those two expressions something like this would happen: Arithmetic of pointers To conduct arithmetical operations on pointers is a little different than to conduct them on other integer data types. To begin with, only addition and subtraction operations are allowed to be conducted, the others make no sense in the world of pointers. But both addition and subtraction have a different behavior with pointers according to the size of the data type to which they point. When we saw the different data types that exist, we saw that some occupy more or less space than others in the memory. For example, in the case of integer numbers, char occupies 1 byte, short occupies 2 bytes and long occupies 4. Let's suppose that we have 3 pointers: char *mychar; short *myshort; long *mylong; And that we know that they point to memory locations 1000, 2000 and 3000 respectively. So if we write: mychar++; myshort++; mylong++; mychar, as you may expect, would contain the value 1001. Nevertheless, myshort would contain the value 2002, and mylong would contain 3004. The reason is that when adding 1 to a pointer 69 we are making it to point to the following element of the same type with which it has been defined, and therefore the size in bytes of the type pointed is added to the pointer. This is applicable both when adding and subtracting any number to a pointer. It would happen exactly the same if we write: mychar = mychar + 1; myshort = myshort + 1; mylong = mylong + 1; It is important to warn you that both increase (++) and decrease (--) operators have a greater priority than the reference operator asterisk (*), therefore the following expressions may lead to confussion: *p++; *p++ = *q++; The first one is equivalent to *(p++) and what it does is to increase p (the address where it points to - not the value that contains). In the second, because both increase operators (++) are after the expressions to be evaluated and not before, first the value of *q is assigned to *p and then both q and p are increased by one. It is equivalent to: *p = *q; p++; q++; Like always, I recommend you use parenthesis () in order to avoid unexpected results. Pointers to pointers C++ allows the use of pointers that point to pointers, that these, in its turn, point to data. In order to do that we only need to add an asterisk (*) for each level of reference: char a; char * b; char ** c; a = 'z'; 70 b = &a; c = &b; this, supposing the randomly chosen memory locations of 7230, 8092 and 10502, could be described thus: (inside the cells there is the content of the variable; under the cells its location) The new thing in this example is variable c, which we can talk about in three different ways, each one of them would correspond to a different value: c is a variable of type (char **) with a value of 8092 *c is a variable of type (char*) with a value of 7230 **c is a variable of type (char) with a value of'z' void pointers The type of pointer void is a special type of pointer. void pointers can point to any data type, from an integer value or a float to a string of characters. Its sole limitation is that the pointed data cannot be referenced directly (we can not use reference asterisk * operator on them), since its length is always undetermined, and for that reason we will always have to resort to type casting or assignations to turn our void pointer to a pointer of a concrete data type to which we can refer. One of its utilities may be for passing generic parameters to a function: 6, 10, 13 // integer increaser #include <iostream.h> void increase (void* data, int type) { switch (type) { case sizeof(char) : (*((char*)data))++; break; case sizeof(short): (*((short*)data))++; break; case sizeof(long) : (*((long*)data))++; break; } } int main () { char a = 5; short b = 9; long c = 12; increase (&a,sizeof(a)); increase (&b,sizeof(b)); increase (&c,sizeof(c)); cout << (int) a << ", " << b << ", " << c; return 0; 71 } sizeof is an operator integrated in the C++ language that returns a constant value with the size in bytes of its parameter, so, for example, sizeof(char) is 1, because char type is 1 byte long. Pointers to functions C++ allows operations with pointers to functions. The greatest use of this is for passing a function as a parameter to another function, since these cannot be passed dereferenced. In order to declare a pointer to a function we must declare it like the prototype of the function except the name of the function is enclosed between parenthesis () and a pointer asterisk (*) is inserted before the name. It might not be a very handsome syntax, but that is how it is done in C++: 8 // pointer to functions #include <iostream.h> int addition (int a, int b) { return (a+b); } int subtraction (int a, int b) { return (a-b); } int (*minus)(int,int) = subtraction; int operation (int x, int y, int (*functocall)(int,int)) { int g; g = (*functocall)(x,y); return (g); } int main () { int m,n; m = operation (7, 5, addition); n = operation (20, m, minus); cout <<n; return 0; } In the example, minus is a global pointer to a function that has two parameters of type int, it is immediately assigned to point to the function subtraction, all in a single line: int (* minus)(int,int) = subtraction; 4.5. Dynamic Memory Until now, in our programs, we have only had as much memory as we have requested in declarations of variables, arrays and other objects that we included, having the size of all of them 72 fixed before the execution of the program. But, what if we need a variable amount of memory that can only be determined during the program execution (runtime), for example, in case that we need an user input to determine the necessary amount of space? The answer is dynamic memory, for which C++ integrates the operators new and delete. Operators new and delete are exclusive of C++. Farther ahead in this section are shown the C equivalents for these operators. Operators new and new[ ] In order to request dynamic memory, the operator new exists. new is followed by a data type and optionally the number of elements required within brackets []. It returns a pointer to the beginning of the new block of assigned memory. Its form is: pointer = new type or pointer = new type [elements] The first expression is used to assign memory to contain one single element of type. The second one is used to assign a block (an array) of elements of type. For example: int * bobby; bobby = new int [5]; In this case, the operating system has assigned space for 5 elements of type int in a heap and it has returned a pointer to its beginning that has been assigned to bobby. Therefore, now, bobby points to a valid block of memory with space for 5 int elements. You could ask what is the difference between declaring a normal array and assigning memory to a pointer as we have just done. The most important one is that the size of an array must be a constant value, which limits its size to what we decide at the moment of designing the program before its execution, whereas the dynamic memory allocation allows assigning memory during the execution of the program using any variable, constant or combination of both as size. The dynamic memory is generally managed by the operating system, and in multitask interfaces it can be shared between several applications, so there is a possibility that the memory exhausts. If this happens and the operating system cannot assign the memory that we request with the operator new, a null pointer will be returned. For that reason it is recommended to always check to see if the returned pointer is null after a call to new. int * bobby; bobby = new int [5]; if (bobby == NULL) { 73 // error assigning memory. Take measures. }; Operator delete. Since the necessity of dynamic memory is usually limited to concrete moments within a program, once it is no longer needed it should be freed so that it becomes available for future requests of dynamic memory. The operator delete exists for this purpose, whose form is: delete pointer; or delete [] pointer; The first expression should be used to delete memory alloccated for a single element, and the second one for memory allocated for multiple elements (arrays). In most compilers both expressions are equivalent and can be used without distinction, although indeed they are two different operators and so must be considered for operator overloading. // rememb-o-matic #include <iostream.h> #include <stdlib.h> int main () { char input [100]; int i,n; long * l; cout << "How many numbers do you want to type in? "; cin.getline (input,100); i=atoi (input); l= new long[i]; if (l == NULL) exit (1); for (n=0; n<i; n++) { cout << "Enter number: "; cin.getline (input,100); l[n]=atol (input); } cout << "You have entered: "; for (n=0; n<i; n++) cout << l[n] << ", "; delete[] l; return 0; } How many numbers do you want to type in? 5 Enter number : 75 Enter number : 436 Enter number : 1067 Enter number : 8 Enter number : 32 You have entered: 75, 436, 1067, 8, 32, This simple example that memorizes numbers does not have a limited amount of numbers that can be introduced, thanks to us requesting to the system to provide as much space as is necessary to store all the numbers that the user wishes to introduce. NULL is a constant value defined in manyfold C++ libraries specially designed to indicate null pointers. In case that this constant is not defined you can do it yourself by defining it to 0: 74 #define NULL 0 It is indifferent to put 0 or NULL when checking pointers, but the use of NULL with pointers is widely extended and it is recommended for greater legibility. The reason is that a pointer is rarely compared or set directly to a numerical literal constant except precisely number 0, and this way this action is symbolically masked. Dynamic memory in ANSI-C Operators new and delete are exclusive of C++ and they are not available in C language. In C language, in order to assign dynamic memory we have to resort to the library stdlib.h. We are going to see them, since they are also valid in C++ and they are used in some existing programs. The function malloc It is the generic function to assign dynamic memory to pointers. Its prototype is: void * malloc (size_t nbytes); where nbytes is the number of bytes that we want to be assigned to the pointer. The function returns a pointer of type void*, which is the reason why we have to type cast the value to the type of the destination pointer, for example: char * ronny; ronny = (char *) malloc (10); This assigns to ronny a pointer to an usable block of 10 bytes. When we want to assign a block of data of a different type other than char (different from 1 byte) we must multiply the number of elements desired by the size of each element. Luckyly we have at our disposition the operator sizeof, that returns the size of the type of a concrete datum. int * bobby; bobby = (int *) malloc (5 * sizeof(int)); This piece of code assigns to bobby a pointer to a block of 5 integers of type int, this size can be equal to 2, 4 or more bytes according to the system where the program is compiled. The function calloc. calloc is very similar to malloc in its operation, its main difference is in its prototype: void * calloc (size_t nelements, size_t size); Since it admits 2 parameters instead of one. These two parameters are multiplied to obtain the total size of the memory block to be assigned. Usually the first parameter (nelements) is the number of elements and the second one (size) serves to specify the size of each element. For example, we could define bobby with calloc thus: int * bobby; bobby = (int *) calloc (5, sizeof(int)); Another difference between malloc and calloc is that calloc initializates all its elements to 0. The function realloc. 75 It changes the size of a block of memory already assigned to a pointer. void * realloc (void * pointer, size_t size); pointer parameter receives a pointer to an already assigned memory block or a null pointer, and size specifies the new size that the memory block shall have. The function assigns size bytes of memory to the pointer. The function may need to change the location of the memory block so that the new size can fit, in that case the present content of the block is copied to the new one to guarantee that the existing data is not lost. The new pointer is returned by the function. If it has not been posible to assign the memory block with the new size it returns a null pointer but the pointer specified as parameter and its content remains unchanged. The function free. It releases a block of dynamic memory previously assigned using malloc, calloc or realloc. void free (void * pointer); This function must only be used to release memory assigned with functions malloc, calloc and realloc. 76 UNIT 5 STRUCTURES AND UNION Contents 5.1 5.2 5.2.1 5.2.2 5.2.3 Structures. 5.1.1 Poniters to structures 5.1.2 Nesting structures User defined data types. Typedef Union Enum 5.1 Structures A data structure is a set of diverse types of data that may have different lengths grouped together under a unique declaration. Its form is the following: struct model_name { type1 element1; type2 element2; type3 element3; . . } object_name; where model_name is a name for the model of the structure type and the optional parameter object_name is a valid identifier (or identifiers) for structure object instantiations. Within curly brackets { } they are the types and their sub-identifiers corresponding to the elements that compose the structure. If the structure definition includes the parameter model_name (optional), that parameter becomes a valid type name equivalent to the structure. For example: struct products { char name [30]; float price; }; products apple; products orange, melon; We have first defined the structure model products with two fields: name and price, each of a different type. We have then used the name of the structure type (products) to declare three objects of that type: apple, orange and melon. Once declared, products has become a new valid type name like the fundamental ones int, char or short and we are able to declare objects (variables) of that type. 77 The optional field object_name that can go at the end of the structure declaration serves to directly declare objects of the structure type. For example, we can also declare the structure objects apple, orange and melon this way: struct products { char name [30]; float price; } apple, orange, melon; Moreover, in cases like the last one in which we took advantage of the declaration of the structure model to declare objects of it, the parameter model_name (in this case products) becomes optional. Although if model_name is not included it will not be possible to declare more objects of this same model later. It is important to clearly differentiate between what is a structure model, and what is a structure object. Using the terms we used with variables, the model is the type, and the object is the variable. We can instantiate many objects (variables) from a single model (type). Once we have declared our three objects of a determined structure model (apple, orange and melon) we can operate with the fields that form them. To do that we have to use a point (.) inserted between the object name and the field name. For example, we could operate with any of these elements as if they were standard variables of their respective types: apple.name apple.price orange.name orange.price melon.name melon.price each one being of its corresponding data type: apple.name, orange.name and melon.name are of type char[30], and apple.price, orange.price and melon.price are of type float. We are going to leave apples, oranges and melons and go with an example about movies: // example about structures #include <iostream.h> #include <string.h> #include <stdlib.h> struct movies_t { char title [50]; int year; } mine, yours; Enter title: Alien Enter year: 1979 My favourite movie is: 2001 A Space Odyssey (1968) And yours: Alien (1979) void printmovie (movies_t movie); int main () { char buffer [50]; strcpy (mine.title, "2001 A Space Odyssey"); 78 mine.year = 1968; cout << "Enter title: "; cin.getline (yours.title,50); cout << "Enter year: "; cin.getline (buffer,50); yours.year = atoi (buffer); cout << "My favourite movie is:\n "; printmovie (mine); cout << "And yours:\n "; printmovie (yours); return 0; } void printmovie (movies_t movie) { cout << movie.title; cout << " (" << movie.year << ")\n"; } The example shows how we can use the elements of a structure and the structure itself as normal variables. For example, yours.year is a valid variable of type int, and mine.title is a valid array of 50 chars. Notice that mine and yours are also treated as valid variables of type movies_t when being passed to the function printmovie(). Therefore, one of the most important advantages of structures is that we can refer either to their elements individually or to the entire structure as a block. Structures are a feature used very often to build data bases, specially if we consider the possibility of building arrays of them. // array of structures #include <iostream.h> #include <stdlib.h> #define N_MOVIES 5 struct movies_t { char title [50]; int year; } films [N_MOVIES]; void printmovie (movies_t movie); int main () { char buffer [50]; Enter Enter Enter Enter Enter Enter Enter Enter Enter Enter title: Alien year: 1979 title: Blade Runner year: 1982 title: Matrix year: 1999 title: Rear Window year: 1954 title: Taxi Driver year: 1975 You have entered these movies: Alien (1979) Blade Runner (1982) Matrix (1999) Rear Window (1954) 79 int n; for (n=0; n<N_MOVIES; n++) { cout << "Enter title: "; cin.getline (films[n].title,50); cout << "Enter year: "; cin.getline (buffer,50); films[n].year = atoi (buffer); } cout << "\nYou have entered these movies:\n"; for (n=0; n<N_MOVIES; n++) printmovie (films[n]); return 0; } Taxi Driver (1975) void printmovie (movies_t movie) { cout << movie.title; cout << " (" << movie.year << ")\n"; } 5.1.1 Pointers to structures Like any other type, structures can be pointed by pointers. The rules are the same as for any fundamental data type: The pointer must be declared as a pointer to the structure: struct movies_t { char title [50]; int year; }; movies_t amovie; movies_t * pmovie; Here amovie is an object of struct type movies_t and pmovie is a pointer to point to objects of struct type movies_t. So, the following, as with fundamental types, would also be valid: pmovie = &amovie; Ok, we will now go with another example, that will serve to introduce a new operator: // pointers to structures #include <iostream.h> #include <stdlib.h> struct movies_t { Enter title: Matrix Enter year: 1999 You have entered: Matrix (1999) 80 char title [50]; int year; }; int main () { char buffer[50]; movies_t amovie; movies_t * pmovie; pmovie = & amovie; cout << "Enter title: "; cin.getline (pmovie->title,50); cout << "Enter year: "; cin.getline (buffer,50); pmovie->year = atoi (buffer); cout << "\nYou have entered:\n"; cout << pmovie->title; cout << " (" << pmovie->year << ")\n"; return 0; } The previous code includes an important introduction: operator ->. This is a reference operator that is used exclusively with pointers to structures and pointers to classes. It allows us not to have to use parenthesis on each reference to a structure member. In the example we used: pmovie->title that could be translated to: (*pmovie).title both expressions pmovie->title and (*pmovie).title are valid and mean that we are evaluating the element title of the structure pointed by pmovie. You must distinguish it clearly from: *pmovie.title that is equivalent to *(pmovie.title) and that would serve to evaluate the value pointed by element title of structure movies, that in this case (where title is not a pointer) it would not make much sense. The following panel summarizes possible combinations of pointers and structures: 81 Expression Description pmovie.title Element title of structure pmovie Equivalent pmovie->title Element title of structure pointed by pmovie (*pmovie).title *pmovie.title Value pointed by element title of structure pmovie *(pmovie.title) 5.1.2 Nesting structures Structures can also be nested so that a valid element of a structure can also be another structure. struct movies_t { char title [50]; int year; } struct friends_t { char name [50]; char email [50]; movies_t favourite_movie; } charlie, maria; friends_t * pfriends = &charlie; Therefore, after the previous declaration we could use the following expressions: charlie.name maria.favourite_movie.title charlie.favourite_movie.year pfriends->favourite_movie.year (where, by the way, the last two expressions are equivalent). The concept of structures that has been discussed in this section is the same as used in C language, nevertheless, in C++, the structure concept has been extended up to the same functionality of a class with the peculiarity that all of its elements are considered public. 5.2 User Defined Data Types We have already seen a data type that is defined by the user (programmer): the structures. But in addition to these there are other kinds of user defined data types: 5.2.1 Typedef C++ allows us to define our own types based on other existing data types. In order to do that we shall use keyword typedef, whose form is: typedef existing_type new_type_name ; where existing_type is a C++ fundamental or any other defined type and new_type_name is the name that the new type we are going to define will receive. For example: 82 typedef typedef typedef typedef char C; unsigned int WORD; char * string_t; char field [50]; In this case we have defined four new data types: C, WORD, string_t and field as char, unsigned int, char* and char[50] respectively, that we could perfectly use later as valid types: C achar, anotherchar, *ptchar1; WORD myword; string_t ptchar2; field name; Typedef can be useful to define a type that is repeatedly used within a program and it is possible that we will need to change it in a later version, or if a type you want to use has too long a name and you want it to be shorter. 5.2.2 Unions Unions allow a portion of memory to be accessed as different data types, since all of them are in fact the same location in memory. Its declaration and use is similar to the one of structures but its functionality is totally different: union model_name { type1 element1; type2 element2; type3 element3; . . } object_name; All the elements of the union declaration occupy the same space of memory. Its size is the one of the greatest element of the declaration. For example: union mytypes_t { char c; int i; float f; } mytypes; defines three elements: mytypes.c mytypes.f mytypes.i each one of a different data type. Since all of them are referring to a same location in memory, the modification of one of the elements will afect the value of all of them. 83 One of the uses a union may have is to unite an elementary type with an array or structures of smaller elements. For example, union mix_t{ long l; struct { short hi; short lo; } s; char c[4]; } mix; defines three names that allow us to access the same group of 4 bytes: mix.l, mix.s and mix.c and which we can use according to how we want to access it, as long, short or char respectively. I have mixed types, arrays and structures in the union so that you can see the different ways that we can access the data: Anonymous unions In C++ we have the option that unions be anonymous. If we include a union in a structure without any object name (the one that goes after the curly brackets { }) the union will be anonymous and we will be able to access the elements directly by its name. For example, look at the difference between these two declarations: anonymous union Union struct { char title[50]; char author[50]; union { float dollars; int yens; } price; } book; struct { char title[50]; char author[50]; union { float dollars; int yens; }; } book; The only difference between the two pieces of code is that in the first one we gave a name to the union (price) and in the second we did not. The difference is when accessing members dollars and yens of an object. In the first case it would be: book.price.dollars book.price.yens whereas in the second it would be: 84 book.dollars book.yens Once again I remind you that because it is a union, the fields dollars and yens occupy the same space in the memory so they cannot be used to store two different values. That means that you can include a price in dollars or yens, but not both. Enumerations (enum) Enumerations serve to create data types to contain something different that is not limited to either numerical or character constants nor to the constants true and false. Its form is the following: enum model_name { value1, value2, value3, . . } object_name; For example, we could create a new type of variable called color to store colors with the following declaration: enum colors_t {black, blue, green, cyan, red, purple, yellow, white}; Notice that we do not include any fundamental data type in the declaration. To say it another way, we have created a new data type without it being based on any existing one: the type color_t, whose possible values are the colors that we have enclosed within curly brackets {}. For example, once declared the colors_t enumeration in the following expressions will be valid: colors_t mycolor; mycolor = blue; if (mycolor == green) mycolor = red; In fact our enumerated data type is compiled as an integer and its possible values are any type of integer constant specified. If it is not specified, the integer value equivalent to the first possible value is 0 and the following ones follow a +1 progression. Thus, in our data type colors_t that we defined before, black would be equivalent to 0, blue would be equivalent to 1, green to 2 and so on. If we explicitly specify an integer value for some of the possible values of our enumerated type (for example the first one) the following values will be the increases of this, for example: enum months_t { january=1, february, march, april, may, june, july, august, september, october, november, december} y2k; 85 In this case, variable y2k of the enumerated type months_t can contain any of the 12 possible values that go from january to december and that are equivalent to values between 1 and 12, not between 0 and 11 since we have made January equal to 1. 86 UNIT 6 FUNCTIONS Contents 6.1 6.2 6.3 6.4 6.5 6.6 6.7 6.8 6.9 Functions Default values in arguments Void Functions Call by value and reference Passing Reference to Functions. Returning References from Functions Inline function Recursive function Prototyping function 6.1.Functions Using functions we can structure our programs in a more modular way, accessing all the potential that structured programming in C++ can offer us. A function is a block of instructions that is executed when it is called from some other point of the program. The following is its format: type name ( argument1, argument2, ...) statement where: — type is the type of data returned by the function. — name is the name by which it will be possible to call the function. — arguments (as many as wanted can be specified). Each argument consists of a type of data followed by its identifier, like in a variable declaration (for example, int x) and which acts within the function like any other variable. They allow passing parameters to the function when it is called. The different parameters are separated by commas. — statement is the function's body. It can be a single instruction or a block of instructions. In the latter case it must be delimited by curly brackets {}. Here you have the first function example: // function example #include <iostream.h> The result is 8 int addition (int a, int b) { int r; r=a+b; 87 return (r); } int main () { int z; z = addition (5,3); cout << "The result is " << z; return 0; } In order to examine this code, first of all remember something said at the beginning of this tutorial: a C++ program always begins its execution with the main function. So we will begin there. We can see how the main function begins by declaring the variable z of type int. Right after that we see a call to addition function. If we pay attention we will be able to see the similarity between the structure of the call to the function and the declaration of the function itself in the code lines above: The parameters have a clear correspondence. Within the main function we called to addition passing two values: 5 and 3 that correspond to the int a and int b parameters declared for the function addition. At the moment at which the function is called from main, control is lost by main and passed to function addition. The value of both parameters passed in the call (5 and 3) are copied to the local variables int a and int b within the function. Function addition declares a new variable (int r;), and by means of the expression r=a+b;, it assigns to r the result of a plus b. Because the passed parameters for a and b are 5 and 3 respectively, the result is 8. The following line of code: return (r); Finalizes function addition, and returns the control back to the function that called it (main) following the program from the same point at which it was interrupted by the call to addition. But additionally, return was called with the content of variable r (return (r);), which at that moment was 8, so this value is said to be returned by the function. The value returned by a function is the value given to the function when it is evaluated. Therefore, z will store the value returned by addition (5, 3), that is 8. To explain it another way, you can imagine that the call to a function (addition (5,3)) is literally replaced by the value it returns (8). 88 The following line of code in main is: cout << "The result is " << z; That, as you may already suppose, produces the printing of the result on the screen. Scope of variables [re] You must consider that the scope of variables declared within a function or any other block of instructions is only their own function or their own block of instructions and cannot be used outside of them. For example, in the previous example it had been impossible to use the variables a, b or r directly in function main since they were local variables to function addition. Also, it had been impossible to use the variable z directly within function addition, since this was a local variable to the function main. Therefore, the scope of local variables is limited to the same nesting level in which they are declared. Nevertheless you can also declare global variables that are visible from any point of the code, inside and outside any function. In order to declare global variables you must do it outside any function or block of instructions, that means, directly in the body of the program. And here is another example about functions: The The The The // function example #include <iostream.h> int subtraction (int a, int b) { int r; r=a-b; return (r); } int main () { int x=5, y=3, z; z = subtraction (7,2); cout << "The first result is " << z << '\n'; cout << "The second result is " << subtraction (7,2) << '\n'; cout << "The third result is " << subtraction (x,y) << '\n'; z= 4 + subtraction (x,y); cout << "The fourth result is " << z << '\n'; return 0; 89 first result is 5 second result is 5 third result is 2 fourth result is 6 } In this case we have created the function subtraction. The only thing that this function does is to subtract both passed parameters and to return the result. Nevertheless, if we examine the function main we will see that we have made several calls to function subtraction. We have used some different calling methods so that you see other ways or moments when a function can be called. In order to understand well these examples you must consider once again that a call to a function could be perfectly replaced by its return value. For example the first case (that you should already know beacause it is the same pattern that we have used in previous examples): z = subtraction (7,2); cout << "The first result is " << z; If we replace the function call by its result (that is 5), we would have: z = 5; cout << "The first result is " << z; As well as cout << "The second result is " << subtraction (7,2); has the same result as the previous call, but in this case we made the call to subtraction directly as a parameter for cout. Simply imagine that we had written: cout << "The second result is " << 5; since 5 is the result of subtraction (7,2). In the case of cout << "The third result is " << subtraction (x,y); The only new thing that we introduced is that the parameters of subtraction are variables instead of constants. That is perfectly valid. In this case the values passed to the function subtraction are the values of x and y, that are 5 and 3 respectively, giving 2 as result. The fourth case is more of the same. Simply note that instead of: z = 4 + subtraction (x,y); we could have put: z = subtraction (x,y) + 4; with exactly the same result. Notice that the semicolon sign (;) goes at the end of the whole expression. It does not necessarily have to go right after the function call. The explanation might be once again that you imagine that a function can be replaced by its result: z = 4 + 2; z = 2 + 4; 6.2. Default values in arguments 90 When declaring a function we can specify a default value for each parameter. This value will be used if that parameter is left blank when calling to the function. To do that we simply have to assign a value to the arguments in the function declaration. If a value for that parameter is not passed when the function is called, the default value is used, but if a value is specified this default value is stepped on and the passed value is used. For example: // default values in functions #include <iostream.h> 6 5 int divide (int a, int b=2) { int r; r=a/b; return (r); } int main () { cout << divide (12); cout << endl; cout << divide (20,4); return 0; } As we can see in the body of the program there are two calls to the function divide. In the first one: divide (12) we have only specified one argument, but the function divide allows up to two. So the function divide has assumed that the second parameter is 2 since that is what we have specified to happen if this parameter is lacking (notice the function declaration, which finishes with int b=2). Therefore the result of this function call is 6 (12/2). In the second call: divide (20,4) there are two parameters, so the default assignation (int b=2) is stepped on by the passed parameter, that is 4, making the result equal to 5 (20/4). 6.3. Void function If you remember the syntax of a function declaration: type name ( argument1, argument2 ...) statement you will see that it is obligatory that this declaration begins with a type, that is the type of the data that will be returned by the function with the return instruction. But what if we want to return no value? 91 Imagine that we want to make a function just to show a message on the screen. We do not need it to return any value, moreover, we do not need it to receive any parameters. For these cases, the void type was devised in the C language. Take a look at: // void function example #include <iostream.h> I'm a function! void dummyfunction (void) { cout << "I'm a function!"; } int main () { dummyfunction (); return 0; } Although in C++ it is not necessary to specify void, its use is considered suitable to signify that it is a function without parameters or arguments and not something else. What you must always be aware of is that the format for calling a function includes specifing its name and enclosing the arguments between parenthesis. The non-existence of arguments does not exempt us from the obligation to use parenthesis. For that reason the call to dummyfunction is dummyfunction (); This clearly indicates that it is a call to a function and not the name of a variable or anything else. 6.4 Call by value and reference. Until now, in all the functions we have seen, the parameters passed to the functions have been passed by value. This means that when calling a function with parameters, what we have passed to the function were values but never the specified variables themselves. For example, suppose that we called our first function addition using the following code: int x=5, y=3, z; z = addition ( x , y ); What we did in this case was to call function addition passing the values of x and y, that means 5 and 3 respectively, not the variables themselves. This way, when function addition is being called the value of its variables a and b become 5 and 3 respectively, but any modification of a or b within the function addition will not affect the values of x and y outside it, because variables x and y were not passed themselves to the function, only their values. 92 But there might be some cases where you need to manipulate from inside a function the value of an external variable. For that purpose we have to use arguments passed by reference, as in the function duplicate of the following example: // passing parameters by reference #include <iostream.h> x=2, y=6, z=14 void duplicate (int& a, int& b, int& c) { a*=2; b*=2; c*=2; } int main () { int x=1, y=3, z=7; duplicate (x, y, z); cout << "x=" << x << ", y=" << y << ", z=" << z; return 0; } The first thing that should call your attention is that in the declaration of duplicate the type of each argument was followed by an ampersand sign (&), that serves to specify that the variable has to be passed by reference instead of by value, as usual. When passing a variable by reference we are passing the variable itself and any modification that we do to that parameter within the function will have effect in the passed variable outside it. To express it another way, we have associated a, b and c with the parameters used when calling the function (x, y and z) and any change that we do on a within the function will affect the value of x outside. Any change that we do on b will affect y, and the same with c and z. That is why our program's output, that shows the values stored in x, y and z after the call to duplicate, shows the values of the three variables of main doubled. If when declaring the following function: void duplicate (int& a, int& b, int& c) we had declared it thus: void duplicate (int a, int b, int c) That is, without the ampersand (&) signs, we would have not passed the variables by reference, but their values, and therefore, the output on screen for our program would have been the values of x, y and z without having been modified. 93 This type of declaration "by reference" using the ampersand (&) sign is exclusive of C++. Passing by reference is an effective way to allow a function to return more than one single value. For example, here is a function that returns the previous and next numbers of the first parameter passed. // more than one returning value #include <iostream.h> Previous=99, Next=101 void prevnext (int x, int& prev, int& next) { prev = x-1; next = x+1; } int main () { int x=100, y, z; prevnext (x, y, z); cout << "Previous=" << y << ", Next=" << z; return 0; } 6.5. Passing Reference to Functions. Reference variables are particularly useful when passing to functions. The changes made in the called functions are reflected back to the calling function . The program uses the classic problem in programming, swapping the values of two variables. e.g. void val_swap(int x, int y) // Call by Value { int t; t = x; x = y; y = t; } void add_swap(int *x, int *y) // Call by Address { int t; t = *x; *x = *y; *y = t; } void val_swap(int &x, int &y) // Call by Reference { 94 int t; t = x; x = y; y = t; } void main() { int n1 = 25, n2 = 50; cout << “Before call by value : “; cout << “ n1 = “ << n1 << “ n2 = “ << n2 << endl; val_swap( n1, n2 ); cout << “ After call by value : “; cout << “ n1 = “ << n1 << “ n2 = “ << n2 << endl; cout << “Before call by address : “; cout << “ n1 = “ << n1 << “ n2 = “ << n2 << endl; val_swap( &n1, &n2 ); cout << “ After call by address : “; cout << “ n1 = “ << n1 << “ n2 = “ << n2 << endl; cout << “Before call by reference: “; cout << “ n1 = “ << n1 << “ n2 = “ << n2 << endl; val_swap( n1, n2 ); cout << “ After call by value : “ ; cout << “ n1 = “ << n1 << “ n2 = “ << n2 << endl; } Output : Before call by value : n1 = 25 n2 = 50 After call by value : n1 = 25 n2 = 50 // x = 50, y = 25 Before call by address : n1 = 25 n2 = 50 After call by address : n1 = 50 n2 = 25 //x = 50, y = 25 Before call by reference: n1 = 50 n2 = 25 After call by reference : n1 = 25 n2 = 50 //x = 25, y = 50 You can see that the only difference in writing the functions in call by value and call by reference is while receiving the parameters where as in pass by address the function body has some changes, i.e. they use (*) indirection operator to manipulate the variables. 6.6. Returning References from Functions 95 Just as in passing the parameters by reference, returning a reference also doesn’t return back a copy of the variable , instead an alias is returned. e.g. int &func(int &num) { : : return(num); } void main() { int n1,n2; : : n1 = fn( n2); } Notice that the function header contains an ampersand (&) before the function name. This is how a function is made to return reference variable. In this case, it takes a reference to an integer as its argument and returns a reference to an integer. This facility can be very useful for returning objects and even structure variables. 6.7. Inline functions The inline directive can be included before a function declaration to specify that the function must be compiled as code at the same point where it is called. This is equivalent to declaring a macro. Its advantage is only appreciated in very short functions, in which the resulting code from compiling the program may be faster if the overhead of calling a function (stacking of arguments) is avoided. The format for its declaration is: inline type name ( arguments ... ) { instructions ... } and the call is just like the call to any other function. It is not necessary to include the inline keyword before each call, only in the declaration. 6.8. Recursive function Recursivity is the property that functions have to be called by themselves. It is useful for tasks such as some sorting methods or to calculate the factorial of a number. For example, to obtain the factorial of a number (n) its mathematical formula is: n! = n * (n-1) * (n-2) * (n-3) ... * 1 96 more concretely, 5! (factorial of 5) would be: 5! = 5 * 4 * 3 * 2 * 1 = 120 and a recursive function to do that could be this: // factorial calculator #include <iostream.h> Type a number: 9 !9 = 362880 long factorial (long a) { if (a > 1) return (a * factorial (a-1)); else return (1); } int main () { long l; cout << "Type a number: "; cin >> l; cout << "!" << l << " = " << factorial (l); return 0; } Notice how in function factorial we included a call to itself, but only if the argument is greater than 1, since otherwise the function would perform an infinite recursive loop in which once it arrived at 0 it would continue multiplying by all the negative numbers (probably provoking a stack overflow error on runtime). This function has a limitation because of the data type used in its design (long) for more simplicity. In a standard system, the type long would not allow storing factorials greater than 12!. 6.9 Prototyping functions Until now, we have defined the all of the functions before the first appearance of calls to them, that generally was in main, leaving the function main for the end. If you try to repeat some of the examples of functions described so far, but placing the function main before any other function that is called from within it, you will most likely obtain an error. The reason is that to be able to call a function it must have been declared previously (it must be known), like we have done in all our examples. But there is an alternative way to avoid writing all the code of all functions before they can be used in main or in another function. It is by prototyping functions. This consists in making a previous shorter, but quite significant, declaration of the complete definition so that the compiler can know the arguments and the return type needed. Its form is: type name ( argument_type1, argument_type2, ...); 97 It is identical to the header of a function definition, except: • It does not include a statement for the function. That means that it does not include the body with all the instructions that are usually enclose within curly brackets { }. • It ends with a semicolon sign (;). In the argument enumeration it is enough to put the type of each argument. The inclusion of a name for each argument as in the definition of a standard function is optional, although recommended. For example: • // prototyping #include <iostream.h> void odd (int a); void even (int a); int main () { int i; do { cout << "Type a number: (0 to exit)"; cin >> i; odd (i); } while (i!=0); return 0; } Type a number (0 Number is odd. Type a number (0 Number is even. Type a number (0 Number is even. Type a number (0 Number is even. to exit): 9 to exit): 6 to exit): 1030 to exit): 0 void odd (int a) { if ((a%2)!=0) cout << "Number is odd.\n"; else even (a); } void even (int a) { if ((a%2)==0) cout << "Number is even.\n"; else odd (a); } This example is indeed not an example of effectiveness, I am sure that at this point you can already make a program with the same result using only half of the code lines. But this example ilustrates how protyping works. Moreover, in this concrete case the prototyping of -at least- one of the two functions is necessary. The first things that we see are the prototypes of functions odd and even: void odd (int a); void even (int a); that allows these functions to be used before they are completely defined, for example, in main, which now is located in a more logical place: the beginning of the program's code. 98 Nevertheless, the specific reason why this program needs at least one of the functions prototyped is because in odd there is a call to even and in even there is a call to odd. If none of the two functions had been previously declared, an error would have happened, since either odd would not be visible from even (because it has not still been declared), or even would not be visible from odd. Many programmers recommend that all functions be prototyped. It is also my recommendation, mainly in case that there are many functions or in case that they are very long. Having the prototype of all the functions in the same place can spare us some time when determining how to call it or even ease the creation of a header file. 99 UNIT7 CLASSES AND OBJECTS Contents 7.1 7.2 7.3 7.4 7.5 7.6 7.7 7.8 7.9 7.10 Introduction to class. Class Definition Classes and Objects Access specifiers – Private, Public and Protected. Member functions of the class. Passing and returning objects. Pointers to objects. Array of objects. The special ‘this’ pointer self test 7.1 Introduction to Classes Object-oriented programming (OOP) is a conceptual approach to design programs. It can be implemented in many languages, whether they directly support OOP concepts or not. The C language also can be used to implement many of the object-oriented principles. However, C++ supports the object-oriented features directly. All these features like Data abstraction, Data encapsulation, Information hiding etc have one thing in common – the vehicle that is used to implement them. The vehicle is “ class.” Class is a user defined data type just like structures, but with a difference. It also has three sections namely private, public and protected. Using these, access to member variables of a class can be strictly controlled. 7.2. Class Definition The following is the general format of defining a class template: class class_name { permission_label_1: member1; permission_label_2: member2; ... } object_name; example:- 100 class tag_name { public : type member_variable_name; : type member_function_name(); : private: // Must // Optional type member_variable_name; : type member_function_name(); : }; The keyword class is used to define a class template. The private and public sections of a class are given by the keywords ‘private’ and ‘public’ respectively. They determine the accessibility of the members. All the variables declared in the class, whether in the private or the public section, are the members of the class. Since the class scope is private by default, you can also omit the keyword private. In such cases you must declare the variables before public, as writing public overrides the private scope of the class. e.g. class tag_name { type member_variable_name; // private : type member_function_name(); // private : public : // Must type member_variable_name; : type member_function_name(); : }; The variables and functions from the public section are accessible to any function of the program. However, a program can access the private members of a class only by using the public member functions of the class. This insulation of data members from direct access in a program is called information hiding. e.g. 101 class player { public : void getstats(void); void showstats(void); int no_player; private : char name[40]; int age; int runs; int tests; float average; float calcaverage(void); }; The above example models a cricket player. The variables in the private section – name, age, runs, highest, tests, and average – can be accessed only by member functions of the class calcaverage(), getstats() and showstats(). The functions in the public section - getstats() and showstats() can be called from the program directly , but function calcaverage() can be called only from the member functions of the class – getstats() and showstats(). With information hiding one need not know how actually the data is represented or functions implemented. The program need not know about the changes in the private data and functions. The interface(public) functions take care of this. The OOP methodology is to hide the implementation specific details, thus reducing the complexities involved. 7.3. Classes and Objects As seen earlier, a class is a vehicle to implement the OOP features in the C++ language. Once a class is declared, an object of that type can be defined. An object is said to be a specific instance of a class just like Maruti car is an instance of a vehicle or pigeon is the instance of a bird. Once a class has been defined several objects of that type can be declared. For instance, an object of the class defined above can be declared with the following statement: player Sachin, Dravid, Mohan ; [Or] class player Sachin , Dravid, Mohan ; where Sachin and Dravid are two objects of the class player. Both the objects have their own set of member variables. Once the object is declared, its public members can be accessed using the dot operator with the name of the object. We can also use the variable no_player in the public section with a dot operator in functions other than the functions declared in the public section of the class. 102 e.g. Sachin.getstats(); Dravid.showstats(); Mohan.no_player = 10; 7.4. Access specifis- Private Public and Protected members Class members can either be declared in public’,’protected’ or in the ‘private’ sections of the class. But as one of the features of OOP is to prevent data from unrestricted access, the data members of the class are normally declared in the private section. The member functions that form the interface between the object and the program are declared in public section ( otherwise these functions can not be called from the program ). The member functions which may have been broken down further or those, which do not form a part of the interface, are declared in the private section of the class. By default all the members of the class are private. The third access specifier ‘protected’ that is not used in the above example, pertains to the member functions of some new class that will be inherited from the base class. As far as non-member functions are concerned, private and protected are one and the same. Summary of Access specifiers • private members of a class are accessible only from other members of their same class or from their "friend" classes. • protected members are accessible from members of their same class and friend classes, and also from members of their derived classes. • Finally, public members are accessible from anywhere the class is visible. 7.5. Member Functions of a Class A member function of the class is same as an ordinary function. Its declaration in a class template must define its return value as well as the list of its arguments. You can declare or define the function in the class specifier itself, in which case it is just like a normal function. But since the functions within the class specifier is considered inline by the compiler we should not define large functions and functions with control structures, iterative statements etc should not be written inside the class specifier. However, the definition of a member function differs from that of an ordinary function if written outside the class specifier. The header of a member function uses the scope operator (::) to specify the class to which it belongs. The syntax is: return_type class_name :: function_name (parameter list) { : } e.g. void player :: getstats (void) { : } 103 void player :: showstats (void) { : : } This notation indicates that the functions getstats () and showstats() belong to the class player. COMPLETE EXAMPLE OF CLASS: Find the area of the rectangle // class example #include <iostream.h> class CRectangle { int x, y; public: void set_values (int,int); int area (void) {return (x*y);} }; void CRectangle::set_values (int a, int b) { x = a; y = b; } int main () { CRectangle rect, rectb; rect.set_values (3,4); rectb.set_values (5,6); cout << "rect area: " << rect.area() << endl; cout << "rectb area: " << rectb.area() << endl; } ------------------------------------------------------------OUTPUT rect area: 12 rectb area: 30 7.6. Passing and Returning Objects Objects can be passed to a function and returned back just like normal variables. When an object is passed by content , the compiler creates another object as a formal variable in the called function and copies all the data members from the actual variable to it. Objects can also be passed by address, which will be discussed later. 104 e.g. class check { public : check add(check); void get() { cin >> a; } void put() { cout << a; } private : int a; }; void main() { check c1,c2,c3; c1.get(); c2.get(); c3 = c1.add(c2); c3.put(); } check check :: add ( check c2) { check temp; temp.a = a + c2.a; return ( temp); } The above example creates three objects of class check. It adds the member variable of two classes, the invoking class c1 and the object that is passed to the function , c2 and returns the result to another object c3. You can also notice that in the class add() the variable of the object c1 is just referred as ‘a’ where as the member of the object passed .i.e. c2 is referred as ‘c2.a’ . This is because the member function will be pointed by the pointer named this in the compiler where as what we pass should be accessed by the extraction operator ‘.’. we may pass more than one object and also normal variable. we can return an object or a normal variable from the function. We have made use of a temporary object in the function add() in order to facilitate return of the object. 105 7.7. Pointers to Objects Passing and returning of objects is, however, not very efficient since it involves passing and returning a copy of the data members. This problem can be eliminated using pointers. Like other variables, objects of class can also have pointers. Declaring a pointer to an object of a particular class is same as declaring a pointer to a variable of any other data type. A pointer variable containing the address of an object is said to be pointing to that object. Pointers to objects can be used to make a call by address or for dynamic memory allocation. Just like structure pointer, a pointer to an object also uses the arrow operator to access its members. Like pointers to other data types, a pointer to an object also has only one word of memory. It has to be made to point to an already existing object or allocated memory using the keyword ‘new’. e.g. string str; string *sp; // Object // Pointer to an object sp = &str; // Assigns address of an existing object sp = new string // Allocates memory with new. It is perfectly valid to create pointers pointing to objects, in order to do that we must simply consider that once declared, the class becomes a valid type, so use the class name as the type for the pointer. For example: CRectangle * prect; is a pointer to an object of class CRectangle. As it happens with data structures, to refer directly to a member of an object pointed by a pointer you should use operator ->. Here is an example with some possible combinations: a area: 2 *b area: 12 *c area: 2 d[0] area: 30 d[1] area: 56 // pointer to objects example #include <iostream.h> class CRectangle { int width, height; public: void set_values (int, int); int area (void) {return (width * height);} }; void CRectangle::set_values (int a, int b) { width = a; height = b; } int main () { CRectangle a, *b, *c; CRectangle * d = new CRectangle[2]; 106 b= new CRectangle; c= &a; a.set_values (1,2); b->set_values (3,4); d->set_values (5,6); d[1].set_values (7,8); cout << "a area: " << a.area() << endl; cout << "*b area: " << b->area() << endl; cout << "*c area: " << c->area() << endl; cout << "d[0] area: " << d[0].area() << endl; cout << "d[1] area: " << d[1].area() << endl; return 0; } Next you have a summary on how can you read some pointer and class operators (*, &, ., ->, [ ]) that appear in the previous example: *x can be read: pointed by x &x can be read: address of x x.y can be read: member y of object x (*x).y can be read: member y of object pointed by x x->y can be read: member y of object pointed by x (equivalent to the previous one) x[0] can be read: first object pointed by x x[1] can be read: second object pointed by x x[n] can be read: (n+1)th object pointed by x 7.8. Array of Objects As seen earlier, a class is a template, which can contain data items as well as member functions to operate on the data items. Several objects of the class can also be declared and used. Also, an array of objects can be declared and used just like an array of any other data type. An example will demonstrate the use of array of objects. e.g. class student { public : void getdetails(); void printdetails(); private : int rollno; char name[25]; int marks[6]; float percent; }; void student :: getdetails() 107 { int ctr,total; cout << ”enter rollno”; cin >> rollno ; cout << ”enter name”; cin >> name; cout << ” enter 6 marks “ ; for( ctr = 1 ;ctr <= 6 ; ctr++ ) { cin >> marks[ctr]; total = total + marks[ctr]; } percent = total / 6; } void student :: printdetails () { cout << rollno << name << percent ; } void main() { student records[50]; int x=0; cout << “ How many students “; cin >> x; for ( int i =1; i<= x; i++) { records[i].getdeatils(); } for ( int i =1; i<= x; i++) { records[i].printdeatils(); } } As can be seen above, an array of objects is declared just like any other array. Members of the class are accessed, using the array name qualified by a subscript. The statement, 108 records[y].printdetails(); Invokes the member funs printdetails() for the object given by the subscript y. For different values of subscript, it invokes the same member function, but for different objects. 7.9 The Special Pointer ‘this’ When several instances of a class come into existence, it naturally follows that each instance has its own copy of member variables. If this were not the case, then for obvious reasons it would be impossible to create more than one instance of the class. On the other hand, even though the class member functions are encapsulated with the data members inside the class definition, it would be very inefficient in terms of memory usage to replicate all these member functions and store the code for them within each instance. Consequently, only one copy of each member function per class is stored in memory, and must be shared by all of the instances of the class. But this poses a big problem for the compiler: How can any given member function of a class knows which instance it is supposed to be working on ? In other words, up to now in a class member function you have simply been referring to the members directly without regard to the fact that when the instantiations occur each data member will have a different memory address. In other words, all the compiler knows is the offset of each data member from the start of the class. The solution to this dilemma is that, in point of fact, each member function does have access to a pointer variable that points to the instance being manipulated. Fortunately this pointer is supplied to each member function automatically when the function is called, so that this burden is not placed upon the programmer. This pointer variable has a special name ‘this’ (reserved word). Even though the this pointer is implicitly declared, you always have access to it and may use the variable name anywhere you seem appropriate. e.g. class try_this { public : void print(); try_this add(int); private : int ivar; }; 109 void print() { cout << ivar; cout << this -> ivar ; } The function print refers to the member variable ivar directly. Also, an explicit reference is made using the this pointer. This special pointer is generally used to return the object, which invoked the member function. For example, void main() { try_this t1,t2; t2 = t1.add(3); t2.print(); } try_this try_this :: add(int v) { ivar = ivar + v; return ( *this); } In the above example if ivar for t1 is 10 and value in v is 2, then the function add() adds them and ivar for t1 becomes 12 . We want to store this in another object t2, which can be done by returning the object t1 using *this to t2. The result of t2.print() now will be 12. Dereferencing the Pointer this Sometimes a member function needs to make a copy of the invoking instance so that it can modify the copy without affecting the original instance. This can be done as follows : try_this temp(*this); try_this temp = *this ; In OOP emphasis is on how the program represents data. It is a design concept with less emphasis on operational aspects of the program. The primary concepts of OOP is implemented using class and objects. A class contains data members as well as function members. The access specifiers control the access of data members. Only the public members of the class can access the data members declared in private section. Once class has been defined, many objects of that class can be declared. Data members of different objects of the same class occupy different memory area but function members of different objects of the same class share the same set of functions. This is possible because of the internal pointer ‘*this’ which keeps track of which function is invoked by which object. 110 7.10 Self test Exercises: 1. Define a class to model a banking system. The function members should allow initializing the data members, a query to facilitate for account and a facility to deposit and withdraw from the account. WAP to implement the same. 2. Create a class called Time that has separate int member data for hours, minutes and seconds. Write functions for accepting time and displaying time. 111 UNIT 8 CONSTRUCTOR AND DESTRUCTOR Contents 8.1 8.2 8.3 8.4 Constructors 8.1.1 Syntax rules for writing constructor functions 8.1.2 Different ways of calling contructor Overloading Constructors Destructor Self test Since C++ supports the concept of user-defined classes and the subsequent initiations of these classes, it is important that initialization of these instantiations be performed so that the state of any object does not reflect “ garbage”. One of the principles of C++ is that objects know how to initialize and cleanup after themselves. This automatic initialization and clean up is accomplished by two member functions – the constructor and the destructor. 8.1 Constructors By definition, a constructor function of some class is a member function that automatically gets executed whenever an instance of the class to which the constructor belongs comes into existence. The execution of such a function guarantees that the instance variables of the class will be initialized properly. A constructor function is unique from all other functions in a class because it is not called using some instance of the class, but is invoked whenever we create an object of that class. A constructor may be overloaded to accommodate many different forms of initialization for instances of the class. i.e. for a single class many constructors can be written with different argument lists . 8.1.1 Syntax rules for writing constructor functions • Its name must be same as that of the class to which it belongs. • It is declared with no return type (not even void). However, it will implicitly return a temporary copy of the instance itself that is being created. • It cannot be declared static (a function which does not belong to a particular instance), const( in which you can not make changes). • It should have public or protected access within the class. Only in very rare circumstances the programmers declare it in private section. e.g. 112 We are going to implement CRectangle including a constructor: rect area: 12 rectb area: 30 // classes example #include <iostream.h> class CRectangle { int width, height; public: CRectangle (int,int); int area (void) {return (width*height);} }; CRectangle::CRectangle (int a, int b) { width = a; height = b; } int main () { CRectangle rect (3,4); CRectangle rectb (5,6); cout << "rect area: " << rect.area() << endl; cout << "rectb area: " << rectb.area() << endl; } . Notice the way in which the parameters are passed to the constructor at the moment at which the instances of the class are created: CRectangle rect (3,4); CRectangle rectb (5,6); 8.1.2 Different ways of calling contructor There are basically three ways of calling constructor. The first way to call the constructor is explicitly as : CRectangle rect = CRectangle (3,4); This statement creates an object with the name bigbox and initializes the data members with the parameters passed to the constructor function. The above object can also be created with an implicit call to the constructor : CRectangle rect (3,4); Both the statements given above are equivalent. Yet, another way of creating and initializing an object is by direct assignment of the data item to the object name. But, this approach works if there is only one data item in the class. This is obvious because we cannot assign more than one value at a time to a variable. 113 e.g. class counter { public : counter ( int c) // constructor. { count = c; }; private : int count; }; we can now create an object as, counter cnt = 0; In the above example , object cnt is initialized by a value zero at the time of declaration. This value is actually assigned to its data member count. This is the third way to initialize an object’s data member. Thus, all the following statements to initialize the objects of the class counter are equivalent: counter c1(20); counter c1 = counter(30); counter c1= 10; 8.2 Overloading Constructors Like any other function, a constructor can also be overloaded with several functions that have the same name but different types or numbers of parameters. Remember that the compiler will execute the one that matches at the moment at which a function with that name is called. In this case, at the moment at which a class object is declared. In fact, in the cases where we declare a class and we do not specify any constructor the compiler automatically assumes two overloaded constructors ("default constructor" and "copy constructor"). For example, for the class: class CExample { public: int a,b,c; void multiply (int n, int m) { a=n; b=m; c=a*b; }; }; with no constructors, the compiler automatically assumes that it has the following constructor member functions: 114 • Empty constructor It is a constructor with no parameters defined as nop (empty block of instructions). It does nothing. CExample::CExample () { }; Copy constructor It is a constructor with only one parameter of its same type that assigns to every non static class member variable of the object a copy of the passed object. CExample::CExample (const CExample& rv) { a=rv.a; b=rv.b; c=rv.c; } It is important to realize that both default constructors: the empty construction and the copy constructor exist only if no other constructor is explicitly declared. In case that any constructor with any number of parameters is declared, none of these two default constructors will exist. So if you want them to be there, you must define your own ones. • Of course, you can also overload the class constructor providing different constructors for when you pass parameters between parenthesis and when you do not (empty): rect area: 12 rectb area: 25 // overloading class constructors #include <iostream.h> class CRectangle { int width, height; public: CRectangle (); CRectangle (int,int); int area (void) {return (width*height);} }; CRectangle::CRectangle () { width = 5; height = 5; } CRectangle::CRectangle (int a, int b) { width = a; height = b; } int main () { CRectangle rect (3,4); CRectangle rectb; cout << "rect area: " << rect.area() << endl; cout << "rectb area: " << rectb.area() << endl; } In this case rectb was declared without parameters, so it has been initialized with the constructor that has no parameters, which declares both width and height with a value of 5. 115 Notice that if we declare a new object and we do not want to pass parameters to it we do not include parentheses (): CRectangle rectb; // right CRectangle rectb(); // wrong! 8.3 Destructors The Destructor fulfills the opposite functionality. It is automatically called when an object is released from the memory, either because its scope of existence has finished (for example, if it was defined as a local object within a function and the function ends) or because it is an object dynamically assigned and it is released using operator delete. A destructor function gets executed whenever an instance of the class to which it belongs goes out of existence. The primary usage of a destructor function is to release memory space that the instance currently has reserved. Syntax rules for writing a destructor function • Its name is the same as that of the class to which it belongs, except that the first character of the name is the symbol tilde ( ~ ). • It is declared with no return type ( not even void ) since it cannot ever return a value. • It cannot be static, const or volatile. • It takes no input arguments , and therefore cannot be overloaded. • It should have public access in class declaration. Generally the destructor cannot be called explicitly (directly) from the program. The compiler generates a class to destructor when the object expires. Class destructor is normally used to clean up the mess from an object. Class destructors become extremely necessary when class constructor use the new operator, otherwise it can be given as an empty function. However, the destructor function may be called explicitly allowing you to release the memory not required and allocate this memory to new resources, in Borland C++ version 3.1. Eg class employee { public : employee() { } ~employee(); { } }; // example on constructors and destructors 116 #include <iostream.h> class CRectangle { int *width, *height; public: CRectangle (int,int); ~CRectangle (); int area (void) {return (*width * *height);} }; CRectangle::CRectangle (int a, int b) { width = new int; height = new int; *width = a; *height = b; } CRectangle::~CRectangle () { delete width; delete height; } int main () { CRectangle rect (3,4), rectb (5,6); cout << "rect area: " << rect.area() << endl; cout << "rectb area: " << rectb.area() << endl; return 0; } 8.4 Self test 1. Create a class called Time that has a separate data members for day, month and year. A constructor should be used to initialize these members. Then write a function to add these dates and store the result in a third object and display it. 2. WAP to add co-ordinates of the plane. The class contains x and y co-ordinates. Create three objects. Use a constructor to pass one pair of co-ordinates and a function to accept the second pair. Add these variables of two objects and store the result in the third. 117 UNIT 9 FUNCTION OVERLOADING Contents 9.1 9.2 9.3 9.4 9.5 9.6 9.7 9.8 Function overloading Precautions to be taken while overloading functions. Static Class Members Static Member Functions Friend Functions Friend for Overloading Operators Granting friendship to another class Granting friendship to a member function of another class 9.1 Function Overloading Function overloading is a form of polymorphism. Function overloading facilitates defining one function having many forms. In other words it facilitates defining several functions with the same name, thus overloading the function names. Like in operator overloading, even here, the compiler uses context to determine which definition of an overloaded function is to be invoked. Function overloading is used to define a set of functions that essentially, do the same thing, but use different argument lists. The argument list of the function is also called as the function’s signature. You can overload the function only if their signatures are different. The differences can be 1. In number of arguments, 2. Data types of the arguments, 3. Order of arguments, if the number and data types of the arguments are same. e.g. int add( int, int ); int add( int, int, int ); float add( float, float ); float add( int, float, int ); float add(int,int,float); The compiler cannot distinguish if the signature is same and only return type is different. Hence, it is a must, that their signature is different. The following functions therefore raise a compilation error. e.g. 118 float add( int, float, int ); int add( int, float, int ); Consider the following example // overloaded function #include <iostream.h> 2 2.5 int divide (int a, int b) { return (a/b); } float divide (float a, float b) { return (a/b); } int main () { int x=5,y=2; float n=5.0,m=2.0; cout << divide (x,y); cout << "\n"; cout << divide (n,m); cout << "\n"; return 0; } In this case we have defined two functions with the same name, but one of them accepts two arguments of type int and the other accepts them of type float. The compiler knows which one to call in each case by examining the types when the function is called. If it is called with two ints as arguments it calls to the function that has two int arguments in the prototype and if it is called with two floats it will call to the one which has two floats in its prototype. For simplicity I have included the same code within both functions, but this is not compulsory. You can make two functions with the same name but with completely different behaviors. 9.2 Precautions to be taken while overloading function Function overloading is a boon to designers, since different names for similar functions need not be thought of, which often is a cumbersome process given that many times people run out of names. But, this facility should not be overused, lest it becomes an overhead in terms of readability and maintenance. Only those functions, which basically do the same task, on different sets of data, should be overloaded. 119 We have already seen the powerful features of OOP that make C++ such a strong language. In this session we will continue exploring some other features, which make this language more powerful. 9.3 Static Class Members As we already know all the objects of the class have different data members but invoke the same member functions. However, there is an exception to this rule. If the data member is declared with the keyword static, then only one such data item is created for the entire class, no matter how many objects it has. Static data members are useful, if all objects of a class must share a common data item. Whereas, the visibility of this data item remains same, the duration of this variable is for entire lifetime of the program. For example, such a variable can be particularly useful if an object requires to know how many objects of its kind exist. class counter { public : counter (); int getcount(); private: static int count; }; counter::counter () { count++; } int counter::getcount() { return ( count ); } int counter :: count = 0;// INITIALIZATION OF STATIC MEMBER. void main() { counter c1,c2; cout << “ Count = “ << c1.getcount() << endl; cout << “ Count = “ << c2.getcount() << endl; counter c3; cout << “ Count = “ << c3.getcount() << endl; counter c4,c5; 120 cout << “ Count = “ << c4.getcount() << endl; cout << “ Count = “ << c5.getcount() << endl; } Output: Count Count Count Count Count = = = = = 2 2 3 5 5 // not 1 because 2 objects are already created. In the above example, the class counter demonstrates the use of static data members. It contains just one data member count. Notice the initialization of this static class member. For some compilers it is mandatory. Even though the data member is in the private section it can be accessed in this case as a global variable directly, but has to be preceded by the name of the class and the scope resolution operator. This example contains a constructor to increment this variable. Similarly, there can be a destructor to decrement it. You can use these to generate register numbers for student objects from a student class. Whenever an object is created, he will be automatically assigned a register number if the register number is a static variable and a constructor is used to write an equation to generate separate register numbers in some order. You could initialize the variable to give the first register number and then use this in the constructor for further operations. 9.4 Static Member Functions:All the objects of the class share static data members of a class. The example above demonstrates how to keep track of all the objects of a class which are4 in existence. However, this function uses existing objects to invoke a member function getcount(), which returns the value of the static data member. What if the programme does not want to use objects to invoke this function and still the programme would like to know how many objects have been created? If there is no object how the member function is invoked? Further, as can be seen from the previous output, the number of objects (count) remains same at a given instance no matter which object is used to invoke the member function. In fact, the use of existing objects, like in the above example, is not an effective way to access the value of the static data member. A specific object should not be used to refer to this member, since it does not belong to that object; it belongs to the entire class. C++ gives a facility to define static function members, for the same. That is, to invoke such a function, an object is not required. It can be invoked with the name of the class. The programme given below illustrates its use. class counter { public : 121 counter (); static int getcount(); private: static int count; }; counter::counter () { count++; } int counter::getcount() { } int counter :: count = 0; // INITIALIZATION OF STATIC MEMBER. void main() { counter c1,c2; cout << “ Count = “ << counter :: getcount() << endl; cout << “ Count = “ << counter :: getcount() << endl; counter c3; cout << “ Count = “ << counter :: getcount() << endl; counter c4,c5; cout << “ Count = “ << counter :: getcount() << endl; cout << “ Count = “ << counter :: getcount() << endl; } Output: Count Count Count Count Count = = = = = 2 2 3 5 5 122 9.5 Friend Functions One of the main features of OOP is information hiding. A class encapsulates data and methods to operate on that data in a single unit . The data from the class can be accessed only through member functions of the class. This restricted access not only hides the implementation details of the methods and the data structure, it also saves the data from any possible misuse, accidental or otherwise. However, the concept of data encapsulation sometimes takes information hiding too far. There are situations where a rigid and controlled access leads to inconvenience and hardships. For instance, consider a case where a function is required to operate on object of two different classes. This function cannot be made a member of both the classes. What can be done is that a function can be defined outside both the classes and made to operate on both. That is, a function not belonging to either, but able to access both. Such a function is called as a friend function. In other words, a friend function is a nonmember function, which has access to a class’s private members. It is like allowing access to one’s personal belongings to a friend. Using a friend function is quite simple. The following example defines a friend function to access members of two classes. class Bclass; // Forward Declaration class Aclass { public : Aclass(int v) { Avar = v; } friend int addup(Aclass &ac, Bclass &bc); private : int Avar; }; class Bclass { public : Bclass(int v) { Bvar = v; } friend int addup(Aclass &ac, Bclass &bc); private : int Bvar; }; 123 int addup(Aclass &ac, Bclass &bc) { return( ac.Avar + bc.Bvar); } void main() { Aclass aobj; Bclass bobj; int total; total = addup(aobj,bobj); } The program defines two classes- Aclass and Bclass. It also has constructors for these classes. A friend function, addup(), is declared in the definition of both the classes, although it does not belong to either. This friend function returns the sum of the two private members of the classes. Notice, that this function can directly access the private members of the classes. To access the private members, the name of the member has to be prefixed with the name of the object , along with the dot operator. The first line in the program is called forward declaration. This is required to inform the compiler that the definition for class Bclass comes after class Aclass and therefore it will not show any error on encountering the object of Bclass in the declaration of the friend function. Since it does not belong to both the classes , while defining it outside we do not give any scope resolution and we do not invoke it with the help of any object. Also , the keyword friend is just written before the declaration and not used while defining. Sometimes friend functions can be avoided using inheritance and they are preferred. Excessive use of friend over many classes suggests a poorly designed program structure. But sometimes they are unavoidable. 9.6 Friend for Overloading Operators Some times friend functions cannot be avoided. For instance with the operator overloading. Consider the following class that contains data members to simulate a matrix. Several operations can be performed on the matrices. One of them is to multiply the given matrix by a number( constant literal). There are two ways in which we can do this. The two ways are : Matrix * num; [or] num * Matrix; 124 In the first case we can overload * to perform the operation and an object invokes this as the statement gets converted to : Mobj.operator*(num); Where Mobj is an object of Matrix and num is a normal integer variable. What happens to the second one ? It gets converted by the compiler as : num.operator*(Mobj); Let us see this program in detail. class Matrix { public: : : Matrix &operator*(int num); friend Matrix &operator*(int n, Matrix &m); private: int mat[20][20]; int rows, cols; } Matrix Matrix::operator*(int num) { Matrix temp; temp.rows=rows; temp.cols=cols; for(int i=1; i<=rows; i++) for(int j=1; j<=cols; j++) temp.mat[i][j]=mat[i][j]*num; return (temp); } Matrix operator*(int n, Matrix &m) { Matrix temp; temp= m*n; return temp; } void main() { Matrix M1, M2, M3; int num; : : // accept matrix one and num 125 M2=M1*num; // calls member operator function. M3=num*M1; // calls friend function. } Here when the compiler comes across the multiplication of an object by a number it invokes the operator member function. When is encountered before multiplication symbol as in the second call in the program, the compiler calls friend function . in friend function we have just reversed the arguments, which causes the member function to be invoked . Intelligent use of friend functions makes the code more readable. 9.7 Granting friendship to another class:To grant friendship to another class, write the keyword followed by the class name. The keyword class is optional. Note that this declaration also implies a forward declaration of the class to which the friendship is being granted. The implication of this declaration is that all off the member functions of the friend class are friend functions of the class that bestows the friendship. e.g. class Aclass { public : : : : friend class Bclass; // Friend declaration. private : int Avar; }; class Bclass { public : : : void fn1(Aclass ac) { Bvar = ac. Avar; // Avar can be accessed. } private : int Bvar; }; void main() { Aclass aobj; 126 Bclass bobj; Bobj,fn1(aobj); } The program declares class Bclass to be a friend of Aclass. It means that all member functions of Bclass have been granted direct access to all member functions of Aclass. 9.8 Granting friendship to a member function of another class If you want class A to grant friendship to one or more individual member functions of class B, then you must code the classes and their member functions in this manner: • Forward declare class A; • Define class B and declare (not define) the member functions: • Define class A in which you declare the friendship for the member functions of class B. Of course, you must qualify the names of these functions using the class name B and the scope resolution operator. • Define the member functions of class B; e.g. class Aclass class Bclass { public : : : void fn1(); // Can’t define here void fn3() { : : } private : int Bvar; }; class Aclass { public : : 127 : void fn1() { : } friend classB:: fn1(); friend classB:: fn2(); private : int Avar; }; void classB:: fn1() { Bvar = Avar; } void classB:: fn2() { Bvar = variable +25; } 128 UNIT 10 OPERATOR OVERLOADING Contents 10.1 10.2 10.3 10.4 10.5 10.6 10.7 10.8 10.9 Introduction to Operator Overloading. Operator Overloading Fundamentals. Implementing the operator functions. Rules for overloading the operators. Pointer oddities (assignment) and Operator Overloading. Overloading the Extraction and Insertion Operators Conversion functions. 10.7.1 Conversion from basic to user-defined variable. 10.7.2 Conversion from User-Defined to Basic data type 10.7.3 Conversion Between Objects of Different Classes 10.7.4 Conversion function in the Destination Class Table for Type Conversions Self Test 10.1 Introduction to Operator Overloading All computer languages have built in types like integers, real numbers, characters and so on. Some languages allow us to create our own data types like dates, complex numbers, co-ordinates of a point. Operations like addition, comparisons can be done only on basic data types and not on derived (user-defined) data types. If we want to operate on them we must write functions like compare (), add (). e.g. if (compare (v1, v2) = = 0) : : Where v1 and v2 are variables of the new data type and compare () is a function that will contain actual comparison instructions for comparing their member variables. However, the concept of Operator Overloading, in C++, allows a statement like if (v1 = = v2) : : Where the operation of comparing them is defined in a member function and associated with comparison operator(==). 129 The ability to create new data types, on which direct operations can be performed is called as extensibility and the ability to associate an existing operator with a member function and use it with the objects of its class, as its operands, is called as Operator Overloading. Operator Overloading is one form of Polymorphism ,an important feature of object-oriented programming .Polymorphism means one thing having many forms, i.e. here an operator can be overloaded to perform different operations on different data types on different contexts. Operator Overloading is also called operational polymorphism. Another form of polymorphism is function overloading. 10.2 Operator Overloading Fundamentals The C language uses the concept of Operator Overloading Discreetly. The asterisk (*) is used as multiplication operator as well as indirection (pointer) operator. The ampersand (&) is used as address operator and also as the bitwise logical ‘AND’ operator. The compiler decides what operation is to be performed by the context in which the operator is used. Thus, the C language has been using Operator Overloading internally. Now, C++ has made this facility public. C++ can overload existing operators with some other operations. If the operator is not used in the context as defined by the language, then the overloaded operation, if defined will be carried out. For example, in the statement x = y + z; If x, y and z are integer variables, then the compiler knows the operation to be performed. But, if they are objects of some class, then the compiler will carry out the instructions, which will be written for that operator in the class. 10.3 Implementing Operator Functions The general format of the Operator function is: return_type operator op ( argument list ); Where op is the symbol for the operator being overloaded. Op has to be a valid C++ operator, a new symbol cannot be used. e.g. Let us consider an example where we overload unary arithmetic operator ‘++’. class Counter { public : Counter(); 130 void operator++(void); private : int Count; }; Counter::Counter() { Count = 0; } void Counter::operator++(void) { ++ Count ; } void main() { Counter c1; c1++; ++c1; // increments Count to 1 // increments Count to 2 } In main() the increment operator is applied to a specific object. The function itself does not take any arguments. It increments the data member Count. Similarly, to decrement the Counter object can also be coded in the class definition as: void operator--(void) { -- Count ; } and invoked with the statement --c1; or c1--; In the above example , the compiler checks if the operator is overloaded and if an operator function is found in the class description of the object, then the statement to increment gets converted, by the compiler, to the following: c1.operator++(); This is just like a normal function call qualified by the object’s name. It has some special characters ( ++) in it. Once this conversion takes place, the compiler treats it just like any other member function from the class. Hence, it can be seen that such a facility is not a very big overhead on the compiler. 131 However, the operator function in the above example has a potential glitch. On overloading , it does not work exactly like it does for the basic data types. With the increment and decrement operators overloaded, the operator function is executed first, regardless of whether the operator is postfix or prefix. If we want to assign values to another object in main() we have to return values to the calling function. Counter Counter :: operator++(void) { Counter temp; temp.Count = ++ Count ; return ( temp ); } void main() { Counter c1,c2; c1 = c2 ++; //increments to 1, then assigns. } In this example , the operator function creates a new object temp of the class Counter, assigns the incremented value of Count to the data member of temp and returns the new object. This object is returned to main(). We can do this in another way by creating a nameless temporary object and return it. class Counter { public : Counter(); // CONSTRUCTOR WITHOUT ARGUMENTS Counter( int c); // CONSTRUCTOR WITH 1 ARGUMENT Counter operator++(void); private : int Count; }; Counter::Counter() // CONSTRUCTOR WITHOUT ARGUMENTS { Count = 0; } Counter::Counter( int c) // CONSTRUCTOR WITH 1 ARGUMENT { Count = c; 132 } Counter Counter::operator++(void) { ++ Count ; return Counter(Count); } One change we can see is a constructor with one argument. No new temporary object is explicitly created. However return statement creates an unnamed temporary object of the class Counter initializes it with the value in Count and returns the newly created object. Hence one argument constructor is required. Yet another way of returning an object from the member function is by using the this pointer. This special pointer points to the object, which invokes the function. The constructor with one argument is not required in this approach. Counter Counter :: operator++(void) { ++ Count ; return ( * this); } Consider a class COMPLEX for Complex numbers. It will have a real and an imaginary member variable. Here we can see binary operator overloaded and also how to return values from the functions. class COMPLEX { public: COMPLEX operator+(COMPLEX); private: int real, imaginary; }; Suppose that C1, C2 and C3 are objects of this class. Symbolically addition can be carried out as C3 = C1 + C2; The actual instructions of the operator are written in a special member function. e.g. COMPLEX COMPLEX :: operator+( COMPLEX C2) { COMPLEX temp; temp.real = real + C2.real; 133 temp.imaginary = imaginary + C2. imaginary; return (temp); } The above example shows how Operator Overloading is implemented. It overloads “+” operator to perform addition on objects of COMPLEX class. Here we have overloaded a binary operator(+). 10.4 Rules for overloading the operators This summarizes the most important points you need to know in order to do operator function overloading. • The only operators you may overload are the ones from the C++ list and not all of those are available. You cannot arbitrarily choose a new symbol (such as @) and attempt to “overload it. Start by declaring a function in the normal function fashion, but for the function name use the expression: Operator op Where op is the operator to be overloaded. You may leave one or more spaces before op. • • The pre-defined precedence rules cannot be changed. i.e. you cannot, for example, make binary ‘+’ have higher precedence than binary ‘*’. In addition, you cannot change the associativity of the operators. The unary operators that you may overload are: -> indirect member operator ! not & address * dereference + plus minus ++ prefix increment ++ postfix increment (possible in AT & T version 2.1) -postfix decrement -prefix decrement (possible in AT & T version 2.1) ~ one’s complement • • • The binary operators that you may overload are: (), [], new, delete, *, / , %, + , - , <<,>>, <, <=, >, >=, ==,! =, &, ^, |, &&, ||, =, *=, /=, %=, +=, -, =, <<=, >>=, &=,! =, ^=, ','(Comma). The operators that can not be overloaded are: . .* 134 direct member direct pointer to member :: ?: scope resolution ternary • No default arguments are allowed in overloaded operator functions. • As with the predefined operators, an overloaded operator may be unary or binary. If it is normally unary, then it cannot be defined to be binary and vice versa. However, if an operator can be both unary and binary, then it can be overloaded either way or both. • The operator function for a class may be either a non-static member or global friend function. A non-static member function automatically has one argument implicitly defined, namely the address of the invoking instance (as specified by the pointer variable this). Since a friend function has no this pointer, it needs to have all its arguments explicitly defined). At least one of the arguments to the overloaded function explicit or implicit must be an instance of the class to which the operator belongs. Here you have a table with a summary on how the different operator functions must be declared (replace @ by the operator in each case): • Expression Operator (@) @a + - * & ! ~ ++ -- A::operator@() operator@(A) a@ ++ -- A::operator@(int) operator@(A, int) a@b +-*/%^&|< > == != <= >= << >> && || , A::operator@(B) operator@(A, B) a@b = += -= *= /= %= ^= &= |= <<= >>= [] A::operator@(B) - a(b, c...) () A::operator()(B, C...) - a->b -> A::operator->() - Function member Global function * where a is an object of class A, b is an object of class B and c is an object of class C. You can see in this panel that there are two ways to overload some class operators: as member function and as global function. Its use is indistinct, nevertheless I remind you that functions that are not members of a class cannot access the private or protected members of the class unless the global function is friend of the class (friend is explained later). Consider an example, which depicts overloading of += (Compound assignment), <, >, == (Equality),!=, + (Concatenation) using String class. class String { public : String (); String ( char str [] ); 135 void putstr(); String operator + (String); String operator += (String s2); int operator < (String s2); int operator > (String s2); int operator == (String s2); int operator != (String s2); private : char s[100]; }; String::String () // CONSTRUCTOR WITH { // NO ARGUMENTS s[0] = 0; }; String:: String( char str [] ) // CONSTRUCTOR WITH { // ONE ARGUMENT strcpy(s,str) }; void String:: putstr()// FUNCTION TO PRINT STRING { cout << s ; }; String String :: operator+(String s2) { String temp; strcpy(temp.s,s); strcat(temp.s,s2.s); return (temp); } String String :: operator+=(String s2) { strcat(s,s2.s); return (*this); } 136 int String::operator < (String s2) { return (strcmp (s, s2.s ) < 0); } int String::operator > (String s2) { return (strcmp (s, s2.s ) > 0); } int String::operator == (String s2) { return (strcmp (s, s2.s ) == 0); } int String::operator != (String s2) { return (strcmp (s, s2.s ) != 0); } void main() { String s1 = “welcome “; String s2 = “ to the world of c++”; String s3; cout << endl << “s1 = “; s1.putstr(); cout << endl << “s2 = “; s2.putstr(); s3 = s1 + s2; cout << endl << “ s3 = “; s3.putstr(); String s4; cout <<endl<<” *********************”; s4 = s1 + = s2; cout << endl << “ s4 = “; 137 s4.putstr(); String s5 = “ Azzzz “; String s6 = “ Apple “; if( s5 < s6 ) { s5.putstr(); cout << ” < ”; s6.putstr(); } else if( s5 > s6 ) { s5.putstr(); cout << ” > ”; s6.putstr(); } else { if( s5 == s6 ) s5.putstr(); cout << ” = ”; s6.putstr(); } else { if( s5 != s6 ) s5.putstr(); cout << ” < ”; s6.putstr(); } } Output: S1 = welcome S2 = to the world of C++ S3 = welcome to the world of c++ ************************** S4 = welcome to the world of c++ // vectors: overloading operators example #include <iostream.h> class CVector { public: int x,y; CVector () {}; CVector (int,int); CVector operator + (CVector); }; 138 CVector::CVector (int a, int b) { x = a; y = b; } CVector CVector::operator+ (CVector param) { CVector temp; temp.x = x + param.x; temp.y = y + param.y; return (temp); } int main () { CVector a (3,1); CVector b (1,2); CVector c; c = a + b; cout << c.x << "," << c.y; return 0; } 10.5 Pointer Oddities and Operator Overloading Consider an example, where the data members contain pointers and have been allocated memory using the operator new. In this case using an assignment operator to assign one object to another will result in the pointer variable being copied rather than the contents at the address. The following example explains this problem. class String { public : String (char *s = “”) // CONSTRUCTOR { size = strlen(s); cptr = new char [size + 1]; strcpy(cptr,s); }; ~String() { delete cptr; 139 } void putstr() // FUNCTION TO PRINT STRING { cout << cptr ; }; private : char *cptr; int size; }; void main() { String s1(“hello students “); String s2; s2 = s1; // Assignment s1.putstr(); s2.putstr(); } The constructor function allocates a string and copies the contents of its formal variable in it. The assignment operator in main() assigns the object s1 to s2. the data members’ cptr and size , of object s1, gets assigned to s2. The output of the program is : hello students hello students Null pointer assignment Why does the program give a ‘null pointer assignment’ message ? After the program is over, the destructor function is called automatically, which releases the memory allocated by new to the data member cptr. But, after the assignment, the data member cptr of both the objects point to the same location in the memory. Thus, the delete operator called for the first object releases the memory and for the second call attempts to release the same memory location again, resulting in the error message. The solution to this problem is to define an operator function for the assignment operator . It can be done as follows: String operator = (String s2) { delete cptr; size = strlen ( s2.cptr ); cptr = new char [size + 1]; strcpy(cptr, s2.cptr); return (*this); 140 } On including this member function in the class definition of the above example , the program outputs the following : hello students hello students Hence, the operator function for assignment of an object to another of the same class removes the quirks associated with pointers as data members. 10.6 Overloading the Extraction and Insertion Operators We’ll finish this session by showing how to overload the extraction and insertion operators. Here, you can accept the input and output the results of the user-defined variables or objects just like normal variables. Consider a class Complex that consists of two system defined variables, both float to denote the real and complex part of a complex number. In general cases to accept these member variables, we need to write some function say getval() and invoke it using the object of the class say Comobj.getval() and similar method would be required to display them. Using operator overloading we can accept or display the user-defined object just like a normal variable . i.e. by overloading the extraction operator >> we can accept the complex object as, cin >> comobj; Similarly, by overloading the insertion operator we can display the member variables of the object as, cout << comobj; just as if it were a basic data type. Consider the example : class Complex { public: friend istream& operator >> (istream &is, Complex &c2) friend ostream& operator << (ostream &os, Complex &c2) private: float real, imaginary; }; istream& operator >> (istream &is, Complex &c2) { cout << “ enter real and imaginary “ << endl; 141 is >> c2.real >> c2.imaginary; return (is); } istream& operator << (ostream &os, Complex &c2) { os << “ the complex number is “ <<endl; os << c2.real << “+i”<< c2.imaginary; return (os); } void main() { Complex c1,c2; cin >> c1; cout << c1; cin >> c2; cout << c2; } The operator functions have to be declared friends , since they have to access the user class and the objects of istream and ostream classes that are system defined. Since these operator functions are friend functions, the two objects – cin and cout are passed as arguments, along with the objects of the user-class. They return the istream and ostream objects so that the operator can be chained. That is the above two input statements can also be written as, cin >> c1 >> c2; cout << c1 << c2; 10.7 Conversion functions Conversion functions are member functions used for the following purposes: 1. 2. 3. Conversion of object to basic data type. Conversion of basic data type to object. Conversion of objects of different classes. Conversions of one basic data type to another are automatically done by the compiler using its own built-in routines (implicit) and it can be forced using built-in conversion routines (explicit). However, since the compiler does not know anything about user-defined types (classes), the program has to define conversion functions. e.g. int i = 2, j =19; 142 float f = 3.5; i = f; // i gets the value 3 , implicit conversion f = (float) j; // f gets 19.00, explicit conversion 10.7.1 Conversion from Basic to User-Defined variable Consider the following example. class Distance { public : Distance(void) // Constructor with no { // argument feet = 0; inches = 0.0; }; Distance(float metres) { float f; // Constructor with f = 3.28 * metres; // one argument feet = int(f); // also used for inches = 12 * ( f – feet);// conversion void display(void) { cout << “ Feet = “ << feet <<”,”; cout << “ Inches = “ << inches << endl; }; private : int feet; float inches; }; void main (void) { Distance d1 = 1.25; // Uses 2nd constructor Distance d2; // Uses 1st constructor float m; d2 = 2.0 ; // Uses 2nd constructor cout << “ 1.25 metres is : “ << d1.showdist() ; 143 }; cout << “ 2.0 metres is :“ << d2.showdist(); } Output : 1.25 metres is :FEET = 4 , INCHES = 1.199999 2.0 metres is :FEET = 6 , INCHES = 6.719999 The above program converts distance in metres ( basic data type) into feet and inches ( members of an object of class Distance ). The declaration of first object d1 uses the second constructor and conversion takes place. However, when the statement encountered is d2 = 2.0; The compiler first checks for an operator function for the assignment operator. If the assignment operator is not overloaded, then it uses the constructor to do the conversion. 10.7.2. Conversion from User-Defined to Basic data type The following program uses the program in the previous section to convert the Distance into metres(float). class Distance { public : Distance(void) // Constructor with no { // argument feet = 0; inches = 0.0; }; Distance(float metres) { float f; // Constructor with f = 3.28 * metres; // one argument feet = int(f); // Also used for = 12 * ( f – feet); //conversion }; operator float(void) // Conversion function { // from Distance to float float f; f = inches / 12; f = f + float (feet); return ( f/ 3.28 ); }; 144 inches void display(void) { cout << “ Feet = “ << feet <<”,”; cout << “ Inches = “ << inches << endl; }; private : int feet; float inches; }; void main (void) { Distance d1 = 1.25; // Uses 2nd constructor Distance d2; // Uses 1st constructor float m; d2 = 2.0 ; // Uses 2nd constructor cout << “ 1.25 metres is :“ << d1.showdist (); cout << “ 2.0 metres is :“ << d2.showdist (); cout << “ CONVERTED BACK TO METRES “; m = float ( d1 ); // Calls function explicitly. cout << “ d1 = “ << m; m = d2; // Calls function explicitly. cout << “ d2 = “ << m; } Output: 1.25 metres is :FEET = 4 ,INCHES = 1.199999 2.0 metres is :FEET = 6 ,INCHES = 6.719999 CONVERTED BACK TO METRES d1 = 1.25 d2 = 2.00 Actually, this conversion function is nothing but overloading the typecast operator float(). The conversion is achieved explicitly and implicitly. 145 m = float (d1); is forced where as , in the second assignment statement m = d2; first the compiler checks for an operator function for assignment ( = ) operator and if not found it uses the conversion function. The conversion function must not define a return type nor should it have any arguments. 10.7.3 Conversion Between Objects of Different Classes Since the compiler does not know anything about the user-defined type, the conversion instructions are to be specified in a function. The function can be a member function of the source class or a member function of the destination class. We will consider both the cases. Consider a class DistFeet which stores distance in terms of feet and inches and has a constructor to receive these. The second class DistMetres store distance in metres and has a constructor to receive the member. Conversion function in the Source Class class DistFeet { public : DistFeet(void) { // Constructor with no // argument feet = 0; inches = 0.0; }; DistFeet(int ft,float in) { feet = ft; inches = in }; void ShowFeet(void) { cout << “ Feet = “ << feet << “,”; cout << “ Inches = “ << inches << endl; }; private : int feet; float inches; }; 146 class DistMetres { public: DistMetres(void) { metres = 0 ; // constructor 1. } DistMetres(float m) { metres = m ; // constructor 2. } void ShowMetres(void) { cout << “ Metres = “ << metres << endl; }; operator DistFeet(void) // conversion { // function float ffeet, inches; int ifeet; ffeet = 3.28 * metres; ifeet = int (ffeet); inches = 12 * (ffeet – ifeet); return(DistFeet(inches,ifeet); }; private: float metres; }; void main (void) { DistMetres dm1 = 1.0; DistFeet df1; df1 = dm1 ; // OR df1 = DistFeet(dm1); // Uses conversion function dm1.ShowMetres(); df1.ShowFeet(); } 147 In the above example, DistMetres contains a conversion function to convert the distance from DistMetres ( source class), to DistFeet ( destination class). The statement to convert one object to another df1 = dm1; calls the conversion function implicitly. It could also have been called explicitly as df1 = DistFeet(dm1); 10.7.4 Conversion function in the Destination Class class DistMetres { public: DistMetres(void) { metres = 0 ; // Constructor 1. } DistMetres(float m) { metres = m ; // constructor 2. } void ShowMetres(void) { cout << “ Metres = “ << metres << endl; }; float GetMetres(void) { return(metres); } private: float metres; }; class DistFeet { public : DistFeet(void) // Constructor1 with no { // argument feet = 0; inches = 0.0; 148 }; DistFeet(int ft,float in) { feet = ft; inches = in; }; void ShowFeet(void) { cout << “ Feet = “ << feet << endl; cout << “ Inches = “ << inches << endl; }; DistFeet( DistMetres dm) // Constructor 3 { float ffeet; ffeet = 3.28 * dm.GetMetres(); feet = int (ffeet); inches = 12 * (ffeet – ifeet); }; private : int feet; float inches; }; void main (void) { DistMetres dm1 = 1.0; DistFeet df1; df1 = dm1 ; // Uses 2nd constructor // class DistMetres // Uses 1st constructor // class DistFeet // OR df1 = DistFeet(dm1); // Uses 3rd conversion function dm1.ShowMetres(); df1.ShowFeet(); } This program works same as previous function. Here constructor is written in the destination class. Also, we can see a new function GetMetres() . The function returns the data member metres of the invoking object. The function is required because the constructor is defined in the DistFeet class and since metres is a private member of the DistMetres class, it cannot be accessed directly in the constructor function in the DistFeet class. 149 Since you can use any of the above methods, it is strictly a matter of choice which method you choose to implement. 10.8 Table for Type Conversions Operation Function in Destination Class Operation Function in Source Class Basic to class Constructor Not Allowed Class to Basic Not Allowed Conversion Function Class to Class Constructor Conversion Function This session covered yet another concept of OOP – Polymorphism. It means one thing having many forms. It is very powerful, yet, simple concept which gives the C++ language a facility to redefine itself into a new language. There are two types of Polymorphism- operational and functional. This session covered operational polymorphism also called as operator overloading. We will see function overloading in future. 10.9 Self Test 1. WAP to add 2 complex number using OOT(Operator Overloading techniques). 2. WAP to add 2 times using OOT and display the resultant time in watch format. 3. WAP to add, subtract and multiply 2 matrices using OOT. Sort an array of objects. Each object has a string as a member variable. Overload >= or <= operators to compare the two strings. { make use of constructors and destructors whenever possible } 4. WAP to create a class called DATE . Accept 2 valid dates in the form of dd/mm/yyyy. Implement the following by overloading the operators – and + . Display the result after every operation. no_of_dasy = d1 – d2, where d1 and d2 are DATE objects; d1 > = d2 ; and no_of_days is an integer. b) d1 = d1 + no_of_days - where d1 is a DATE object and no_of_days is an integer. a) 150 5. Modify the matrix program (program 3) slightly. Overload == operator to compare 2 matrices to be added or subtracted. i.e., whether the column of first and the row of second matrix are same or not. if(m1==m2) { m3=m1+m2; m4=m1-m2; } else display error; 6. WAP to concatenate 2 strings by using a copy constructor. ------------ --------------- ---------------- -------------- 151 UNIT 11 INHERITANCE Contents 11.1 11.2 11.3 11.4 11.5 11.6 11.7 11.8 11.9 11.10 Reusability. Inheritance concept- single inheritance. 11.2.1 Private derivation 11.2.2 Public derivation 11.2.3 The Protected Access 11.2.4 Summary of derivation 11.2.5 Table of derivation and access specifiers Using the derived class Constructor and destructor in derived class. Object initialization and conversion. Nested classes (Container classes). Multilevel inheritance. Multiple inheritance. Hybrid Inheritance. Virtual base class. Object-oriented programming as seen in the preceding sessions emphasizes the data, rather than emphasizing algorithms. The previous sessions covered OOP features like extensibility, data encapsulation, information hiding, functional polymorphism and operational polymorphism. OOP, however, has more jargon associated to it, like reusability, inheritance. This session covers reusability and inheritance. 11.1 Reusability Reusability means reusing code written earlier ,may be from some previous project or from the library. Reusing old code not only saves development time, but also saves the testing and debugging time. It is better to use existing code, which has been time-tested rather than reinvent it. Moreover, writing new code may introduce bugs in the program. Code, written and tested earlier, may relieve a programmer of the nitty-gritty. Details about the hardware, user-interface, files and so on. It leaves the programmer more time to concentrate on the overall logistics of the program. What is inheritance? Class, the vehicle, which is used to implement object-oriented concepts in C++, has given a new dimension to this idea of reusability. Many vendors now offer libraries of classes. A class library consists of data and methods to operate on that data, encapsulated in a class . The source 152 code of these libraries need not be available to modify them. The new dimension of OOP uses a method called inheritance to modify a class to suit one’s needs. Inheritance means deriving new classes from the old ones. The old class is called the base class or parent class or super class and the class which is derived from this base class is called as derived class or child class or sub class. Deriving a new class from an existing one , allows redefining a member function of the base class and also adding new members to the derived class . and this is possible without having the source of the course definition also. In other words, a derived class not only inherits all properties of the base class but also has some refinements of its own. The base class remains unchanged in the process. In other words, the derived class “is a “ type of base class, but with more details added. For this reason, the relationship between a derived class and its base class is called an “is-a” relationship. Class A Class B Single Inheritance Here class A is a base class and the class B is the derived class. How to define a derived class ? A singly inherited derived class id defined by writing : • The keyword class. • The name of the derived class . • A single colon (:). • The type of derivation ( private , protected, or public ). • The name of the base, or parent class. • The remainder of the class definition. e.g. class A { public : int public_A; void public_function_A(); private : int pri_A; void private_function_A(); 153 protected : int protected_A; void protected_function_A(); }; class B : private A { public : int public_B; void public_function_B(); private : int pri_B; void private_function_B(); }; class C : public A { public : int public_C; void public_function_C(); private : int pri_C; void private_function_C(); }; class D : protected A { public : int public_D; void public_function_D(); private : int pri_D; void private_function_D(); }; A derived class always contains all of the member members from its base class . you cannot “subtract” anything from a base class. However, accessing the inherited variables is a different matter. It is also important to understand the privileges that the derived class has insofar as access to members of its base class are concerned. In other words, just because you happen to derive a class does not mean that you are automatically granted complete and unlimited access privileges to the members of the base class. to understand this you must look at the different types of derivation and the effect of each one. 154 11.2.1. Private derivation If no specific derivation is listed, then a private derivation is assumed. If a new class is derived privately from its parent class , then : • The private members inherited from its base class are inaccessible to new member functions in the derived class . this means that the creator of the base class has absolute control over the accessibility of these members , and there is no way that you can override this. • The public members inherited from the base class have private access privilege. In other words, they are treated as though they were declared as new private members of the derived class, so that new member functions can access them. However, if another private derivation occurs from this derived class, then these members are inaccessible to new member functions. e.g. class base { private : int number; }; class derived : private base { public : void f() { ++number; // Private base member not accessible } }; The compiler error message is ‘ base :: number ‘ is not accessible in the function derived :: f(); e.g. class base { public : int number; }; class derived : private base { 155 public : void f() { ++number; // Access to number O.K. } }; class derived2 : private derived { public : void g() { ++number; // Access to number is prohibited. } }; The compiler error message is ‘ base :: number ‘ is not accessible in the function derived2 :: g(); Since public members of a base class are inherited as private in the derived class, the function derived :: f() has no problem accessing it . however, when another class is derived from the class derived , this new class inherits number but cannot access it. Of course, if derived1::g() were to call upon derived::f(), there is no problem since derived::f() is public and inherited into derived2 as private. i.e. In derived2 we can write, void g() { f(); } or there is another way. Writing access declaration does this. class base { public : int number; }; class derived : private base { public : base :: number ; void f() { 156 ++number; // Access to number O.K. } }; class derived2 : private derived { public : void g() { ++number; } }; // Access to number O.K As you have just seen private derivations are very restrictive in terms of accessibility of the base class members . therefore, this type of derivation is rarely used. 11.2.2 Public derivation Public derivations are much more common than private derivations. In this situation : • The private members inherited from the base class are inaccessible to new members functions in the derived class. • The public members inherited from the base class may be accessed by new members functions in the derived class and by instances of the derived class . e.g. class base { private : int number; }; class derived : public base { public : void f() { ++number; // Private base member not accessible } }; 157 The compiller error message is ‘ base :: number ‘ is not accessible in the function derived::f(); Here, only if the number is public then you can access it. Note : However example 2 and 3 in the above section works here if you derive them as “public”. 11.2.3 The Protected Access In the preceding example, declaring the data member number as private is much too restrictive because clearly new members function in the derived class need to gain access to it and in order to perform some useful work. To solve this dilemma, the C++ syntax provides another class access specification called protected . here is how protected works : • In a private derivation the protected members inherited from the base class have private access privileges. Therefore, new member functions and friend of the derived class may access them. • In a public derivation the protected members inherited from the base class retain their protected status. They may be accessed by new members function and friends of the derived class . In both situations the new members functions and friends of the derived class have unrestricted access to protected members . however, as the instances of the derived class are concerned, protected and private are one and the same, so that direct access id always denied. Thus, you can see that the new category of protected provides a middle ground between public and private by granting access to new function and friends of the derived class while still blocking out access to non-derived class members and friend functions . class base { protected : int number; }; class derived : public base { public : void f() { ++number; } }; // base member access O.K. 158 Protected derivation In addition to doing private and public derivations, you may also do a protected derivation. In this situation : • The private members inherited from the base class are inaccessible to new member functions in the derived class. ( this is exactly same as if a private or public derivation has occurred.) • The protected members inherited from the base class have protected access privilege. • The public members inherited from the base class have protected have protected access. Thus , the only difference between a public and a protected derivation is how the public members of the parent class are inherited. It is unlikely that you will ever have occasion to do this type of derivation. Summary of access privileges 1. If the designer of the base class wants no one, not even a derived class to access a member , then that member should be made private . 2. If the designer wants any derived class function to have access to it, then that member must be protected. 3. if the designer wants to let everyone , including the instances, have access to that member , then that member should be made public . 11.2.4 Summary of derivations 1. Regardless of the type of derivation, private members are inherited by the derived class , but cannot be accessed by the new member function of the derived class , and certainly not by the instances of the derived class . 2. In a private derivation, the derived class inherits public and protected members as private . a new members function can access these members, but instances of the derived class may not. Also any members of subsequently derived classes may not gain access to these members because of the first rule. 3. In public derivation, the derived class inherits public members as public , and protected as protected . a new member function of the derived class may access the public and protected members of the base class ,but instances of the derived class may access only the public members. 4. In a protected derivation, the derived class inherits public and protected members as protected .a new members function of the derived class may access the public and protected members of the base class, both instances of the derived class may access only the public members . 11.2.5 Table of Derivation and access specifiers 159 Derivation Type Base Class Member Private Public Protected Access in Derived Class Private (inaccessible ) Public Private Protected Private Private (inaccessible ) Public Public Protected Protected Private (inaccessible ) Public Protected Protected Protected We can summarize the different access types according to whom can access them in the following way: Access public protected private members of the same class yes yes yes members of derived classes yes yes no Not-members yes no no where "not-members" represent any reference from outside the class, such as from main(), from another class or from any function, either global or local. 11.3. Using the Derived Class An instance of a derived class has complete access to the public members of the base class . assuming that the same name does not exist within the scope of the derived class , the members from the base class will automatically be used. Because there is no ambiguity involved in this situation, you do not need to use scope resolution operator to refer to this base class member. class base { public : base(int n = 0) { number = n; } int get_number(); protected : int number; }; 160 int base :: get_number() { return number; } class derived : public base { }; void main() { derived d; // First checks class derived , then class base cout << d.get_number(); // Goes directly to class base cout<< d.base ::get_number(); } Output: 0 0. 11.4 Constructor and destructor in derived class. What is inherited from the base class? In principle every member of a base class is inherited by a derived class except: • Constructor and destructor • operator=() member • friends Although the constructor and destructor of the base class are not inherited, the default constructor (i.e. constructor with no parameters) and the destructor of the base class are always called when a new object of a derived class is created or destroyed. If the base class has no default constructor or you want that an overloaded constructor is called when a new derived object is created, you can specify it in each constructor definition of the derived class: derived_class_name (parameters) : base_class_name (parameters) {} For example: mother: no parameters daughter: int parameter // constructors and derivated classes #include <iostream.h> 161 class mother { public: mother () { cout << "mother: no parameters\n"; } mother (int a) { cout << "mother: int parameter\n"; } }; mother: int parameter son: int parameter class daughter : public mother { public: daughter (int a) { cout << "daughter: int parameter\n\n"; } }; class son : public mother { public: son (int a) : mother (a) { cout << "son: int parameter\n\n"; } }; int main () { daughter cynthia (1); son daniel(1); return 0; } Observe the difference between which mother's constructor is called when a new daughter object is created and which when it is a son object. The difference is because the constructor declaration of daughter and son: daughter (int a) // nothing specified: call default constructor son (int a) : mother (a) // constructor specified: call this one 11.5 Object Initialization and conversion Initialization An object of a derived class can be initialized to an object of a base class . If both the classes have same data members , then no specific constructor needs to be defined in the derived class . It uses the constructor of the base class . An object of a base class can be assigned to the object of the derived class , if the derived class doesn’t contain any additional data members . However, if it does , then the assignment operator will have to be overloaded for the same. Conversions 162 Just like initialization , conversions are also done automatically when an object of a derived class is assigned to an object of the base class . However, the compiler resorts to a member-wise assignment in the absence of an overloaded function for the assignment operator . 11.6 Nested Classes Many a times, it becomes necessary to have a class contain properties of two other classes. One way is to define a class within another – that is a class with member classes also called nested classes. This has nothing to do with inheritance. Another way is multiple inheritance , which will be discussed later. e.g. class Aclass { public : Aclass(int pv) { private_variable_A = pv; } private : int private_variable_A; }; class Bclass { public : Bclass(int bpv, int apv): Aobj(apv) { private_variable_B = bpv; } private : int private_variable_B; Aclass Aobj; // Declaring an object here. }; As can be seen , the class Bclass contains an object Aobj in its private section as one of its members. Also, it contains a constructor function with the same name , to which the two variables passed are bpv and apv. The variable bpv is used to initialize the private variable of Bclass. However, the constructor contains something after the colon. The part after the colon in the definition of a constructor is called as initialization section and the actual body of the constructor is called as assignment section . Initialization section initializes the base class members , whereas assignment section contains statements to initialize the derived class members . 163 As seen earlier , in case of the derived classes, the name of the base class constructor is written after colon in the initialization section. This base class constructor is called before the constructor in the derived class. However, this example does not contain a derived class. This is the example of the nested class . In this case, the name of the object of the member class ‘Aclass’ is written after the colon. It tells the compiler to initialize the Aobj data member of Bclass with the value in apv. It is exactly like declaring an object of Aclass with the statement : Aclass Aobj(apv); The only change is that, it is written after the colon in the initialization section of the Bclass constructor. Its assignment section contains code to initialize its own members. The same constructor function can also be written as : Bclass (int bovine apv):Aobj(apv),private_variable_B = bpv; 11.7 Multilevel Inheritance In multilevel inheritance there is a parent class , from whom we derive another class . now from this derived class we can derive another class and so on. 164 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Class A Class B Class C Multilevel Inheritance class Aclass { : : } class Bclass : public Aclass { : : } class Cclass : public Bclass { : : } 11.8 Multiple Inheritance 165 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Multiple inheritance , as the name suggests , is deriving a class from more than one class . The derived class inherits all the properties of all its base classes. Consider the following example : Class A Class B Class C Multiple Inheritance class Aclass { : : }; class Bclass { : : }; class Cclass : public Aclass , public Bclass { private : : : public : Cclass(...) : Aclass (...), Bclass(...) { }; }; The class Cclass in the above example is derived from two classes – Aclass and Bclass – therefore the first line of its class definition contains the name of two classes, both 166 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ publicly inherited. Like with normal inheritance , constructors have to be defined for initializing the data members of all classes. The constructor in Cclass calls constructors for base classes. The constructor calls are separated by commas. 11. 9 Multiple inheritance with a common base (Hybrid Inheritance) Inheritance is an important and powerful feature of OOP. Only the imagination of the person concerned is the limit. There are many combinations in which inheritance can be put to use. For instance, inheriting a class from two different classes, which in turn have been derived from the same base class . e.g. base Class A Class B derived Hybrid Inheritance class base { : : }; class Aclass : public base 167 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ { : : }; class Bclass : public base { : : }; class derived : public Aclass, public Bclass { : : }; Aclass and Bclass are two classes derived from the same base class . The class derived has a common ancestor – class base. This is multiple inheritance with a common base class . However, this subtlety of class inheritance is not all that simple. One potential problem here is that both, Aclass and Bclass, are derived from base and therefore both of them, contains a copy of the data members base class. The class derived is derived from these two classes. That means it contains two copies of base class members – one from Aclass and the other from Bclass. This gives rise to ambiguity between the base data members. Another problem is that declaring an object of class derived will invoke the base class constructor twice. The solution to this problem is provided by virtual base classes. 11. 10 Virtual Base Classes This ambiguity can be resolved if the class derived contains only one copy of the class base. This can be done by making the base class a virtual class. This keyword makes the two classes share a single copy of their base class . It can be done as follows : class base { : : }; 168 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ class Aclass : virtual public base { : : }; class Bclass : virtual public base { : : }; class derived : public Aclass, public Bclass { : : }; This will resolve the ambiguity involved. 169 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ UNIT 12 ABSTRACT AND VIRTUAL FUNCTION Contents 12.1 Abstract class. 12.2 Virtual function. 12.3 Pure virtual function 12.4 Self test 12.1. Abstract Classes Abstract classes are the classes, which are written just to act as base classes. Consider the following classes. class base { : : }; class Aclass : public base { : : }; class Bclass : public base { : : }; class Cclass : public base { : : }; 170 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ void main() { Aclass objA; Bclass objB; Cclass objC; : : } There are three classes – Aclass, Bclass, Cclass – each of which is derived from the class base. The main () function declares three objects of each of these three classes. However, it does not declare any object of the class base. This class is a general class whose sole purpose is to serve as a base class for the other three. Classes used only for the purpose of deriving other classes from them are called as abstract classes. They simply serve as base class , and no objects for such classes are created. 12.2 Virtual Functions The keyword virtual was earlier used to resolve ambiguity for a class derived from two classes, both having a common ancestor. These classes are called virtual base classes. This time it helps in implementing the idea of polymorphism with class inheritance . The function of the base class can be declared with the keyword virtual. The program with this change and its output is given below. class Shape { public : virtual void print() { cout << “ I am a Shape “ << endl; } }; class Triangle : public Shape { public : void print() { 171 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ cout << “ I am a Triangle “ << endl; } }; class Circle : public Shape { public : void print() { cout << “ I am a Circle “ << endl; } }; void main() { Shape S; Triangle T; Circle C; S.print(); T.print(); C.print(); Shape *ptr; ptr = &S; ptr -> print(); ptr = &T; ptr -> print(); ptr = &C; ptr -> print(); } The output of the program is given below: I am a Shape I am a Triangle I am a Circle I am a Shape I am a Triangle I am a Circle 172 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Now, the output of the derived classes are invoked correctly. When declared with the keyword virtual , the compiler selects the function to be invoked, based upon the contents of the pointer and not the type of the pointer. This facility can be very effectively used when many such classes are derived from one base class . Member functions of each of these can be ,then, invoked using a pointer to the base class . 12.3 Pure Virtual Functions As discussed earlier, an abstract class is one, which is used just for deriving some other classes. No object of this class is declared and used in the program. Similarly, there are pure virtual functions which themselves won’t be used. Consider the above example with some changes. class Shape { public : virtual void print() = 0; // Pure virtual function }; class Triangle : public Shape { public : void print() { cout << “ I am a Triangle “ << endl; } }; class Circle : public Shape { public : void print() { cout << “ I am a Circle “ << endl; } }; 173 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ void main() { Shape S; Triangle T; Circle C; Shape *ptr; ptr = &T; ptr -> print(); ptr = &C; ptr -> print(); } The output of the program is given below: I am a Triangle I am a Circle It can be seen from the above example that , the print() function from the base class is not invoked at all . even though the function is not necessary, it cannot be avoided, because , the pointer of the class Shape must point to its members. Object oriented programming has altered the program design process. Exciting OOP concepts like polymorphism have given a big boost to all this. Inheritance has further enhanced the language. This session has covered some of the finer aspects of inheritance. The next session will resolve some finer aspects of the language. EXAMPLE:20 10 0 // virtual members #include <iostream.h> class CPolygon { protected: int width, height; public: void set_values (int a, int b) 174 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ { width=a; height=b; } virtual int area (void) { return (0); } }; class CRectangle: public CPolygon { public: int area (void) { return (width * height); } }; class CTriangle: public CPolygon { public: int area (void) { return (width * height / 2); } }; int main () { CRectangle rect; CTriangle trgl; CPolygon poly; CPolygon * ppoly1 = &rect; CPolygon * ppoly2 = &trgl; CPolygon * ppoly3 = &poly; ppoly1->set_values (4,5); ppoly2->set_values (4,5); ppoly3->set_values (4,5); cout << ppoly1->area() << endl; cout << ppoly2->area() << endl; cout << ppoly3->area() << endl; return 0; } Now the three classes (CPolygon, CRectangle and CTriangle) have the same members: width, height, set_values() and area(). area() has been defined as virtual because it is later redefined in derived classes. You can verify if you want that if you remove this word (virtual) from the code and then you execute the program the result will be 0 for the three polygons instead of 20,10,0. That is because instead of calling the corresponding area() function for each object (CRectangle::area(), CTriangle::area() and CPolygon::area(), respectively), CPolygon::area() will be called for all of them since the calls are via a pointer to CPolygon. 175 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Therefore what the word virtual does is to allow that a member of a derived class with the same name as one in the base class be suitably called when a pointer to it is used, as in the above example. Note that in spite of its virtuality we have also been able to declare an object of type CPolygon and to call its area() function, that always returns 0 as the result. 12.4 Self test Create a class drugs containing encapsulated data for medicine name, whether solid or liquid, price and purpose of use. From this class derive two classes, Ayurvedic and Allopathic. The class Ayurvedic should additionally store data on the herbs used, association to be used (whether honey or water). The class Allopathic should additionally include data on the chemicals used and the weight in milligrams. The classes should contain constructors and destructors. They should contain functions to accept data and display the data. The main() should test the derived classes. 176 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ UNIT 13 TEMPLATES AND EXCEPTION HANDLING Contents 13.1 Templates. 13.1.1 Function template 13.1.2 Class templates 13.1.3 Template specialization 13.1.4 Parameter values for templates 13.1.5 Templates and multiple -file project 13.2 Exception handling 13.2.1 Exception not caught 13.2.2 Standard exception 13.3 Advanced class type-casting. 13.3.1 reinterpret cast 13.3.2 static cast 13.3.3 dynamic cast 13.3.4 const_cast 13.3.5 typeid 13.4 Preprocessor directives. 13.1 Templates 13.1.1 Function templates Templates allow to create generic functions that admit any data type as parameters and return a value without having to overload the function with all the possible data types. Until certain point they fulfill the functionality of a macro. Its prototype is any of the two following ones: template <class identifier> function_declaration; template <typename identifier> function_declaration; the only difference between both prototypes is the use of keyword class or typename, its use is indistinct since both expressions have exactly the same meaning and behave exactly the same way. For example, to create a template function that returns the greater one of two objects we could use: 177 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ template <class GenericType> GenericType GetMax (GenericType a, GenericType b) { return (a>b?a:b); } As the first line specifies, we have created a template for a generic data type that we have called GenericType. Therefore in the function that follows, GenericType becomes a valid data type and it is used as the type for its two parameters a and b and as the return type for the function GetMax. GenericType still does not represent any concrete data type; when the function GetMax will be called we will be able to call it with any valid data type. This data type will serve as a pattern and will replace GenericType in the function. The way to call a template class with a type pattern is the following: function <pattern> (parameters); Thus, for example, to call GetMax and to compare two integer values of type int we can write: int x,y; GetMax <int> (x,y); so GetMax will be called as if each appearance of GenericType was replaced by an int expression. Here is the complete example: 6 10 // function template #include <iostream.h> template <class T> T GetMax (T a, T b) { T result; result = (a>b)? a : b; return (result); } int main () { int i=5, j=6, k; long l=10, m=5, n; k=GetMax<int>(i,j); n=GetMax<long>(l,m); cout << k << endl; cout << n << endl; return 0; 178 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ } (In this case we have called the generic type T instead of GenericType because it is shorter and in addition is one of the most usual identifiers used for templates, although it is valid to use any valid identifier). In the example above we used the same function GetMax() with arguments of type int and long having written a single implementation of the function. That is to say, we have written a function template and called it with two different patterns. As you can see, within our GetMax() template function the type T can be used to declare new objects: T result; result is an object of type T, like a and b, that is to say, of the type that we enclose between angle-brackets <> when calling our template function. In this concrete case where the generic T type is used as a parameter for function GetMax the compiler can find out automatically which data type is passed to it without having to specify it with patterns <int> or <long>. So we could have written: int i,j; GetMax (i,j); since both i and j are of type int the compiler would assume automatically that the wished function is for type int. This implicit method is more usual and would produce the same result: 6 10 // function template II #include <iostream.h> template <class T> T GetMax (T a, T b) { return (a>b?a:b); } int main () { int i=5, j=6, k; long l=10, m=5, n; k=GetMax(i,j); n=GetMax(l,m); cout << k << endl; cout << n << endl; return 0; } Notice how in this case, within function main() we called our template function GetMax() without explicitly specifying the type between angle-brackets <>. The compiler automatically determines what type is needed on each call. 179 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Because our template function includes only one data type (class T) and both arguments it admits are both of that same type, we cannot call our template function with two objects of different types as parameters: int i; long l; k = GetMax (i,l); This would be incorrect, since our function waits for two arguments of the same type (or class). We can also make template-functions that admit more than one generic class or data type. For example: template <class T, class U> T GetMin (T a, U b) { return (a<b?a:b);} In this case, our template function GetMin() admits two parameters of different types and returns an object of the same type as the first parameter (T) that is passed. For example, after that declaration we could call the function by writing: int i,j; long l; i = GetMin<int,long> (j,l); or simply i = GetMin (j,l); even though j and l are of different types. 13.1.2 Class templates We also have the possibility to write class templates, so that a class can have members based on generic types that do not need to be defined at the moment of creating the class or whose members use these generic types. For example: template <class T> class pair { T values [2]; public: pair (T first, T second) { values[0]=first; values[1]=second; } 180 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ }; The class that we have just defined serves to store two elements of any valid type. For example, if we wanted to declare an object of this class to store two integer values of type int with the values 115 and 36 we would write: pair<int> myobject (115, 36); this same class would also serve to create an object to store any other type: pair<float> myfloats (3.0, 2.18); The only member function has been defined inline within the class declaration. If we define a function member outside the declaration we must always precede the definition with the prefix template <... >. 100 // class templates #include <iostream.h> template <class T> class pair { T value1, value2; public: pair (T first, T second) {value1=first; value2=second;} T getmax (); }; template <class T> T pair<T>::getmax () { T retval; retval = value1>value2? value1 : value2; return retval; } int main () { pair <int> myobject (100, 75); cout << myobject.getmax(); return 0; } notice how the definition of member function getmax begins: 181 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ template <class T> T pair<T>::getmax () All Ts that appear are necessary because whenever you declare member functions you have to follow a format similar to this (the second T makes reference to the type returned by the function, so this may vary). 13.1.3 Template Specialization A template specialization allows a template to make specific implementations when the pattern is of a determined type. For example, suppose that our class template pair included a function to return the result of the module operation between the objects contained in it, but we only want it to work when the contained type is int. For the rest of the types we want this function to return 0. This can be done the following way: 25 0 // Template specialization #include <iostream.h> template <class T> class pair { T value1, value2; public: pair (T first, T second) {value1=first; value2=second;} T module () {return 0;} }; template <> class pair <int> { int value1, value2; public: pair (int first, int second) {value1=first; value2=second;} int module (); }; template <> int pair<int>::module() { return value1%value2; } 182 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ int main () { pair <int> myints (100,75); pair <float> myfloats (100.0,75.0); cout << myints.module() << '\n'; cout << myfloats.module() << '\n'; return 0; } As you can see in the code the specialization is defined this way: template <> class class_name <type> The specialization is part of a template, for that reason we must begin the declaration with template <>. And indeed because it is a specialization for a concrete type, the generic type cannot be used in it and the first angle-brackets <> must appear empty. After the class name we must include the type that is being specialized enclosed between angle-brackets <>. When we specialize a type of a template we must also define all the members equating them to the specialization (if one pays attention, in the example above we have had to include its own constructor, although it is identical to the one in the generic template). The reason is that no member is "inherited" from the generic template to the specialized one. 13.1.4 Parameter values for templates Besides the template arguments preceded by the class or typename keywords that represent a type, function templates and class templates can include other parameters that are not types whenever they are also constant values, like for example values of fundamental types. As an example see this class template that serves to store arrays: 100 3.1416 // array template #include <iostream.h> template <class T, int N> class array { T memblock [N]; public: void setmember (int x, T value); T getmember (int x); }; template <class T, int N> array<T,N>::setmember (int x, T value) { memblock[x]=value; 183 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ } template <class T, int N> T array<T,N>::getmember (int x) { return memblock[x]; } int main () { array <int,5> myints; array <float,5> myfloats; myints.setmember (0,100); myfloats.setmember (3,3.1416); cout << myints.getmember(0) << '\n'; cout << myfloats.getmember(3) << '\n'; return 0; } It is also possible to set default values for any template parameter just as it is done with function parameters. Some possible template examples seen above: template template template template template <class T> // The most usual: one class parameter. <class T, class U> // Two class parameters. <class T, int N> // A class and an integer. <class T = char> // With a default value. <int Tfunc (int)> // A function as parameter. 13.1.5 Templates and multiple-file projects From the point of view of the compiler, templates are not normal functions or classes. They are compiled on demand, meaning that the code of a template function is not compiled until an instantiation is required. At that moment, when an instantiation is required, the compiler generates a function specifically for that type from the template. When projects grow it is usual to split the code of a program in different source files. In these cases, generally the interface and implementation are separated. Taking a library of functions as example, the interface generally consists of the prototypes of all the functions that can be called. These are generally declared in a "header file" with .h extension, and the implementation (the definition of these functions) is in an independent file of c++ code. The macro-like functionality of templates, forces a restriction for multi-file projects: the implementation (definition) of a template class or function must be in the same file as the declaration. That means we cannot separate the interface in a separate header file 184 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ and we must include both interface and implementation in any file that uses the templates. Going back to the library of functions, if we wanted to make a library of function templates, instead of creating a header file (.h) we should create a "template file" with both the interface and implementation of the function templates (there is no convention on the extension for this type of file other than there be no extension at all or to keep the .h). The inclusion more than once of the same template file with both declarations and definitions in a project doesn't generate linkage errors, since they are compiled on demand and compilers that allow templates should be prepared to not generate duplicate code in these cases. 13.2 Exception Handling During the development of a program, there may be some cases where we do not have the certainty that a piece of the code is going to work right, either because it accesses resources that do not exist or because it gets out of an expected range, etc... These types of anomalous situations are included in what we consider exceptions and C++ has recently incorporated three new operators to help us handle these situations: try, throw and catch. Their form of use is the following: try { // code to be tried throw exception; } catch (type exception) { // code to be executed in case of exception } And its operation: - The code within the try block is executed normally. In case that an exception takes place, this code must use the throw keyword and a parameter to throw an exception. The type of the parameter details the exception and can be of any valid type. - If an exception has taken place, that is to say, if it has executed a throw instruction within the try block, the catch block is executed receiving as parameter the exception passed by throw. For example: Exception: Out of range // exceptions 185 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ #include <iostream.h> int main () { char myarray[10]; try { for (int n=0; n<=10; n++) { if (n>9) throw "Out of range"; myarray[n]='z'; } } catch (char * str) { cout << "Exception: " << str << endl; } return 0; } In this example, if within the n loop, n gets to be more than 9 an exception is thrown, since myarray[n] would in that case point to a non-trustworthy memory address. When throw is executed, the try block finalizes right away and every object created within the try block is destroyed. After that, the control is passed to the corresponding catch block (that is only executed in these cases). Finally the program continues right after the catch block, in this case: return 0;. The syntax used by throw is similar to that of return: Only the parameter does not need to be enclosed between parenthesis. The catch block must go right after the try block without including any code line between them. The parameter that catch accepts can be of any valid type. Even more, catch can be overloaded so that it can accept different types as parameters. In that case the catch block executed is the one that matches the type of the exception sent (the parameter of throw): Exception: index 10 is out of range // exceptions: multiple catch blocks #include <iostream.h> int main () { try { char * mystring; mystring = new char [10]; if (mystring == NULL) throw "Allocation failure"; 186 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ for (int n=0; n<=100; n++) { if (n>9) throw n; mystring[n]='z'; } } catch (int i) { cout << "Exception: "; cout << "index " << i << " is out of range" << endl; } catch (char * str) { cout << "Exception: " << str << endl; } return 0; } In this case there is a possibility that at least two different exceptions could happen: 1. That the required block of 10 characters cannot be assigned (something rare, but possible): in this case an exception is thrown that will be caught by catch (char * str). 2. That the maximum index for mystring is exceeded: in this case the exception thrown will be caught by catch (int i), since the parameter is an integer number. We can also define a catch block that captures all the exceptions independently of the type used in the call to throw. For that we have to write three points instead of the parameter type and name accepted by catch: try { // code here } catch (...) { cout << "Exception occurred"; } It is also possible to nest try-catch blocks within more external try blocks. In these cases, we have the possibility that an internal catch block forwards the exception received to the external level, for that the expression throw; with no arguments is used. For example: try { try { // code here } 187 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ catch (int n) { throw; } } catch (...) { cout << "Exception occurred"; } 13.2.1 Exceptions not caught If an exception is not caught by any catch statement because there is no catch statement with a matching type, the special function terminate will be called. This function is generally defined so that it terminates the current process immediately showing an "Abnormal termination" error message. Its format is: void terminate(); 13.2.2 Standard exceptions Some functions of the standard C++ language library send exceptions that can be captured if we include them within a try block. These exceptions are sent with a class derived from std::exception as type. This class (std::exception) is defined in the C++ standard header file <exception> and serves as a pattern for the standard hierarchy of exceptions: Exception bad_alloc (thrown by new) bad_cast (thrown by dynamic_cast when fails with a referenced type) bad_exception (thrown when an exception doesn't match any catch) bad_typeid (thrown by typeid) logic_error domain_error invalid_argument length_error out_of_range runtime_error overflow_error range_error underflow_error ios_base::failure (thrown by ios::clear) 188 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Because this is a class hierarchy, if you include a catch block to capture any of the exceptions of this hierarchy using the argument by reference (i.e. adding an ampersand & after the type) you will also capture all the derived ones (rules of inheritance in C++). The following example catches an exception of type bad_typeid (derived from exception) that is generated when requesting information about the type pointed by a null pointer: Exception: Attempted typeid of NULL pointer // standard exceptions #include <iostream.h> #include <exception> #include <typeinfo> class A {virtual f() {}; }; int main () { try { A * a = NULL; typeid (*a); } catch (std::exception& e) { cout << "Exception: " << e.what(); } return 0; } You can use the classes of standard hierarchy of exceptions to throw your exceptions or derive new classes from them. 13.3 Advanced Class Type-Casting Until now, in order to type-cast a simple object to another we have used the traditional type casting operator. For example, to cast a floating point number of type double to an integer of type int we have used: int i; double d; i = (int) d; or also i = int (d); 189 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ This is quite good for basic types that have standard defined conversions, however this operators can also be indiscriminately applied on classes and pointers to classes. So, it is perfectly valid to write things like: // class type-casting #include <iostream.h> class CDummy { int i; }; class CAddition { int x,y; public: CAddition (int a, int b) { x=a; y=b; } int result() { return x+y;} }; int main () { CDummy d; CAddition * padd; padd = (CAddition*) &d; cout << padd->result(); return 0; } Although the previous program in syntactically correct in C++ (in fact it will compile with no warnings on most compilers) it is code with not much sense since we use function result, that is a member of CAddition, without having declared an object of that class: padd is not an object, it is only a pointer which we have assigned the address of a non related object. When accessing its result member it will produce a run-time error or, at best, just an unexpected result. 13.3.1 Reinterpret Cast In order to control these types of conversions between classes, ANSI-C++ standard has defined four new casting operators: reinterpret_cast, static_cast, dynamic_cast and const_cast. All of them have the same format when used: reinterpret_cast <new_type> (expression) dynamic_cast <new_type> (expression) static_cast <new_type> (expression) const_cast <new_type> (expression) 190 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Where new_type is the destination type to which expression has to be casted. To make an easily understandable parallelism with traditional type-casting operators these expression mean: (new_type) expression new_type (expression) but with their own special characteristics. reinterpret_cast reinterpret_cast casts a pointer to any other type of pointer. It also allows casting from a pointer to an integer type and vice versa. This operator can cast pointers between non-related classed. The operation results is a simple binary copy of the value from one pointer to the other. The content pointed does not pass any kind of check nor transformation between types. In the case that the copy is performed from a pointer to an integer, the interpretation of its content is system dependent and therefore any implementation is non portable. A pointer casted to an integer large enough to fully contain it can be casted back to a valid pointer. class A {}; class B {}; A * a = new A; B * b = reinterpret_cast<B*>(a); reinterpret_cast treats all pointers exactly as traditional type-casting operators do. 13.3.2 static_cast static_cast performs any casting that can be implicitly performed as well as the inverse cast (even if this is not allowed implicitly). Applied to pointers to classes, that is to say that it allows to cast a pointer of a derived class to its base class (this is a valid conversion that can be implicitly performed) and it can also perform the inverse: cast a base class to its derivated class. In this last case the base class that is being casted is not checked to determine wether this is a complete class of the destination type or not. class Base {}; class Derived: public Base {}; Base * a = new Base; Derived * b = static_cast<Derived*>(a); 191 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ static_cast, aside from manipulating pointers to classes, can also be used to perform conversions explicitly defined in classes, as well as to perform standard conversions between fundamental types: double d=3.14159265; int i = static_cast<int>(d); 13.3.3 dynamic_cast dynamic_cast is exclusively used with pointers and references to objects. It allows any type-casting that can be implicitly performed as well as the inverse one when used with polymorphic classes, however, unlike static_cast, dynamic_cast checks, in this last case, if the operation is valid. That is to say, it checks if the casting is going to return a valid complete object of the requested type. Checking is performed during run-time execution. If the pointer being casted is not a pointer to a valid complete object of the requested type, the value returned is a NULL pointer. class Base { virtual dummy(){}; }; class Derived : public Base { }; Base* b1 = new Derived; Base* b2 = new Base; Derived* d1 = dynamic_cast<Derived*>(b1); // succeeds Derived* d2 = dynamic_cast<Derived*>(b2); // fails: returns NULL If the type-casting is performed to a reference type and this casting is not possible an exception of type bad_cast is thrown: class Base { virtual dummy(){}; }; class Derived : public Base { }; Base* b1 = Base* b2 = Derived d1 Derived d2 new Derived; new Base; = dynamic_cast<Derived&*>(b1); // succeeds = dynamic_cast<Derived&*>(b2); // fails: exception thrown const_cast This type of casting manipulates the const attribute of the passed object, either to be set or removed: 192 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ class C {}; const C * a = new C; C * b = const_cast<C*> (a); Neither of the other three new cast operators can modify the constness of an object. 13.3.4 Typeid ANSI-C++ also defines a new operator called typeid that allows checking the type of an expression: typeid (expression) This operator returns a refernece to a constant object of type type_info that is defined in the standard header file <typeinfo>. This returned value can be compared with another using operators == and != or can serve to obtain a string of characters representing the data type or class name by using its name() method. a and b are of different types: a is: class CDummy * b is: class CDummy // typeid, typeinfo #include <iostream.h> #include <typeinfo> class CDummy { }; int main () { CDummy* a,b; if (typeid(a) != typeid(b)) { cout << "a and b are of different types:\n"; cout << "a is: " << typeid(a).name() << '\n'; cout << "b is: " << typeid(b).name() << '\n'; } return 0; } Preprocessor Director Preprocessor directives are orders that we include within the code of our programs that are not instructions for the program itself but for the preprocessor. The preprocessor is executed automatically by the compiler when we compile a program in C++ and is in charge of making the first verifications and digestions of the program's code. All these directives must be specified in a single line of code and they do not have to include an ending semicolon ;. 193 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ #define At the beginning of this tutorial we have already spoken about a preprocessor directive: #define, that serves to generate what we called defined constannts or macros and whose form is the following: #define name value Its function is to define a macro called name that whenever it is found in some point of the code is replaced by value. For example: #define MAX_WIDTH 100 char str1[MAX_WIDTH]; char str2[MAX_WIDTH]; It defines two strings to store up to 100 characters. #define can also be used to generate macro functions: #define getmax(a,b) a>b?a:b int x=5, y; y = getmax(x,2); after the execution of this code y would contain 5. #undef #undef fulfills the inverse functionality of #define. It eliminates from the list of defined constants the one that has the name passed as a parameter to #undef: #define MAX_WIDTH 100 char str1[MAX_WIDTH]; #undef MAX_WIDTH #define MAX_WIDTH 200 char str2[MAX_WIDTH]; #ifdef, #ifndef, #if, #endif, #else and #elif These directives allow to discard part of the code of a program if a certain condition is not fulfilled. #ifdef allows that a section of a program is compiled only if the defined constant that is specified as the parameter has been defined, independently of its value. Its operation is: #ifdef name // code here #endif For example: 194 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ #ifdef MAX_WIDTH char str[MAX_WIDTH]; #endif In this case, the line char str[MAX_WIDTH]; is only considered by the compiler if the defined constant MAX_WIDTH has been previously defined, independently of its value. If it has not been defined, that line will not be included in the program. #ifndef serves for the opposite: the code between the #ifndef directive and the #endif directive is only compiled if the constant name that is specified has not been defined previously. For example: #ifndef MAX_WIDTH #define MAX_WIDTH 100 #endif char str[MAX_WIDTH]; In this case, if when arriving at this piece of code the defined constant MAX_WIDTH has not yet been defined it would be defined with a value of 100. If it already existed it would maintain the value that it had (because the #define statement won't be executed). The #if, #else and #elif (elif = else if) directives serve so that the portion of code that follows is compiled only if the specified condition is met. The condition can only serve to evaluate constant expressions. For example: #if MAX_WIDTH>200 #undef MAX_WIDTH #define MAX_WIDTH 200 #elsif MAX_WIDTH<50 #undef MAX_WIDTH #define MAX_WIDTH 50 #else #undef MAX_WIDTH #define MAX_WIDTH 100 #endif char str[MAX_WIDTH]; Notice how the structure of chained directives #if, #elsif and #else finishes with #endif. #line When we compile a program and errors happen during the compiling process, the compiler shows the error that happened preceded by the name of the file and the line within the file where it has taken place. 195 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ The #line directive allows us to control both things, the line numbers within the code files as well as the file name that we want to appear when an error takes place. Its form is the following one: #line number "filename" Where number is the new line number that will be assigned to the next code line. The line number of successive lines will be increased one by one from this. filename is an optional parameter that serves to replace the file name that will be shown in case of error from this directive until another one changes it again or the end of the file is reached. For example: #line 1 "assigning variable" int a?; This code will generate an error that will be shown as error in file "assigning variable", line 1. #error This directive aborts the compilation process when it is found returning the error that is specified as the parameter: #ifndef __cplusplus #error A C++ compiler is required #endif This example aborts the compilation process if the defined constant __cplusplus is not defined. #include This directive has also been used assiduously in other sections of this tutorial. When the preprocessor finds an #include directive it replaces it by the whole content of the specified file. There are two ways to specify a file to be included: #include "file" #include <file> The only difference between both expressions is the directories in which the compiler is going to look for the file. In the first case where the file is specified between quotes, the file is looked for in the same directory that includes the file containing the directive. In case that it is not there, the compiler looks for the file in the default directories where it is configured to look for the standard header files. If the file name is enclosed between angle-brackets <> the file is looked for directly where the compiler is configured to look for the standard header files. #pragma 196 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ This directive is used to specify diverse options to the compiler. These options are specific for the platform and the compiler you use. Consult the manual or the reference of your compiler for more information on the possible parameters that you can define with #pragma. 197 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ UNIT 14 FILE INPUT OUTPUT Contents 14.1 Input/Output with files. 14.2 Open a file 14.3 Closing a file 14.4 Methods of Input and Output Classes 14.5 Text mode files 14.6 state flags 14.7 Binary files 14.8 Buffers and Synchronization 14.1 Input Output With Files The techniques for file input and output, i/o, in C++ are virtually identical to those introduced in earlier lessons for writing and reading to the standard output devices, the screen and keyboard. To perform file input and output the include file fstream must be used. #include <fstream> Fstream contains class definitions for classes used in file i/o. Within a program needing file i/o, for each output file required, an object of class ofstream is instantiated. For each input file required, an object of class ifstream is instantiated. The ofstream object is used exactly as the cout object for standard output is used. The ifstream object is used exactly as the cin object for standard input is used. This is best understood by studying an example. C++ has support both for input and output with files through the following classes: • ofstream: File class for writing operations (derived from ostream) • ifstream : File class for reading operations (derived from istream) • fstream : File class for both reading and writing operations (derived from iostream) 14.2 Open a file The first operation generally done on an object of one of these classes is to associate it to a real file, that is to say, to open a file. The open file is represented within the program by a stream object (an instantiation of one of these classes) and any input or output performed on this stream object will be applied to the physical file. 198 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ In order to open a file with a stream object we use its member function open(): void open (const char * filename, openmode mode); where filename is a string of characters representing the name of the file to be opened and mode is a combination of the following flags: ios::in Open file for reading ios::out Open file for writing ios::ate Initial position: end of file ios::app Every output is appended at the end of file ios::trunc If the file already existed it is erased ios::binary Binary mode These flags can be combined using bitwise operator OR: |. For example, if we want to open the file "example.bin" in binary mode to add data we could do it by the following call to function-member open: ofstream file; file.open ("example.bin", ios::out | ios::app | ios::binary); All of the member functions open of classes ofstream, ifstream and fstream include a default mode when opening files that varies from one to the other: class default mode to parameter ofstream ios::out | ios::trunc ifstream ios::in fstream ios::in | ios::out The default value is only applied if the function is called without specifying a mode parameter. If the function is called with any value in that parameter the default mode is stepped on, not combined. Since the first task that is performed on an object of classes ofstream, ifstream and fstream is frequently to open a file, these three classes include a constructor that directly calls the open member function and has the same parameters as this. This way, we could also have declared the previous object and conducted the same opening operation just by writing: ofstream file ("example.bin", ios::out | ios::app | ios::binary); Both forms to open a file are valid. 199 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ You can check if a file has been correctly opened by calling the member function is_open(): bool is_open(); that returns a bool type value indicating true in case that indeed the object has been correctly associated with an open file or false otherwise. 14.3 Closing a file When reading, writing or consulting operations on a file are complete we must close it so that it becomes available again. In order to do that we shall call the member function close(), that is in charge of flushing the buffers and closing the file. Its form is quite simple: void close (); Once this member function is called, the stream object can be used to open another file, and the file is available again to be opened by other processes. In case that an object is destructed while still associated with an open file, the destructor automatically calls the member function close. 14.4 Methods of Input and Output Classes The ifstream class has several useful methods for input. These method are also in the class cin, which is used to read from standard input. These methods are used to read from any input stream. An input stream is a source of input such as the keyboard, a file or a buffer. • Extraction Operator, >> This overloaded operator handles all built in C++ data types. By default, any intervening white space is disregarded. That is, blanks, tabs, new lines, formfeeds and carriage returns are skipped over. • get() This form of get extracts a single character from the input stream, that is, from the standard input, a file or a buffer. It does not skip white space. It returns type int. • get(char &ch) This form of get also extracts a single character from the input stream, but it stores the value in the character variable passed in as an argument. • get(char *buff, int buffsize, char delimiter='\n') 200 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ This form of get reads characters into the C-style buffer passed as an argument buffsize characters are read, the delimiter is encountered or an end of file is encountered. The '\n' is the new line character. The delimiter is not read into the buffer but is instead left in the input stream. It must be removed separately but using either another get or an ignore. Because of this added step, this form of get is a frequent source of errors and should be avoided. Fortunately, another method shown below, getline, reads in the delimiter as well and should be used in place of this form of get. • Getline There are several useful forms of getline. • ignore(int count=1, int delim=traits_type::eof) This method reads and discards "count" characters from the input stream. • peek() This method returns the next character from the input stream, but does not remove it. It is useful to look ahead at what the next character read will be. • putback(char &ch) This method puts ch onto the input stream. The character in ch will then be the next character read from the input stream. • unget() This method puts the last read character back into the input stream. • read(char *buff, int count) This method is used to perform an unformatted read of count bytes from the input stream into a character buffer. The ofstream class has several useful methods for writing to an output stream. An output stream is standard output (usually the screen), a file or a buffer. These methods are also in the object cout, which is used for standard output. The simplest way to understand how to use these methods is by looking at a few examples. Since we have seen the extraction, >>, and insertion, << in several lessons, let's look at the other methods. Getline, which is very useful to read entire lines of text into a string. Suppose we need to read a file and determine the number of alphanumeric characters, the number of blanks and the number of sentences. To determine the number of sentences we will count the number of periods (dots). We will disregard newlines and tabs. Here is a program that solves the problem. #include <iostream> #include <fstream> using namespace std; int main() 201 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ { int blank_count = 0; int char_count = 0; int sentence_count = 0; char ch; ifstream iFile("c:/lesson12.txt"); if (! iFile) { cout << "Error opening input file" << endl; return -1; } while (iFile.get(ch)) { switch (ch) { case ' ': blank_count++; break; case '\n': case '\t': break; case '.': sentence_count++; break; default: char_count++; break; } } cout << "There are " << blank_count << " blanks" << endl; cout << "There are " << char_count << " characters" << endl; cout << "There are " << sentence_count << " sentences" << endl; return 0; } As a second example, let's implement a program that will copy the contents of one file to another. The program will prompt the user for the input and output file names, and then copy. 202 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ #include <iostream> #include <fstream> #include <string> using namespace std; int main() { char ch; string iFileName; string oFileName; cout << "Enter the source file name: "; cin >> iFileName; cout << "Enter the destination file name: "; cin >> oFileName; ofstream oFile(oFileName.c_str()); ifstream iFile(iFileName.c_str()); //Error checking on file opens omitted for brevity. while (iFile.get(ch)) { oFile.put(ch); } return 0; } 14.5 Text mode files Classes ofstream, ifstream and fstream are derived from ostream, istream and iostream respectively. That's why fstream objects can use the members of these parent classes to access data. Generally, when using text files we shall use the same members of these classes that we used in communication with the console (cin and cout). As in the following example, where we use the overloaded insertion operator <<: // writing on a text file #include <fstream.h> This is a line. This is another line. int main () { 203 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ ofstream examplefile ("example.txt"); if (examplefile.is_open()) { examplefile << "This is a line.\n"; examplefile << "This is another line.\n"; examplefile.close(); } return 0; } Data input from file can also be performed in the same way that we did with cin: // reading a text file #include <iostream.h> #include <fstream.h> #include <stdlib.h> This is a line. This is another line. int main () { char buffer[256]; ifstream examplefile ("example.txt"); if (! examplefile.is_open()) { cout << "Error opening file"; exit (1); } while (! examplefile.eof() ) { examplefile.getline (buffer,100); cout << buffer << endl; } return 0; } This last example reads a text file and prints out its content on the screen. Notice how we have used a new member function, called eof that ifstream inherits from class ios and that returns true in case that the end of the file has been reached. 14.6 State flags In addition to eof(), other member functions exist to verify the state of the stream (all of them return a bool value): bad() Returns true if a failure occurs in a reading or writing operation. For example in case we try to write to a file that is not open for writing or if the device where we try to write has no space left. 204 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ fail() Returns true in the same cases as bad() plus in case that a format error happens, as trying to read an integer number and an alphabetical character is received. eof() Returns true if a file opened for reading has reached the end. good() It is the most generic: returns false in the same cases in which calling any of the previous functions would return true. In order to reset the state flags checked by the previous member functions you can use member function clear(), with no parameters. get and put stream pointers All i/o streams objects have, at least, one stream pointer: • ifstream, like istream, has a pointer known as get pointer that points to the next element to be read. • ofstream, like ostream, has a pointer put pointer that points to the location where the next element has to be written. • Finally fstream, like iostream, inherits both: get and put These stream pointers that point to the reading or writing locations within a stream can be read and/or manipulated using the following member functions: tellg() and tellp() These two member functions admit no parameters and return a value of type pos_type (according ANSI-C++ standard) that is an integer data type representing the current position of get stream pointer (in case of tellg) or put stream pointer (in case of tellp). seekg() and seekp() This pair of functions serve respectively to change the position of stream pointers get and put. Both functions are overloaded with two different prototypes: seekg ( pos_type position ); seekp ( pos_type position ); 205 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ Using this prototype the stream pointer is changed to an absolute position from the beginning of the file. The type required is the same as that returned by functions tellg and tellp. seekg ( off_type offset, seekdir direction ); seekp ( off_type offset, seekdir direction ); Using this prototype, an offset from a concrete point determined by parameter direction can be specified. It can be: ios::beg Offset specified from the beginning of the stream ios::cur Offset specified from the current position of the stream pointer ios::end Offset specified from the end of the stream The values of both stream pointers get and put are counted in different ways for text files than for binary files, since in text mode files some modifications to the appearance of some special characters can occur. For that reason it is advisable to use only the first prototype of seekg and seekp with files opened in text mode and always use nonmodified values returned by tellg or tellp. With binary files, you can freely use all the implementations for these functions. They should not have any unexpected behavior. The following example uses the member functions just seen to obtain the size of a binary file: // obtaining file size #include <iostream.h> #include <fstream.h> size of example.txt is 40 bytes. const char * filename = "example.txt"; int main () { long l,m; ifstream file (filename, ios::in|ios::binary); l = file.tellg(); file.seekg (0, ios::end); m = file.tellg(); file.close(); cout << "size of " << filename; cout << " is " << (m-l) << " bytes.\n"; return 0; } 206 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ 14.7 Binary files In binary files inputting and outputting data with operators like << and >> and functions like getline, does not make too much sense, although they are perfectly valid. File streams include two member functions specially designed for input and output of data sequentially: write and read. The first one (write) is a member function of ostream, also inherited by ofstream. And read is member function of istream and it is inherited by ifstream. Objects of class fstream have both. Their prototypes are: write ( char * buffer, streamsize size ); read ( char * buffer, streamsize size ); Where buffer is the address of a memory block where the read data are stored or from where the data to be written are taken. The size parameter is an integer value that specifies the number of characters to be read/written from/to the buffer. // reading binary file #include <iostream.h> #include <fstream.h> the complete file is in a buffer const char * filename = "example.txt"; int main () { char * buffer; long size; ifstream file (filename, ios::in|ios::binary|ios::ate); size = file.tellg(); file.seekg (0, ios::beg); buffer = new char [size]; file.read (buffer, size); file.close(); cout << "the complete file is in a buffer"; delete[] buffer; return 0; } 14.8 Buffers and Synchronization 207 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ When we operate with file streams, these are associated to a buffer of type streambuf. This buffer is a memory block that acts as an intermediary between the stream and the physical file. For example, with an out stream, each time the member function put 208 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ (write a single character) is called, the character is not written directly to the physical file with which the stream is associated. Instead of that, the character is inserted in the buffer for that stream. When the buffer is flushed, all data that it contains is written to the physic media (if it is an out stream) or simply erased (if it is an in stream). This process is called synchronization and it takes place under any of the following circumstances: • When the file is closed: before closing a file all buffers that have not yet been completely written or read are synchronized. • When the buffer is full: Buffers have a certain size. When the buffer is full it is automatically synchronized. • Explicitly with manipulators: When certain manipulators are used on streams synchronization takes place. These manipulators are: flush and endl. • Explicitly with function sync(): Calling member function sync() (no parameters) causes an immediate syncronization. This function returns an int value equal to -1 if the stream has no associated buffer or in case of failure. 14.9 I/O Manipulators Up till now, we have accepted the default output formatting. C++ defines a set of manipulators which are used to modify the state of iostream objects. These control how data is formatted. They are defined in the include file, <ios>. It is not usually necessary to explicitly include this file because it is included indirectly via the use of other includes such as <iostream> or <fstream>. Let's see how some of these manipulators work in a simple program. Manipulator Use boolalpha Causes bool variables to be output as true or false. noboolalhpa (default) Causes bool variables to be displayed as 0 or 1. dec (default) Specifies that integers are displayed in base 10. hex Specifies that integers are displayed in hexadecimal. oct Specified that integers are displayed in octal. left Causes text to be left justified in the output field. right Causes text to be right justified in the output field. internal Causes the sign of a number to be left justified and the value to be right justified. noshowbase (default) Turns off displaying a prefix indicating the base of a number. 209 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ showbase Turns on displaying a prefix indicating the base of a number. noshowpoint (default) Displays decimal point only if a fractional part exists. showpoint Displays decimal point always. noshowpos (default) No "+" prefixing a positive number. showpos Displays a "+" prefixing a positive number. skipws (default) Causes white space (blanks, tabs, newlines) to be skipped by the input operator, >>. noskipws White space not skipped by the extraction operator, >>. fixed (default) Causes floating point numbers to be displayed in fixed notation. scientific Causes floating point numbers to be displayed in scientific notation. nouppercase (default) 0x displayed for hexadecimal numbers, e for scientific notation uppercase 0X displayed for hexadecimal numbers, E for scientific notation The manipulators in the above table modify the state of the iostream object. This means that once used on an iostream object they will effect all subsequent input or output done with the object. There are several other manipulators that are used to format a particular output but do no modify the state of the object. Setting Output Width setw(w) - sets output or input width to w; requires <iomanip> to be included. width(w) - a member function of the iostream classes. Filling White Space setfill(ch) - fills white space in output fields with ch; requires <iomanip> to be included. fill(ch) = a member function of the iostream classes. Setting Precision setprecision(n) - sets the display of floating point numbers at precision n. This does not effect the way floating point numbers are handled during calculations in your program. Here is a simple program illustrating the use of the i/o manipulators. #include <iostream> #include <iomanip> #include <string> using namespace std; 210 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ int main() { int intValue = 15; cout cout cout cout cout cout cout cout cout cout << << << << << << << << << << "Integer Number" << endl; "Default: " << intValue << endl; "Octal: " << oct << intValue << endl; "Hex: " << hex << intValue << endl; "Turning showbase on" << showbase << endl; "Dec: " << dec << intValue << endl; "Octal: " << oct << intValue << endl; "Hex: " << hex << intValue << endl; "Turning showbase off" << noshowbase << endl; endl; double doubleVal = 12.345678; cout cout cout cout cout cout cout cout << << << << << << << << "Floating Point Number" << endl; "Default: " << doubleVal << endl; setprecision(10); "Precision of 10: " << doubleVal << endl; scientific << "Scientific Notation: " << doubleVal << endl; uppercase; "Uppercase: " << doubleVal << endl; endl; bool theBool = true; cout cout cout cout << << << << "Boolean" << endl; "Default: " << theBool << endl; boolalpha << "BoolAlpha set: " << theBool << endl; endl; string myName = "John"; cout << "Strings" << endl; cout << "Default: " << myName << endl; cout << setw(35) << right << "With setw(35) and right: " << myName << endl; cout.width(20); cout << "With width(20): " << myName << endl; cout << endl; 211 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ return 0; } 212 BTech IIIrd Semester Paper Code: BTC31 Paper Name: Object Oriented Programming Using C++ 213