Information Processing and Database

advertisement
Information Processing and Database
Mohammad Manjur Mahmud
Introduction :
Database systems are designed to manage complex information. The management of data
involves both the definition of structures for the storage of information and the provision
of mechanisms for the manipulation of information. It also provides certain level of
security, safety etc.
The true domain of Database has now reached to such an extend that it has become the
nucleus of information management and total office automation in the enterprise area of
computing. Here in this column overall database concepts will be highlighted. This will
help the reader to find right database management system for their need and judge a
database product under certain predefined provisions.
1.1. Problem of Traditional Information System Data redundency
Data redundency and inconsistency is one of the major problem of the traditional
information system. Same piece of information needed to be stored in different area of
the storage media, in many different format. Which lead the users to spend higher cost at
information retrieval from the heterogeneous storage media.
1.2. DATABASE PERSPECTIVE :
Database management system hides the complex of its storage, retrieval etc. which is
known as data abstraction. A database management system is a collection of interrelated
files and a set of programs that allow users to access and modify these flies. A major
purpose of a database system is to provideusers with an abstract view of the data. That is
the system hides certain details of how the data is stored and maintained.
There are three of data abstraction .
i. Physical Level
ii. Conception Level
iii. View Level
i. The lowest level of abstraction describe how the data are actually stored. At this level
complex low level data structures are described in details.
ii. This level describe what data are actually stored in the database, and the relationship
that exists among the data.
View 1
View 2
View n
Conceptual Level
Physical Level
Fig : The three level of data abstraction
Access mechanism
Data access, query from the information system needed to be programmed and a serious
application backlog arises. Data resides in various file format and new application is
difficult to write. Multiple access to the same file was results into ambiguity. Security
There is hardly any data security in the traditional information system and unauthorized
access to the same is extremely easy. Implementation of any new integrity constraint is
very difficult.
iii. The highest level of abstraction describes only part of the entire database. Despite the
use of simple structure at the conceptual level, some complexity remains because of the
large size of the database. Many users of the database system will not be concerned with
all of this information. To simplify their information with the system the view used for
Abstraction is defined.
ANALOGY OF DATA ABSTRACT :
Let there be a record with three field
Struct Customer
{
Char name [ 20 ];
Char Street [ 30 ];
Char Ciry [ 20 ] ;
)
This defines a records called "customer" with three fields. Each field has a name and a
data type associated with it.. A banking enterprise may have several such record types,
for example.
- account, with fields account number and balance
- Employee, with fields name and salary
At the physical level, a customer, account or employee can be described as a block of
consecutive storage location (e.g. words or bytes). At the conceptual level each such
record is described by a type definition, illustrated above, and the interrelationship among
these record type is defined. Finally at the view level, several view of the database are
defined. For example, tellers in a bank see only that part of the database that has
information on customer account. They can not access information concerning salaries of
employee.
1.3. Data models
Data model is described as collection of concepts to describe data, data relationships, data
semantics, and consistency constraints. The various data models that have been proposed
fall into three groups*: i) Object-based logical model, ii) Record based logical model and
iii) Physical data model.
1.3.1.Object-Based Logical Model
Object-Based logical models are used in data in the conceptual and view levels. They
provide fairly flexible structuring capability and allows data to be specified explicitly.
There are many different models like :
- The entity-relationship model.
- The Object-Oriented model
- The Binary Model
- The Semantic data model
- The infological model
- The functional data model The first two models namely
entity-relationship model and Object-Oriented model has gained a very wide use in the
real data base
31 Computer Jagat October 1993
design. Below description of both the model :
THE ENTITY-RELATIONSHIP MODEL
This model is based on a collection of basic object called "entities", and "relationships"
among these objects. An "entity" is an object that is distinguishable for other objects by a
specific set of attributes.
When modeling of the Entity-Relationship model often include a step called
"normalization". Normalization is the process of insuring that a table has a complete
primary key and that all the non-primary key columns depend on the entire primary key.
The proper application of the Entity-Relationship model results in well-designed tables.
For an example, in a ACCOUNTS record for a banking application there may be two
attribute "number" and "balance", that describes one particular account in a bank. A
"relationship" is an association among several entities. For example, a CUSTACCT
relationship associates a customer with each account she or he has. The set of entities of
the same type and relationship of the same type are termed an "entity set" and
"relationship set", respectively.
In addition to "entities" and "relationship", the E-R model represents certain constrains to
which the contents of a database must confirm. For example "mapping cardinalities",
which expresses the number of entities to which another entity can be associated via a
relationship set.
Usually the E-R database can be presented graphically with RECTANGLES,
ELLIPSES, DIAMONDS, LINES.
THE OBJECT-ORIENTED MODEL
The object-orinted model is based on a collection of objects. An "object" contains values
stored in "instance variable" within the object. The "objects" also contains bodies of the
code that operates on the code, which is called the "methods".
1.3.2 RECORD-BASED LOGICAL MODEL
It is used in describing data at the conceptual and view levels. It is
used to specify the overall logical structure of the database design and to provide a high
level description of the implementation.
Record based model are so named because the database is structured in fixed-format
records of several types. Each record type defines a fixed number of fields or attribes, and
each field is usually of fixed length.
There are three widely accepted record-based logical model
i) Relational model
ii) Network model
iii) Hierarchical model
RELATIONAL MODEL :
It represents data and relationship among data by a collection of tables, each of which has
a number of columns with unique names.
NETWORK MODEL :
Data in the network model is represented by a collection of records (in Pascal, c context)
and relationships among data are represented by "links", which can be viewed as
"pointers".
HIERARCHICAL MODEL :
It is similar to the network model in a sense that data and relationships among data are
represented by records and links, respectively. The difference is that the records are
organized as collection of trees rather than arbitrary graphs.
COMPARISON OF THE DATA MODELS
The relational model differs from the network and hierarchical modek that it does not use
"pointers" or "links". Instead, the relational model relates records by values it contains.
This freedom from the use of pointers allows a mathematical foundation to be defined.
1.3.3. PHYSICAL DATA MODEL
Physical data models are used to describe data at the lowest level. Infect Physical data
model is not widely used.
There are Two known type of Physical data model : Unifying model, Frame memory.
1.4 Data Definition Language
A database scheme is specified by a set of definitions which are expressed by a special
language
32 Computer Jagat October 1993
called a data definition language (DDL). The result of compilation of DDL statements is
a set of tables which are stored in a special file called data dictionary (or directory).
A data directory is a file that contains metadata; that is, "data about data." This file is
consulted before actual data is read or modified in the database system.
The storage structure and access methods used by the database system are specified by a
definitions in a special type of DDL called a data storage and definition language. The
result of compilation of these definitions is a set of instructions to specify the
implementation details of the database schemes which are usually hidden from the users.
1.5 Data Manipulation Language
The levels of abstraction discussed in Section 1.2 apply not only to the definition or
structuring of data but also to the manipulation of data. Data manipulation means :
* The retrieval of information stored in the database.
* The insertion of new information into the database.
* The deletion of information from the database.
* The modification of data stored in the database.
At the physical level, algorithm is defined that allows efficient access to data. At higher
levels of abstraction, an emphasis is placed on ease of use. The goal is to provide for
efficient human interaction with the system
A data manipulation language (DML) is a language that enables users to access or
manipulate data as organised by the appropriate data model. There are basically two types
:
* Procedural DMLs require a user to specify what data is needed and how to get it.
* Nonprocedural DMLs require a user to specify what data is needed without specifying
how to get it.
Bibliography :
i) Database System Concept
- H.F. Kurth, A. Silberschats ii) Relational Database
- C.J. Date iii) Database System - Wiederhold
THE ENGLISH PAGES ARE SONSORED BY COMPUTERLINE. 146/1, AZIMPUR
ROAD, DHAKA.
Download