Data Modeling [Comparison of data modeling techniques ]

advertisement
Data Modeling
[Comparison of data modeling
techniques ]
By
Renjini
Sindhuri
Contents





Introduction
E-R modeling
 Peter Chen
 Information Engineering
 Barkers Notation
 IDEFIX
UML modeling
XML modeling
 X- Entity modeling
 XUML
Conclusion
Introduction



Data modeling is the act of exploring data oriented
structures.
Examines and compares different data modeling
techniques
In the data modeling techniques we have traditional
modeling and object oriented modeling of data
E-R modeling



It is a conceptual data model that views the real world
as consisting of entities and relationships
It is used to transform relational tables that are easy to
understand that enables easy communication with the
end user
Peter –Chen developed E-R model
Peter –Chen notation




Entities are represented in the squared cornered and
circles as attributes
Many –Many relationships can be represented without
associative entity
Relationship itself has attributes and are considered
as objects
It failed to represent unique identifier
Peter Chen’s Model
Information Engineering model




Developed by Clive Finkelstein
Entities are represented in the squared cornered and
attributes are not shown at all they are shown in a
separate list called entity list
Relationships like mandatory 1 and many can be
represented
Unique identifiers are not represented
Information Engineering model diagram
Barkers Notation





Adopted by Oracle corporation for its CASE method
Entities can be represented by round cornered
rectangle
Same entity can be represented for role an interaction
or another kind of association
Relationship names are prepositions and not verbs
Unique identifiers can be represented by hash marks
next to the attribute
Barkers Notation diagram
IDEFIX Notation




It is a modeling technique that is used by many
branches of the United States Federal government
A relationship name is a verb
IDEFIX shows subtypes as separate entity boxes
IDEFIX permits multiple inheritance and multiple type
hierarchies
IDEFIX diagram
UML

UML is an object modeling technique
 It models object classes instead of entities
 In the object oriented world the relationships are called
as associations
 Cardinality and optionality in UML is conveyed by
characters or numbers
 Express in the form of more complex upper and
lower limits
 UML introduces a small flag that includes text
describing any business rules
UML diagram
XML Notation




Describing data and interchanging structured and
unstructured data on the Internet
It is a universal language of data on web
XML tags are used to create data structures
XML documents have been widely used for
interchanging data between heterogeneous systems.
XML notation
An example of XML notation
http://www.essentialstrategies.com/publicat
ions/modeling/xml.htm

X-Entity model

Conceptual model of XML uses X entity model in order
to represent additional features
 The entity can be denoted by ‘E’
 ({A1,….An},{R1,…Rm},{D1,….Dk})
 Each attribute A is associated with a domain Dom(Ai)
Which specifies its value set
Cardinality is denoted by Card(Ai)=(min,max)
X entity model diagram
XUML




XUML comprises the characteristics of XML and
UML2.
It is used to express the containment semantics more
explicitly
Supporting the concept of Business Components
Specifying the data dependencies in multiple context
XUML diagram
UML and XUML model of a book store
Comparison of data modeling techniques
S.No
Modeling
Technique
Peter Chen
Information
Engineering
IDEFIX
Richard
Barker’s
notation
UML
1.
Entities
squared
cornered
and circles
as attributes
Squared
cornered,
attributes are
not shown at
all.
Round or
square
cornered
rectangle
Round
cornered
rectangle
Models object
classes
2.
Relationship
Nouns. So
the
relationships
can be
represent as
objects and
has
attributes
Verbs
Verb or verb
phrase
Preposition
not verb
Associations
3.
Constraints
between
relationships
Failed to
represent
the
constraints
directly
exclusive
or)
Can
represent
Constraints
exclusive or
,inclusive
Cannot
represent
Can
represent
Constraints
exclusive or)
Can
represent
Constraints
exclusive or )
Comparison of Data modeling techniques
S.No
Modeling
Technique
Peter Chen
Information
Engineering
IDEFIX
Richard
Barker’s
notation
UML
4.
Cardinality
Many to Many
relationships
can be
represented
between the
entities
without the
associative
entity
Can represent
Can represent
in different
ways
Can represent
zero or more
,atleast
at least one
express more
complex
upper limits,
zero, 3, 6-7,
or 9
Cannot
represent the
sub types and
sub-types can
be
represented
inside their
super-type
5.
Sub types/
Super Types
super type
6.
Unique
Identifier
Cannot
represent
,
up to many
up to one
relationships
sub-types can
be
represented
inside their
super-type
box
box
Sub types can
be
represented
as separate
entity boxes
separate from
its super type.
Cannot
represent
Represented
in the form of
Represented
in the form of
hash next to
Can represent
Can represent
Comparison of Data Modeling techniques
S.No
Modeling
Technique
Peter Chen
Information
Engineering
IDEFIX
Richard
Barker’s
notation
UML
7.
Aggregation
Cannot
represent
Cannot
represent
Cannot
represent
Cannot
represent
Can
represent
only binary
aggregations
8.
Business
Rules /
Cannot
Represent
Cannot
Represent
Cannot
Represent
Cannot
Represent
Can
Represent
Components
Comparison of Data Modeling techniques
S.No
Modeling
Technique
9.
Peter Chen
Information
Engineering
IDEFIX
Richard
Barker’s
notation
UML
High
Medium
Low
High
High
Aesthetic
Simplicity
Score
10.
Completenes
s Score
Low
medium
medium
medium
medium
11.
Language
Notation
Score
medium
medium
low
medium
High
Advantages of XUML



XUML can express the containment semantics more
accurately.
Support the concept of Business Component.
Can specify the data dependencies in
multiple context.
Contd..

XUML is more expressive, precise and
understandable.
 More rigorous and accurate.
Conclusion




By comparing the aesthetic simplicity, completeness,
language notation (relationship) Mr. Barker's notation
is favorable for requirement analysis model
XML is used in recent trends it follows a standard
format for representing structured and semi structured
data on web
X-Entity model has the advantages of both XML
schemas and extends the ER model so that it can
explicitly represent important features of XML
schemas
The distinctive features of XUML made this technique
of data modeling the latest trend for conceptual
modeling of data.
References

1. Conceptual Modeling of XML schemas, Bernadette Farias
Losio,Ana Carolina Salgado , Year: 2003,Publisher: ACM

2. XML conceptual modeling with XUML, HongXing Liu
HuaZhong University of Science and Technology, P. R. China,
YanSheng Lu HuaZhong University of Science and Technology, P.
R. China,Qing Yang Wuhan Uni Pages: 973 – 976, Year of
Publication: 2006, Publisher: ACM Press

3. PETER PIN-SHAN CHEN, “The Entity Relationship ModelToward a Unified View of Data” , Massachusetts Institute of
Technology, ACM Transactions on Data base System Volume1,
Issue 1,Publisher-ACM

4. Data modeling in the understanding database course: adding
UML and XML modeling to the traditional content. Journal of
Computing Sciences in Colleges, Volume 17, Issue 5 (April 2002)
References

5. Data Modeling101.
http://www.agiledata.org/essays/dataModeling101.html
6.A comparison of Data Modeling ,David C Hay,Essential Strategies
Inc,October 1999.
Download