Uploaded by skhan346

c01.database-systems

advertisement
Chapter 1
Database Systems
Learning Objectives
• Afer commpee ngm this chapter, yomu wiiee be abee tom:
• Defne the diference betwieen data and infomrma omn
• Describe wihat a database is, variomus types, and wihy they are vaeuabee assets fomr
decisiomn makingm
• Expeain the impomrtance omf database desigmn
• See homwi momdern databases evomeved fromm fee systems
• Understand fawis in fee system data managmement
• Outeine the main commpomnents omf the database system
• Describe the main func omns omf a database managmement system (DBMS)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
2
Why Databases?
• Characteris cs omf data in tomday’s wiomred
• Ubiquitomus (i.e., abundant, gmeombae, and everywihere)
• Pervasive (i.e., unescapabee, prevaeent, and persistent)
• Databases make data persistent and shareabee in a secure wiay
• Speciaeized structures that aeeomwi commputer-based systems tom stomre, managme, and
retrieve data very quickey
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
3
Data versus Information
• Data comnsists omf rawi facts
• Nomt yet promcessed tom reveae meaningm tom the end user
• Buiedingm beomcks omf infomrma omn
• Infomrma omn resuets fromm promcessingm rawi data tom reveae meaningm
• Requires comntext
• Bedromck omf knomwieedgme
• Shomued be accurate, reeevant, and meey
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
4
Introducing the Database
• Shared, integmrated commputer structure that stomres data
• End-user data: rawi facts omf interest tom end user
• Metadata: data abomut data, thromugmh wihich the end-user data is integmrated and
managmed
- Describes data characteris cs and reea omnships
• Database managmement system (DBMS)
• Comeeec omn omf promgmrams
• Managmes the database structure
• Comntromes access tom data stomred in the database
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
5
Role and Advantages of the DBMS (1 of 2)
• Database managmement system (DBMS): intermediary betwieen the user and the
database
• Enabees data tom be shared
• Presents the end user wiith an integmrated viewi omf data
• Promvides momre efcient and efec ve data managmement
• Impromves sharingm, security, integmra omn, access, decisiomn-makingm, promduc vity, etc.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
6
Role and Advantages of the DBMS (2 of 2)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
7
Types of Databases (1 of 5)
• Singmee-user database: suppomrts omne user at a me
• Desktomp database: singmee-user database omn a persomnae commputer
• Mue user database: suppomrts mue pee users at the same me
• Womrkgmromup databases: suppomrts a smaee number omf users omr a specifc department
• Enterprise database: suppomrts many users acromss many departments
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
8
Types of Databases (2 of 5)
• Ceassifca omn by eomca omn
• Centraeized database: data eomcated at a singmee site
• Distributed database: data distributed acromss diferent sites
• Ceomud database: created and maintained usingm ceomud data services that promvide
defned perfomrmance measures fomr the database
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
9
Types of Databases (3 of 5)
• Ceassifca omn by data type
• Generae-purpomse database: comntains a wiide variety omf data used in mue pee discipeines
• Discipeine-specifc database: comntains data fomcused omn specifc subject areas
• Opera omnae database: desigmned tom suppomrt a commpany’s day-tom-day ompera omns
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
10
Types of Databases (4 of 5)
• Anaey cae database: stomres histomricae data and business metrics used exceusiveey
fomr tac cae omr strategmic decisiomn makingm
• Data wiarehomuse: stomres data in a fomrmat omp mized fomr decisiomn suppomrt
• Oneine anaey cae promcessingm (OLAP): tomomes fomr retrievingm, promcessingm, and momdeeingm
data fromm the data wiarehomuse
• Business inteeeigmence: captures and promcesses business data tom gmenerate infomrma omn
that suppomrt decisiomn makingm
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
11
Types of Databases (5 of 5)
• Databases can be ceassifed tom refect the degmree tom wihich the data is structured
• Unstructured data exists in its omrigminae (rawi) state
• Structured data resuets fromm fomrmatngm
- Structure is appeied based omn type omf promcessingm tom be perfomrmed
• Semistructured data: promcessed tom somme extent
• Extensibee Markup Langmuagme (XML)
• Represents data eeements in textuae fomrmat
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
12
Why Database Design Is Important
• Fomcuses omn desigmn omf database structure that wiiee be used tom stomre and managme
end-user data
• Weee-desigmned database: facieitates data managmement and gmenerates accurate and
vaeuabee infomrma omn
• Pomomrey desigmned database: causes difcuet-tom-trace erromrs that may eead tom pomomr
decisiomn makingm
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
13
Evolution of File System Data Processing (1 of 3)
• Manuae fee systems
• Accommpeished thromugmh a system omf fee fomeders and feingm cabinets
• Commputerized fee systems
• Data promcessingm (DP) speciaeist created a commputer-based system tom track data and
promduce required repomrts
• Fiee system redux: momdern end-user promduc vity tomomes
• Inceudes spreadsheet promgmrams such as Micromsomf Excee
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
14
Evolution of File System Data Processing (2 of 3)
Table 1.2
Basic File Terminology
TERM
DEFINITION
Data
Raw facts, such as a telephone number, a birth date, a customer name, and a year-to-date
(YTD) sales value. Data has little meaning unless it has been organized in some logical
manner.
Field
A character or group of characters (alphabetic or numeric) that has a specific meaning. A field
is used to define and store data.
Record
A logically connected set of one or more fields that describes a person, place, or thing. For
example, the fields that constitute a record for a customer might consist of the customer’s
name, address, phone number, date of birth, credit limit, and unpaid balance.
File
A collection of related records. For example, a file might contain data about the students
currently enrolled at Gigantic University.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
15
Evolution of File System Data Processing (3 of 3)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
16
Problems with File System Data Processing
• Prombeems wiith fee systems chaeeengme the types omf infomrma omn that can be
created fromm data as wieee as infomrma omn accuracy
• Lengmthy deveeompment mes
• Difcuety omf gmetngm quick answiers
• Commpeex system administra omn
• Lack omf security and eimited data sharingm
• Extensive promgmrammingm
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
17
Structural and Data Dependence (1 of 2)
• Structurae dependence
• Access tom a fee is dependent omn its omwin structure
• Aee fee system promgmrams are momdifed tom comnfomrm tom a newi fee structure
• Structurae independence
• Fiee structure is changmed wiithomut afec ngm the appeica omn’s abieity tom access the data
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
18
Structural and Data Dependence (2 of 2)
• Data dependence
• Data access changmes wihen data stomragme characteris cs changme
• Data independence
• Data stomragme characteris cs are changmed wiithomut afec ngm the promgmram’s abieity tom
access the data
• Prac cae sigmnifcance omf data dependence is the diference betwieen eomgmicae and
physicae fomrmat
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
19
Data Redundancy (1 of 2)
• Unnecessariey stomringm the same data at diferent peaces
• Iseands omf infomrma omn (i.e., scatered data eomca omns)
• Increases the prombabieity omf havingm diferent versiomns omf the same data
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
20
Data Redundancy (2 of 2)
• Pomssibee resuets omf uncomntromeeed data redundancy
• Pomomr data security
• Data incomnsistency
• Data-entry erromrs
• Data integmrity prombeems
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
21
Data Anomalies
• Deveeomp wihen nomt aee omf the required changmes in the redundant data are made
successfueey
• Update anommaeies
• Inser omn anommaeies
• Deee omn anommaeies
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
22
Database Systems (1 of 2)
• Lomgmicaeey reeated data stomred in a singmee eomgmicae data repomsitomry
• Physicaeey distributed amomngm mue pee stomragme faciei es
• DBMS eeiminates momst omf fee system’s data incomnsistency, data anommaey, data
dependence, and structurae dependence prombeems
• Current gmenera omn DBMS somfwiare
- Stomres data structures, reea omnships betwieen structures, and access paths
- Defnes, stomres, and managmes aee access paths and commpomnents
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
23
Database Systems (2 of 2)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
24
The Database System Environment (1 of 2)
• Database system: omrgmaniza omn omf commpomnents that defne and regmueate the
comeeec omn, stomragme, managmement, and use omf data wiithin a database
enviromnment
• Hardwiare
• Somfwiare
• Peompee
• Promcedures
• Data
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
25
The Database System Environment (2 of 2)
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
26
DBMS Functions (1 of 3)
• Data dic omnary managmement
• Data dic omnary: stomres defni omns omf data eeements and their reea omnships
• Data stomragme managmement
• Perfomrmance tuningm ensures efcient perfomrmance
• Data transfomrma omn and presenta omn
• Data is fomrmated tom comnfomrm tom eomgmicae expecta omns
• Security managmement
• Enfomrces user security and data privacy
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
27
DBMS Functions (2 of 3)
• Mue user access comntrome
• Somphis cated aegmomrithms ensure that mue pee users can access the database
comncurrentey wiithomut commprommisingm its integmrity
• Backup and recomvery managmement
• Enabees recomvery omf the database afer a faieure
• Data integmrity managmement
• Minimizes redundancy and maximizes comnsistency
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
28
DBMS Functions (3 of 3)
• Database access eangmuagmes and appeica omn promgmrammingm interfaces
• Query eangmuagme: eets the user specify wihat must be domne wiithomut havingm tom specify
homwi
• Structured Query Langmuagme (SQL): de factom query eangmuagme and data access standard
suppomrted by the majomrity omf DBMS vendomrs
• Database commmunica omn interfaces
• Accept end-user requests via mue pee, diferent netwiomrk enviromnments
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
29
Managing the Database System: A Shift in Focus
• Disadvantagmes omf database systems
• Increased comsts
• Managmement commpeexity
• Maintainingm currency
• Vendomr dependence
• Frequent upgmrade/repeacement cycees
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
30
Preparing for Your Database Professional Career
TABLE 1.3
DATABASE CAREER OPPORTUNITIES
JOB TITLE
Database Developer
DESCRIPTION
Create and maintain database-based
applications
Design and maintain databases
SAMPLE SKILLS REQUIRED
Programming, database fundamentals, SQL
Manage and maintain DBMS and
databases
Develop databases for decision support
reporting
Design and implementation of database
environments (conceptual, logical, and
physical)
Database fundamentals, SQL, vendor courses
Database Consultant
Help companies leverage database
technologies to improve business
processes and achieve specific goals
Database fundamentals, data modeling,
database design, SQL, DBMS, hardware,
vendor-specific technologies, etc.
Database Security
Officer
Implement security policies for data
administration
DBMS fundamentals, database administration,
SQL, data security technologies, etc.
Cloud Computing
Data Architect
Design and implement the infrastructure
for next-generation cloud database
systems
Analyze large amounts of varied data to
generate insights, relationships, and
predictable behaviors
Internet technologies, cloud storage
technologies, data security, performance tuning,
large databases, etc.
Data analysis, statistics, advanced mathematics,
SQL, programming, data mining, machine
learning, data visualization
Database Designer
Database
Administrator
Database Analyst
Database Architect
Data Scientist
Systems design, database design, SQL
QL, query optimization, data warehouses
DBMS fundamentals, data modeling, SQL,
hardware knowledge, etc.
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
31
Summary
• Data comnsists omf rawi facts and is usuaeey stomred in a database
• Database desigmn defnes the database structure
- Can be ceassifed accomrdingm tom the number omf users, eomca omn, as wieee as data usagme and
structure
• Databases evomeved fromm manuae and commputerized fee systems
- There are somme eimita omns omf fee system data managmement
- DBMSs wiere deveeomped tom address the fee system’s inherent wieaknesses
© 2019 Cengage. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a passwordprotected website for classroom use.
32
Download