DIV-Introduction

advertisement
Data & Information Visualization
Lecture 1:
Data, Information, Knowledge and
Their Presentations
Data & Information Visualization
Subject site:
http://staff.it.uts.edu.au/~maolin/32146_DIV/
Data, Information, Knowledge
 Data thing: a fundamental, indivisible thing in
databases and data sets.

Can be represented naturally by populations and labels.
 Associations between things.


If an association can be described by a succinct,
computable rule it is called an explicit association.
If an association can not be described by a succinct,
computable rule it is called an implicit association.
 An information thing is an implicit association
between the data things.
 A knowledge thing is an explicit association
between the data things or information things.
Data, Information, Knowledge
 Data: raw, uninterpreted facts
Tom, 20 years old, student, turner
 Information relates items of Data
Tom is 20 years old
 Knowledge relates items of Information
Tom is 20 years old  Tom pays > $1, 500 Insurance
 Modeling the world (Generalise)
[18 − 25] years old  P (accident) = high
Data mining  Knowledge
discovery
Knowledge
Timelines
Timelines
Knowledge
0
100
200
300
400
500
600
700
800
Utterances
0
100
200
300
400
500
600
700
800
Utterances
Visualization
of the output
Visualization
of the output
output
output
Data Mining
Algorithms
input
Data Mining
Algorithms
Visualization
of the input
input
Data
Data
Timelines
Knowledge
0
100
200
300
400
500
600
700
800
Utterances
Visualization
of the output
output
Intermediate
Visualization
Data Mining
Algorithms
Visualization
of the input
input
Data
Source
data
Integrated
datasets
Mapping
attributes
to visualisation
Visualisation
system
….
Generation of
visual models
Visual
models
Model selection
and validation
Model A
….
Model B
Reselection
Remapping
….
Analytical
techniques
Regenerating
• Decision trees
• Association
analysis
• Rule induction
• Clustering
• Graph statistics
Local URL
.com
.dk
24:00
Domains
Time
Visualization
Scientific Visualization
None Graph Visualization
Information Visualization
Graph Visualization
Graph G = (V, E)
The Definition of IV
Information visualization: the use of interactive visual
representations of abstract, non-physically based data
to amplify cognition [CMS99].
[CMS99]
Stuart K. Card, Jock D. Mackinlay, and Ben Shneiderman.
Readings in information visualization: using vision to think.
Morgan Kaufmann Publishers, Inc., 1999.
Xerox Palo Alto Research Center (PARC)
Reference Model
Visualization: Mapping from data to visual form
DATA
Data
VISUAL FORM
Data Tables
Data
Transformations
Visual
Structures
Visual
Mappings
Human Interaction
Views
View
Transformations
Data Tables
 Relational descriptions of data extended to
include metadata
Casei
Casej
Casek
Variablex
Valueix
Valuejx
Valuekx
…
Variabley
Valueiy
Valuejy
Valueky
…
…
…
…
…
…
Analogy to database:
Variable -> attribute; Case -> tuple or record
Data Tables (2)
 Variable Types

N = Nominal


O = Ordinal


Unordered set
Ordered set
Q = Quantitative

Numeric range
 Metadata

Structure
Data Transformations
 Values  Derived Values
 Structure  Derived Structure
 Values  Derived Structure
 Structure  Derived Values
 Examples?
Visual Structures
 Data Tables are mapped to Visual Structures
 Expressive, effective
 Perception…and the human eye…
Why do we need visual structures?
Maps, diagrams, and PERT charts are examples of using
visual representations to see things. A good picture is worth
ten thousand words.
Today, computers help people to see and understand abstract
data through pictures.
Visual Presentations of data
None-relational data & Relational data
The little image dots represent data records of the number of
sun spots, from 1850 to 1993, zoomed in on a small area.
(collected from GVU Center, Georgia I. T.)
An example of using SeeNet to view email data
volumes generated by AT&T long distance
network traffic. Edges represent email
connections. Weigh and colors of edges represent
volumes of email data.
Visual Structures (2)
 Spatial substrates
 Marks
 Graphical properties
Spatial Substrate
 Space is the container unto which other parts
of Visual Structure are poured.





Composition
Alignment
Folding
Recursion
Overloading
Marks
 Points
 Lines
 Areas
 Volumes
 Graphs and Trees – to show relations or links
among objects
Graph-DrivenGraph
Visualization
of Relational Data
Visualization
An example of graph
visualization. This is
the visualization of a
family tree (graph).
Here each image
node represents a
person and the edges
represent
relationships among
these people in a
large family.
Retinal Properties
 Type of graphical property
 Position/Size
 Gray Scale
 Orientation
 Color
 Texture
 Shape
Other Graphical Properties







Crispness
Resolution
Transparency
Arrangement
Color: value, hue, saturation
Table 1.22
Finally, temporal encoding for visual structures
Attributed Visualization
Visualization of collaborative workspace
View Transformations
 Interactively modify and augment Visual Structures
 Location Probes
 Viewpoint Controls


Zoom, pan, clip
Overview an detail
 Distortions

To perceive larger Visual Structure via distortion
Human Interaction and
Transformation
 Direct Manipulation
 Controlling Mappings
Application1:Visual Web browser





WebOFDAV - mapping
the entire Web,
Look at the whole of
WWW as one graph; a
huge and partially
unknown graph.
Maintain and display a
subset of this huge
graph incrementally.
Reduce mouse-click
rate
Maintain a 2D map &
history of navigation
The “lost in hyperspace” problem
 Even in this small document, which could be read in
one hour, users experienced the ‘lost in hyperspace’
phenomenon as exemplified by the following user
comment: ‘ I soon realized that if I did not read
something when I stumbled across it, then I would
not be able to find it later.’ Of the respondents, 56%
agreed fully or partly with the statement, ‘When
reading the report, I was often confused about where
I was.’ [Nielson, 1990].
Visual Web Browser addresses the problem of
“lost in hyperspace” with a sense of “space”.
 Graphic Web Browser addresses the fundamental problem of
“lost in hyperspace” by displaying a sequence of logical
visual frames with a graphic “history tail” to track the
user’s current location and keep records of his previous
locations in the huge information space.
 The logical neighborhood of the focus nodes indicates the
current location of the user, and the tail of history indicates
the path of the past locations during the navigation.
Application2:
File Management
and Site Mapping
Mapping to a Unix root with approx.
3700 directories and files
An example of using Space-Optimized
Tree Visualization for a small web site
mapping (approximately 80 pages)
- viewing techniques needed
Application3: Web Reverse Engineering

HWIT (Human Web Interface Tool) is able to reuse existing structures
of web site by visualizing and modifying the corresponding web graphs,
and then re-generating a new site by save the modified web graphs.
The layout of an existing structure of a web site
Enhancing the existing Web site by adding a sub-site
Application4: B2C e-Commerce

VOS (Visual Online Shop) can be used for online grocery shopping,
shopping cart model. It is applicable to any e-commerce shopping
application (dynamically navigate e-catalogs).
Application5:
Online Business
Process Management



WbIVC (Web-based
Interactive Visual Component)
is applied to a research
project management system
(RPMS) in universities.
A participant can review the
details of a specific process
element by clicking on the
corresponding rectangle, and
then selecting the “open a
process element” in the
popup menu.
A participant can also create
a new artifact (a Java
methods) to a research
project by opening a edit
window.
The output interface of the WbIVC in RPMS
The input interface of the WbIVC in RPMS
Application6:
Program Understanding
and Software Mining




JavaMiner is for non-linear
visual browsing of huge java
code for programming
understanding.
textual data mining
Visualize a variety of
relationships between terms in
Java code, e.g. HAS,
SUBCLASS, CALL and
INTERFACE relationships.
Text documents, the lexicon,
the neighborhood function
The input interface of the WbIVC in RPMS
Conclusion
 Reference model approximates the basic
steps for visualizing information
 Steps are an ongoing process with many
iterations
 Goal of information visualization: develop
effective mappings to increase ability to
think/to improve cognition
Download