College of Engineering and Science

advertisement
Intelligent Systems (IS)
Computer Systems Architecture (CSA)
Focus Areas
Introduction for Prospective Graduate Students
Ian Walker
Fall 2012
Outline
 Who and what are we?
 Classes, requirements, planning
 Funding opportunities, assistantships
 Degree options
 Sample research projects
 Q&A
Who are we?
 Loose confederation based upon common research interests
Loose mission statements:
IS: Building smarter machine systems
CSA: Building better/faster computing machines
Who:
IS (9 Professors): Birchfield, Brooks, Burg, Dawson, Groff, Hoover,
Schalkoff, Venayagamoorthy (new!), Walker
CSA (9 Professors): Birchfield, Brooks, Gowdy, Hoover, Ligon,
Schalkoff, Shen, Smith, Walker
Who are we?
Current enrollment
IS: 30-50 graduate students
CSA: 10-25 graduate students
Lab space
IS: Riggs 10, Riggs 13/15/17 (main lab), EIB 258 (main lab)
CSA: Riggs 309 (main lab), EIB 352 (main lab), Cluster room
Shared: Riggs 315/7, EIB 341, ...
Sample Research Areas
 Sensor networks
 Tracking filters and embedded systems
 Physiological monitoring systems
 Nonlinear system modeling and control
 Audio and visual spatial sensing
 Biologically inspired robotics
 (More that are not listed here)
Classes (IS)
Required (all these courses offered once per year) :
ECE 801 - Analysis of Linear Systems
ECE 847 - Digital Image Processing
A 600-level course chosen from (642, 655*, 668)
One of (854, 855, 856, 868, 869, 872, 874*, 877)
*For
Computer Engineering, 649 replaces 655, and 874 is removed from list
Other IS courses (typically offered once per 3 semesters):
804, 805, 854, 856, 872, 893 (various)
courses from other focus areas or departments are allowed
Planning: Take core early, figure out what you would like to do
See p. 35 of http://www.clemson.edu/ces/departments/ece/document_resource/grad/Grad_Student_Handbook_2011
Classes (CSA)
Required:
A software course (ECE 617, 852, 855, or 873)
An architecture course (ECE 629, 668, 842, or 851)
A networks course (ECE 640, 649, 848, or 849)
Other CSA courses:
any from the above lists
courses from other focus areas or departments are allowed
Note: 693 and 893 are used for new courses. Be sure to sign
up for the right section number.
See p. 32 of http://www.clemson.edu/ces/departments/ece/document_resource/grad/Grad_Student_Handbook_2011
Advisors
Selecting a faculty advisor is a two-way decision
All faculty use different criteria for evaluating students
Performance in core course taught by that professor
Evaluation of volunteer or startup work in lab
Probationary period
Assistance to PhD or senior graduate student
Funding
Grading Assistantship (GA) - assist prof. with a course
Teaching Assistantship (TA) - teach lab sections
Research Assistantship (RA) - assist prof. in funded project
GAs and TAs are administered by department
RAs are generally offered to PhD students, or sometimes
masters students showing potential and commitment for
PhD
You do not need funding to get involved in research
Degree options
Majors (at masters and PhD level):
Computer Engineering (CpE)
Electrical Engineering (EE)
Options:
Focus area (IS is one of six areas in department)
Non-thesis (coursework only)
33 hours (11 courses)
Thesis
30 hours (8 courses + research)
best to examine options after first semester completed
typically work with PhD student
probably adds a semester - 2 years total
Direct-PhD
60 hours (14 courses + research)
saves 2 courses compared with Masters + PhD
possible to get an MS along the way
For details, see http://www.clemson.edu/ces/departments/ece/document_resource/grad/Grad_Student_Handbook_2011
Recent graduates
Ph.D. students - Academic Positions
Clarkson University at Potsdam, New York
University of Michigan at Ann Arbor, Michigan
Louisiana State University, Louisiana
University of Florida
Ph.D. students - Industrial Positions
Lucent Technology in Connecticut
Oakridge National Laboratories in Tennessee
Mayo Clinic in Minnesota
MS Students - Ph.D. Pursuits
German Aerospace Institute in Germany
Stanford University in California
MS Students - Industrial Positions
General Electric in Virginia
IBM in North Carolina
Intel in Columbia and San Francisco
Yahoo! in California
Harris in Florida
GM-Fanuc in Michigan
Name: Kumar Venayagamoorthy
Focus Area: Power/IS
http://www.people.clemson.edu/~gvenaya/
Research Area:
Real-time power systems
Current Projects:
Smart Grid research
Name: Richard Brooks
Focus Area: CSA/IS
http://www.clemson.edu/~rrb
Research Area:
Distributed Systems / Information Assurance / Coordination
Current Projects:
AFOSR – Detection of Tunnelled Communications Protocols
Industry – Data Leak Prevention
NSF – Network Security Experimentation with GENI
Department of State – Internet Liberty Support for West Africa
Relevant courses:
ECE449 / 649
Data Leak Prevention (DLP) solutions monitor and control data flow
Current DLP solutions are syntax based
- Hash functions
- Regular expressions
- Keyword search
We focus on data semantics
Singular value-based approach
Apply singular value decomposition to term-document matrix.
Find concepts by retaining a number of dimensions.
Hidden Markov Model (HMM)-based approach
Build HMMs based on terms we retained in singular value-based method.
Singular value
decomposition
Find transition probabilities of each document and estimate the probabilities of unobserved transitions.
Probabilistic Context-Free Grammar (PCFG)-based approach
Obtain parse trees of sentences in training documents.
Identify features in the parse trees.
Transmission
Cache
VLSI
…….
Distributed Denial of Service Attack (DDoS) Analysis
WiMAX BCR System Parameters and DDoS Attack Analysis
•
Factorial Experimental Design and ANOVA analysis of avg. throughput Ns-2 simulator used for software
simulations
Real software-defined radio testbeds used for hardware simulations
1%
8%
4%
NS-2 Software Simulations
7%
0%
0% frame_duration,
0% X1
22%
31%
3%
6%
number_of_attac
kers, X2
dos_backoff_star
t, X3
18%
•
1%
0% 4%
1%
Setup the network using Clemson University GENI resources.
Use Operational Network traffic.
Generate DDoS attack traffic using Clemson Condor Cluster.
Analyze performance of DDoS detection methods.
11%
Backoff_start, X1
56%
Throughput of Indoor
Performance Analysis of DDoS Detection Methods on Operational Network
-
42%
Backoff_end, X2
X1*X2
Other
85%
Throughput of Outdoor
•
•
•
A bootable USB drive with the Linux system will access the proxy network.
The proxy network deploys botnet which changes DNS and IP address to avoid detection and tracking.
With this, the democracy advocates, NGOs, and journalists are protected from network censorship and
surveillance.
Detecting Hidden Communications Protocols
•
Protocol analysis of Tor through side-channel attacks
–
–
–
•
Protocol represented as a hidden Markov model (HMM)
Side-channel information: delays between packets
Using zero-knowledge HMM inference algorithm to rebuild the model, i.e. the protocol used by A.
Botnet traffic detection
-
Infer HMMs from botnet timing data
Use confidence interval approach to detect botnet traffic
Result: 95% TP and 2% FP
Name: Melissa Smith
Focus Area: CSA
http://www.parl.clemson.edu/~smithmc/
Research Area:
High-Performance Reconfigurable Computing/ Heterogeneous Computing
Current Projects:
Heterogeneous Mapping and Acceleration of Scientific Algorithms
Acceleration of Gene Co-Expression Network Generation
Performance Models for Hybrid Computing
Exploration of Concurrent Biometric Algorithms for Emerging Reconfigurable Architectures
Relevant courses: (ECE 668, 845, 842, 873, 893)
SNNs Optimizations with Multi-Core
Architectures
Spiking Neural Networks (SNN): preferred
neural network models for simulating the
biological behavior of a neuron
Ultimate goal of scientists:
Izhikevich’s Model
Flop/Byte : 0.65
Wilson model: Flop/Byte: 0.86
Level 1
Morris Lecar Model Flop/Byte:4.71
HH Model Flop/Byte : 6.02
HH model Speedup for different Architectures
900
Fermi GPU, OpenCL
800
Speedup
Model mammalian brain activity
(1011 neurons – 1014 synapses)
Object recognition/identification
Two-level character recognition network
Level 2
w/ two SNN models:
Fermi GPU, CUDA
700
Telsa 870, OpenCL
600
Telsa 870, CUDA
500
AMD GPU, OpenCL
Intel Xeon
400
AMD Opteron
300
IMB PS3
200
100
0
0
2
4
6
Neurons (millions)
Results published in HiCOMB’10, Journal of Supercomputing, & Concurrency and Computation
Exploring Multiple Levels of Heterogeneous
Performance Modeling
Synchronous Iterative GPGPU Execution
(SIGE) model
Regression models for
Use Synchronous Iterative GPGPU Execution
CPU/GPU computations
using Algorithm FLOPS (SIGE) Model for Synchronous Iterative
Algorithms (SIAs)
and Bytes
Relevant Equations describing the SIGE
Model
Texecution = ∑Tcomp. + ∑Tcomm.
Tcomp.= Tpre-process + Tpost-process + TCPU + TGPU
TGPU = TGPU-Kernel + TPCIE-Transfers
TPCIE-Transfers = Thost-to-device + Tdevice-to-host
Tcomm. = ∑Tnetwork-transactions
Initial validation of low-level abstraction
model for GPGPU clusters
Regression-based performance prediction
framework
SIA case studies: Spiking Neural Network
Regression models for PCIE
and Infiniband using micro- (SNN) models
benchmarks
Achieved over 90% prediction accuracy
Gene Co-Expression Network
Construction
Correlation Matrix
Calculation
Threshold Calculation
40
Running Time (Hours)
Running Time (Hours)
25
20
18X Faster
15
10
5
0
C
OPT
10
Accelerating construction of
gene co-expression
networks, which analyze the
relationships among
thousands of genes
Previous techniques were
slow and use excessive disk
space
JAVA
Data Storage Requirement
40
•
35
7X Smaller
30
File Size (GB)
•
20
0
JAVA
•
45X Faster
30
25
20
15
10
•
5
0
JAVA
C
SYMM
BIN
C
OPT
Our acceleration has allowed
generation of hundreds of
gene networks of multiple
sizes and types (rice, yeast,
and human) for in-depth
analysis never before
possible
Future work with GPUs and
other accelerators will
provide additional
Robust Facial Recognition with Highly-Parallel Architectures
The rapidly growing field
of biometrics uses
physical features to
perform identity
authentication.
Several facial recognition
algorithms have been
developed that can adapt to
particular types of image
variation, but no single
algorithm can provide robust
identification.
Facial recognition is the
user’s most convenient
biometric but often suffers
from poor performance,
especially in applications
with wide image variation.
FPGAs and GPUs provide the
necessary parallelism to run
multiple algorithms
simultaneously and fuse their
results together to enable
accurate recognition.
Name: Walt Ligon
Focus Area: CSA
http://www.parl.clemson.edu/~walt
Research Area:
Parallel Computing, Parallel File Systems, Programming Environments
Current Projects:
Parallel Virtual File System (PVFS)
High End Computing I/O Simulator (HECIOS)
Relevant courses: ECE 851, 873, 329, 493 (MPI)
Name: Robert Schalkoff
Focus Area: CSA/IS
http://www.ece.clemson.edu/iaal/index.html
Research Area:
Soft Computing/Parallel Programming
Current Projects:
An algebraic framework for multi-class motion estimation
using unsupervised learning with GPU implementation
Relevant courses: ECE 856, ECE 855, ECE872, ECE 642, ECE 847
An algebraic framework for multi-class motion estimation
using unsupervised learning with GPU implementation
Optical flow constraint equation (OFCE) is
Ix* u + Iy* v + It = 0
Pixel locations that suffer aperture
problem have rank-deficient system.
The min-norm solution of rank-deficient
system leads to motion estimates with low
confidence. High confidence is associated
with vectors that do not suffer aperture
problem.
Motion vectors (u,v) are separated into
two sets; one set of vectors (Hp) that
suffer aperture problem and another set
of vectors (Hc) that do not.
Implementation with NVIDIA CUDA: Compute Unified Device Architecture
SM 0
Kernels for Motion Estimation:
1. Gradients
2. Local Motion
3. SOFM/NG
SP SP
SP SP
Shared
Memory
SP
SP
SP
SP
1
Texture
Memory
Global
Memory
SM 0
SP SP
SP
SP SP
SP
Shared
Memory
2
A
Mutable
B
C
B
A
C D
Immutable
SP
SP
D
Name: Haiying Shen
Focus Area: CSA
http://www.ces.clemson.edu/~shenh
Research Area:
Distributed computer systems and computer networks
Current Projects:
Leveraging Hierarchical DHTs and Social Networks for P2P Live Streaming
P2P File Storage and Sharing System for High-End Computing
Pervasive Data Sharing Over Heterogeneous Networks
File Replication and Consistency Maintenance in Pervasive Distributed Computing
Hybrid Wireless Networks
Self-organizing P2P-based File Storage System in HPC
Relevant courses: ECE 429/629, ECE 893
P2P Live Streaming/VoD
Social network
• Internet-based video streaming
applications attract millions of
online viewers every day.
C
A
• The incredible growth of viewers
and dynamics of participants have
posed a high quality-of-service
(QoS) requirement.
channel B
cluster
• Goals: high scalability, availability,
low-latency.
DHTs
(Channels)
channel
cluster
n
channel
cluster
Pic from http://www.fmsasg.com/SocialNetworkAnalysis/
Features:
(1) Distributed Hash Table is constructed for
content delivery to increase scalability,
availability
(2) Social network is used for accurate conten
recommendation and channel switch to reduc
video delivery latency
(Images captured from paper Flexible Divide-and-Conquer protocol for multi-view peer-to-peer live streaming,
Preliminary results published in ICPP10, Infocom11, IEEE TPDS 11
P2P’09)
GENI Experiments on P2P, MANET, WSN
Networks
We will implement three existing data sharing algorithms on the P2P, MANET and WSN networks, thus
identify and investigate potential issues in the data sharing applications in heterogeneous networks.
Data sharing in P2P networks
(Cycloid P2P)
Locality-based distributed data sharing
protocol (LORD) in MANETs
Features:
(1) Energy-efficient & scalable.
(2) Reliable & dynamism-resilient.
(3) Similarity search capability
Features:
(1) Constant maintenance overhead regardless of
the system scale.
(2) Scalability, reliability, dynamism-resilience, selforganizing.
Number of
nodes:
100
Dimension:
6
Node failure
rate:
0.1-1 natural
Lookup/Insert
interval
10-100s to
every node
Number of nodes (ORBIT)
100
Total lookups
10000
Moving speed dist. (m/s)
[0.5-2.5], [1-5], [20-30]
Spatial-temporal similarity
data sharing (SDS) in WSNs
Number of sensors
128
Node in zone
9
LSH destinations
5
Features:
(1) Efficient spatial/temporal similarity data storage.
(2) Fast query speed.
(3) Low energy consumption.
Leveraging P2P in HPC/Cloud Computing

P2P network is well-known for scalability,
reliability and self-organizing





P2P-based Resource Management


Effective and efficient P2P content delivery
algorithm design (TC11, TPDS10,
INFOCOM11, IPDPS08)
P2P-based Reputation Management




Social network based P2P overlay
construction (under review of INFOCOM12 )
Locality aware P2P overlay construction
(CCNC 09)
Interest aware P2P overlay construction
(CCGRID 09)
User behavior pattern aware P2P overlay
construction (In preparation for IPDPS 12)
Social network Collusion detection (IPDPS11)
Spam filtering (INFOCOM11)
Game theory based cooperation incentive
analysis (ICCCN09, TMC)
Grid computing
P2P-based File Storage System in HPC


File replication (JPDC09)
File consistency maintenance (TPDS11)
Pic from http://innovationsimple.com/web-hosting/cloud-hosting-web-hosting/benefits-of-cloud-computing/
Cloud computing
Name: Darren Dawson
Focus Area: IS
http://www.ece.clemson.edu/crb/welcome.htm
Research Area:
Nonlinear Control and Estimation for Mechatronic Systems
Current Projects:
Following 3 Slides
Relevant courses: ECE 874, 801
Visual Servoing of Robot Manipulators
 Problem: Control of Moving Objects in an Unstructured Environment is Difficult
due to the Corrupting Influences of Camera Calibration with regard to Task
Planning
 Solution: Close the Control Loop with
Camera Measurements
 Testbed Features a High-Speed
Real-Time Camera System
 2.5D Visual Servoing
 Design a Controller to Regulate
the Position and Orientation
of the End-Effector
 Control Strategy Uses Both 2D
Image-Space and 3D Task-Space
Information
Next Generation Hardware-in-the-Loop
Ground Vehicle Steering Simulator
Custom Honda CRV steering
simulator with electric servo-motors
Test platform supports development of
advanced ground vehicle steering
technology using concepts from
“robotics” field
Also examining in-vehicle operator
feedback channels
• Visual (scene, lights)
• Haptic (steering wheel, …)
• Audio (tones/chimes/voice)
Human subject testing
Advanced Automotive Thermal Management Systems - Smart
Components
 Goal is to improve the engine’s cooling/heating
system operation using mechatronic technology
•
Improved fuel economy
•
Reduced tailpipe emissions
•
Flexible thermal system design
•
Enhanced control of engine temperatures
 Replace mechanical cooling system equipment
with electric/hydraulic-driven components
 Develop mathematical thermal models
Name: Tim Burg
Focus Area: IS
http://www.clemson.edu/~tburg
Research Area:
Nonlinear Control Applications
Current Projects:
Unmanned Aerial Vehicles
Biofabrication
Haptics
Environmental Monitoring
Relevant courses: ECE 874, 801
Bioprinting
 Bioprinting - an approach to tissue engineering
 Cells are precisely placed in a 3D structure using
inkjet printer technology.
 Active collaboration with Bioengineering.
 ECE research focused on system integration,
modeling, and control.
Haptics
 Objective Is to identify, demonstrate, and quantify the
potential benefits of specialized haptic user interfaces
within a collaborative environment.
Name: Stan Birchfield
Focus Area: IS
http://www.ces.clemson.edu/~stb
Research Area:
Computer Vision
Current Projects:
Vision-based mobile robot navigation
Vehicle traffic monitoring
Robotic laundry handling
Relevant courses: ECE 847, 877, 904
Vision-Based Mobile Robot Navigation
 Mobile robot equipped with single, off-the-shelf inexpensive camera
 Developing algorithms for
 Traversing a known path by comparing the coordinates of tracked feature points
 Detecting doors in indoor environments for navigation
 Following a person moving about the environment, maintaining a desired distance
 Applications: courier robots, tour guides, physician assistance
Vehicle Traffic Monitoring Using Cameras
 Developing algorithms for detecting,
tracking, and classifying vehicles
automatically using video
 Low-angle cameras cause occlusion and
spillover
 Shadows, reflections, and environmental
conditions are addressed using a
combination of feature tracking and
pattern detection
 Applications:
 intelligent transportation systems (ITS)
 incident detection and emergency
response
 data collection for transportation
engineering applications
Adam Hoover
Focus Area: IS/CSA
http://www.ces.clemson.edu/~ahoover/
Research Area:
Tracking systems, embedded systems
Current projects:
See the next 2 slides
Relevant courses: ECE 854, 668
Bite Counter
1 in 3 Americans is obese, another 1 in 3
is overweight; worldwide there are more
overweight than underfed people
Worn like a watch
Automatically tracks how many
bites of food have been taken
•
•
•
Bite count vs calories for 54 meals
2011-2012 large cafeteria experiment in main campus dining hall
Equipment and software for recording and correlating video, scale, gyroscope data
Signal analysis to improve bite detection accuracy and bite:calorie correlation
Ultrawideband Position Tracking
same
idea
Trilateration measures distances from a set of transmitters to a receiver to calculate position.
•
•
•
Ubisense system in Riggs basement
Particle filter methods to improve accuracy
Noise modeling, combination with other sensors
and other sources of information such as maps
Richard Groff
Focus Area: IS
http://www.ces.clemson.edu/~regroff
Research:
Robotics and control applications at small length scales
Computational and Experimental Tissue Modeling
Biomimetics
Current Projects:
Synthetic butterfly proboscises
Biofabrication and Tissue Modeling
(under revision)
Relevant coursework:
ECE801 (linear systems), ECE847 (digital signal processing)
for some projects, some background in magnetostatics, solid mechanics,
materials science, and/or molecular biology desired
Synthetic Butterfly Proboscis
Proboscis
Experimental
Platform for
Magnetic
Microfibers
 Butterflies can drink fluids of widely varying
viscosities by controlling the shape of their
feeding tube (probosicis)
 Using custom fibers from Materials Science
Department, generate a synthetic proboscis that
can sample widely varying fluids
 Fibers are paramagnetic or piezoelectric
 Control fiber shape using magnetic or electric
fields
 Preliminary work on modeling and position control
of magnetic microfibers
Tissue Engineering via Biofabrication
Fluorescent-dyed murine D1 mesenchymal stem cells (red)
and murine mammary cancer cells (red)
 Biofabrication – develop a system to place living
cells in 3D patterns mimicking native tissue
 many subprojects
 Develop computational model for interaction of
tumor cells and epithelial stem cells
 “Tissue Description Language”
 Specify Describe initial condition for
computational model
 Specify structure for biofabrication
 Use TDL to study systems biology problems in
cancer. (Feedback via intercellular signalling)
Name: Ian Walker
Focus Area: IS/CSA
http://www.ces.clemson.edu/~ianw/
Research Area:
Robotics
Current Projects:
Trunk and tentacle robots
Intelligent Robotic Workstations
Relevant courses: ECE 655, 868, 869
Invertebrate’ robot trunks/tentacles
Animated Architecture
 Integrate Robotics and Architecture
 Goal “Animated Work Environment”
What should you do next?
Find out more about specific research projects
web, senior graduate students, faculty
Contact potential advisors about projects, openings
faculty attending this meeting may be recruiting currently
Either
a) Mutually agree on advising relationship
OR
b) Establish criteria for being evaluated/considered
OR
c) Seek another advisor/project
Q&A
?
Download