Day1_IntroductionToStreams3.0

advertisement
Welcome to the
IBM InfoSphere Streams
Developers Conference
December 3-4, 2012
IBM InfoSphere Streams Version 3.0
Roger Rea
IBM InfoSphere Streams Product Manager
1
© 2012 IBM Corporation
Logistics
 Hosts: Roger Rea & Lisa Foisy
 Developer Conference runs from 9:00AM – 5:30PM Central on December 3-4
 Audio in listen only mode is available on the Webcast
 For ability to listen and ask questions during Q&A periods, please use the following
conference information:
– USA Toll-Free: 1-888-426-6840
– USA Caller Paid: 1-215-861-6239
– For Other Countries:
https://www.teleconference.att.com/servlet/glbAccess?process=1&accessCode=7821035
&accessNumber=2158616239
– Participant Code: 7821035
 The Webcast sessions will be recorded and posted to the Streams DeveloperWorks Wiki
 Reminders:
– All phone lines will be muted
– To unmute your phone line for questions, press *6
– This session is being recorded. If you have concerns, please disconnect
2
© 2012 IBM Corporation
Agenda – Monday, December 3
Times are in Central United States time zone
9:00AM – Welcome and Introduction to IBM InfoSphere Streams Version 3.0 (30 minutes) – Roger Rea
9:30AM - Installation Improvements (40 minutes) - Laurie Williams
10:10AM – Streams Console Applications (20 minutes) – Warren Acker
10:30AM – Streams Console Application Graph (30 minutes) – Michael Pfeifer
11:00AM - Morning Break (10 minutes)
11:10AM – Visualization of View Data (35 minutes) – Susan Cline
11:45AM - Lunch Break (60 minutes)
12:45PM – SPL Enhancements (70 minutes) – Howard Nasgaard
1:55PM – Java Operator API Enhancements (30 minutes) – Howard Nasgaard
2:25PM – Streams Studio (60 minutes) – Pete Nicholls
3:25PM - Afternoon Break (10 minutes)
3:35PM – Streams Studio cont'd (60 minutes) – Pete Nicholls
4:35PM – Overview of BigInsights (45 minutes) – Anshul Dawra
5:20PM – Wrap-up
3
© 2012 IBM Corporation
Agenda – Tuesday, December 4
Times are in Central time zone
9:00AM – Introduction
9:05AM - Streams Toolkit Landscape (15 minutes) - Mike Branson
9:20AM - TimeSeries Toolkit (30 minutes) - Bharath Devaraju
9:50AM – Big Data Toolkit (30 minutes) - Manasa Rao
10:20AM - Morning Break (10 minutes)
10:30AM - Geospatial Toolkit (30 minutes) - Mohan Dani
11:00AM - Messaging Toolkit (30 minutes) – Anjali Agarwal
11:30AM - Text Toolkit (30 minutes) - Rachit Arora
12:00PM - Lunch Break (60 minutes)
1:00PM - Toolkit Enhancements and Database Toolkit (20 minutes) - Paul Bye / Mike Accola
1:20PM – DataStage Integration Toolkit (25 minutes) - Mike Koranda
1:55PM – Complex Event Processing (CEP) Toolkit (10 minutes) – Howard Nasgaard
2:05PM – Miscellaneous Enhancements (10 minutes) – Mike Koranda
2:15PM – Documentation Updates (10 minutes) – Cindy Maier
2:25PM – Logging and Tracing (20 minutes) – Jingdong Sun
2:45PM – Stream Connection Health, Blockage, and Congestion (15 minutes) – Denny Hatzenbihler
3:00PM - Afternoon Break (10 minutes)
3:10PM – Introduction to IBM Accelerators (10 minutes) – Raghuram Velega
3:20PM – Social Data Analytics Accelerator (60 minutes) – Raghuram Velega
4:20PM – Telecommunications Event Data Analytics Accelerator (60 minutes) – Roger Rea
5:20PM - Wrap-up
4
© 2012 IBM Corporation
Introduction
InfoSphere Streams Version 3.0
Roger Rea
Product Manager – IBM InfoSphere Streams
5
© 2012 IBM Corporation
Asian
telco
Real-time mediation and
analysis 7B
CDRs per
day
Data processing time reduced
12 hour to 1 minute
Resources Reduced
36 to 12 blades
6
6
© 2012 IBM Corporation
Dublin City
Centre
Improved on-time
performance
600 buses on 150 routes
50 bus locations per second
Location and stop ETA
Automatically generate
routes and stop locations
Expect increase in riders
7
7
7
© 2012 IBM Corporation
University of Ontario
Institute of Technology
(UOIT)
Detect Neonatal Patient
Symptoms Sooner
Up to 24 Hours
Continuously correlate data
Thousands of events
each second
“Helps detect life
threatening conditions
up to 24 hours sooner”
88
Signal Processing and Data
Cleansing
Heart Rate Variability
8
© 2012 IBM Corporation
Big Data Accelerators Facilitating
Smarter Planet and Business Analytics
Analytic Applications
BI /
Exploration / Functional Industry Predictive Content
BI /
Reporting Visualization
App
App
Analytics Analytics
Reportin
g
Smarter
Planet
Business
Analytics
IBM Big Data Platform
Visualization
& Discovery
Application
Development
Systems
Management
Cloud
Computing
Accelerators
Hadoop
System
Stream
Computing
Data
Warehouse
Accelerators often
begin at IBM Research
IBM Research: more
patents than any other
company
Information Integration & Governance
9
Faster time to value
© 2012 IBM Corporation
IBM InfoSphere Streams
Comprehensive
Development Tools
Scale-out Architecture Sophisticated Analytics with
Toolkits & Accelerators
Front Office 3.0
•
•
•
•
•
•
Eclipse IDE
Web console
Drag & Drop editor
Instance graph
Streams visualization
Streams debugger
10
• Clustered runtime for nearlimitless capacity
• RHEL v5.3 and above
• CentOS v6.0 and above
• X86 & Power multicore
hardware
• InfiniBand support
• Ethernet support
• First Steps
 Big Data, CEP, Database, Data
Explorer, DataStage, Finance,
SPSS, Geospatial, Internet, Mining,
Messaging, Standard, Text, Time
Series Toolkits
 Telco, Machine Data & Social Data
Accelerators
© 2012 IBM Corporation
Streams Analyzes All Variety of Data
Mining in Microseconds
(included with Streams)
Acoustic
(IBM Research)
(Open Source)
Text
***New***
Advanced
Mathematical
Models
Simple & Advanced Text
(listen, verb), (included with Streams)
(radio, noun)
(Included with
Streams))
***New***
Predictive
 R( s , a )
t
(Included with
Streams)
t
population
***New***
Geospatial
Statistics
(included with
Streams)
Image & Video
(Open Source)
(Included with
Streams)
11
© 2012 IBM Corporation
Enhanced Bundling
Streams (production license)
Streams
BigInsights
New
Streams Developer Edition
Streams
BigInsights
New
Streams Non-Production
Streams
BigInsights
New
12
Drag/Drop, Visualization
Toolkits: Time Series,
Geospatial, MQ, CEP,
DataStage
Accelerators: Telco,
New
Social, Machine
Streams (production), NonProduction, Developer Edition
includes BigInsights: Limited use
bundle as supporting Program
(Max 5 TB)
Streams RVU pricing, with
Activated Processor Core as the
Resource
Drag/Drop, Visualization
Toolkits: Time Series,
Geospatial, MQ, CEP,
DataStage
Accelerators: Telco,
New
Social, Machine
Streams Developer Edition pricing
is per user. Install as many dev
and test machines as required so
long as all users are licensed. In
this case, Non-Production not
required.
Drag/Drop, Visualization
Toolkits: Time Series,
Geospatial, MQ, CEP,
DataStage
Accelerators: Telco,
New
Social, Machine
Streams Non-Production – only
used in rare cases when per user is
not a suitable metric for the
customer. Usually this is a much
more expensive approach vs
Developer Edition
© 2012 IBM Corporation
InfoSphere Streams Roadmap
Apr 2011 Streams v2.0
•Dynamic Stream Scope Matching
•Tag-based deployment constraints
•Streams Studio ease of use enhancements
•Stream Processing Language improvements
•Nested data types
•Additional Built in operators
•Composite operators
•Common Windowing Library
•Health & Performance monitoring improvements
(As of December 2012)
Nov 2011 Streams v2.0.0.3
1H2013 and beyond
•BigInsights integration capabilities
•Text Analytics & HDFS operators
•DB2 paralell adapters
•Multiple NIC for increased bandwidth
•Optional PKI authentication
•Operator and application design patterns
•End user development
•High Availability enhancements
•Translation of Streams UI
•Web Services & REST interfaces
•Healthcare, Energy & Utility, Deep
Packet Inspection toolkits
•Performance enhancements
•Additional platform support
•Incremental model building
•Auto Parallelization & Loops
Nov 2012 Streams v3.0
February 2010 –
Streams v1.2
• Finance Toolkit
• Mining Toolkit
• Passport Advantage pricing
& support
Mar 2012
Streams v2.0.0.4
• Power Linux support
•Improved usability theme
Drag and drop application composition
Data Visualization
•XML data type support
•Messaging, Time Series and Geospatial
Toolkits
•Telco & Social Data Accelerators
•Enhanced logging
November 2009 –
Streams v1.0.10
• SELinux security
April 2009 – Streams v1.0
• Runtime, tools and adapters
• Improved installation
2002 – 2008 System S
•7 releases to US Government
13
Information regarding potential future products is intended to outline our
general product direction and it should not be relied on in making a
purchasing decision. The information mentioned regarding potential future
products is not a commitment, promise, or legal obligation to deliver any
material, code or functionality. Information about potential future products
may not be incorporated into any contract. The development, release, and
timing of any future features or functionality described for our products
remains at our sole discretion.
© 2012 IBM Corporation
Download