ISPA for Auto

advertisement
IBM GBS March 2015
Predictive Analytics at Work:
Oil and Gas Exploration using Watson and Data Streaming
© 2014 IBM Corporation
Price of Gas What and Why*
Numerical models use facts
e.g. traditional supply & demand
2
•
Tell most consumers or in fact most analysts the price of oil would go from
$110 to $50 but not justify Why it would not be trusted
•
The first shale oil extraction patent was granted in 16841
•
But it was only written about and not included in the models until recently
•
What if the models could learn beyond classic numerical machine learning
•
If the models could understand the text to know that shale oil is going to
have a critical causal relationship on the price of gasoline
•
That would change the What is predicted but more importantly explain with
the Why
1 http://en.wikipedia.org/wiki/Shale_oil_extraction
2 http://www.nasdaq.com/markets/crude-oil-brent.aspx?timeframe=1y
© 2014 IBM Corporation
Numerical elements to computer predicting gas prices
1. Proven reserves
2. Refining capacity for gasoline
versus diesel
1. High Frequency data on
planned pipeline capacity
2. Streaming data on current
demand
Connectors
Ratio of the price of gas to substitutes
•
Price inflection points on supply
1. It becomes economic to uncap hard
2.
3.
3
to recover oil wells
Profitable for shale oil extraction
Heavy oil / oil sands are affordable
(e,g. natural gas, alternative energy,
diesel, etc.)
Watson
© 2014 IBM Corporation
However you cannot easily get the required rest of the model
elements because people only written in text:
Weather
impacting
demand
Slow down in
Growth
Geography (BRIC)
countries GDP
4
Political problems
Venezuela
If OPEC decides
to restrict supply
• Find new reserves
• Improve reserve
Alaskan
• Improve recovery rates
intervention in
• Improve operation
efficiency
drilling or
• Extend
resource life
pipelines
build
out
management
• Improve oil & gas recovery
• Manage and optimize
Discussion on
completions
•well
Chemical
and thermal
recovery design
(bringing
new oil
• In situ combustion
online)
optimization
injections
© 2014 IBM Corporation
Extraction can find, structure and apply content from text:
•
Weather (the average winter temp in NE down by 2)
•
Political (Strikes by PDVSA oil workers paralyze oil production … has doubled
its workforce since the strike of 2003 even though oil production has stagnated
at well below pre-strike levels.)
•
Supply (OPEC Reference Basket slipped heavily from October’s record peak,
sliding $6.41 or 14%)
•
Geography (In 2007 China’s economy expanded by an eye-popping 14.2%
The IMF now reckons China will grow by just 7.8%)
Legend
5
•
Terms in red can be extracted and put into numerical models (e.g. temp down by 2)
•
Terms in green can be extracted and looked up in a reference domain model and then
applied to numerical models (e.g. OPEC = oil supplier, or NE = US East Coast)
•
Words in black are throw away words
© 2014 IBM Corporation
Everyone cares about the price of gasoline
Model with only numerical elements
Supply
Proven reserves * Refining capacity
---------------------------------------------Pipeline capacity
Current demand
Historical Demand for the period
------------------------------------------------Ratio of the price of gas to substitutes
6
© 2014 IBM Corporation
Everyone cares about the price of gasoline
Model with numerical & unstructured elements
Supply
Proven reserves * Refining capacity
---------------------------------------------- *
Pipeline capacity
OPEC & Venezuela factor * Shale Oil *
New Regulations * New completion rates
Current demand
Historical Demand for the period
------------------------------------------------- *
Weather * China demand * Environmentalism
Ratio of the price of gas to substitutes
7
© 2014 IBM Corporation
There is a lot of information, some of it streaming:
Streaming
Data
True vertical
depth
Fluids Data
Surface
Equipment
Data
BHA
Dynamics
Data
Lithological
Data
Structured
Data
Unstructured
Data
Data
Mapping
MWD / LWD
Data
Vibration
Data
Rock
Mechanics
Data
Mud weight
8
© 2014 IBM Corporation
There is a lot of unused unstructured information:
Streaming
Data
Regulations
Well plans
Manufacturer’s
FAQ
Shift End /
Morning
reports
Best
Practices
Structured
Data
Unstructured
Data
Driller's
Network
Data
Mapping
9
© 2014 IBM Corporation
Probability of mud circulation problems is now much more
complete
Model with numerical & unstructured elements
Classical numerical model for
predicting circulation problems
Written assessment of material in the
*
Material in the shakers, Drill plan notes
on fractures or have high permeability, etc.
10
© 2014 IBM Corporation
Where else can your firm Merge Numerical and Unstructured
Other parts of drilling
Plant maintenance
Exploration
Well planning
Environment
Health, Safety, Security and Environment
Major Capital Projects
Trading advisor
etc.
11
© 2014 IBM Corporation
Download