Research on OGSA-DAI/DQP Part I

advertisement
Research on OGSA-DAI/DQP
Part I
Alexander Wöhrer
U i
University
it off Vienna
Vi
Faculty of Computer Science
Institute for Scientific Computing
woehrer@par.univie.ac.at
Active vs passive DBMS usage
„
Passive DBMS usage:
„
„
„
⇒
Client
DBMS
DMBS acts as pure data
d
provider
id
all application code @ service level
pull mechanism for data to be processed
request
JDBC
result
Data
service
Active DBMS usage:
⇒
⇒
⇒
⇒
DBMS acts as technology and data provider
request
interface code @ service level
main application code @ DBMS level
Client
push mechanism for result data
OGSA-DQP Meeting,
NeSC March. 2009
Data Service
Interface
result
DBMS
Code
2
Active Database Integration:
g
Java in the Database
„
„
„
most of common vendors support Java Stored
Procedures in some way
Static methode of Java class as program entrance
point
P tG SQL 8
PostGreSQL
8.3
3
„
„
„
Oracle 10g
„
„
„
„
supports standard JVM
not much development support
comes with OracleJVM
implements a non
non-preemptive
preemptive threading model
rich support for Web-Service call-ins/call-outs via JPublisher
All have kind of server side JDBC support
OGSA-DQP Meeting,
NeSC March. 2009
3
Active Database Integration
g
Implementation
sendData(tuples)
Database host
Why tuple serialization?
„
First Oracle JDBC call
(evaluate with plan) blocks,
as at least one child thread
is still running
Evaluator WS
interface
serialize
tuples
Tuples in
Files
de-serialize
tuples
evaluate(query partition)
ExIn
DBMS
client
JDBC
SOAP
ExOut
„
Join
Table
OGSA-DQP Meeting,
NeSC March. 2009
server
JDBC
Scan
⇒
PostGreSQL wouldn‘t
wouldn t block
but we had to block it,
otherwise „forgotten
statement“
these restriction lead to
tuple
p serialization via files
for both implementations
4
Lessons learned…
„
OGSA-DQP 3.2 TP tends to over-parallelize
„
„
„
due to poor estimates
d
i
off static
i distribution
di ib i costs
average row size provides insufficient information
average column size needed
„
„
„
providing the base information to calculate better
distribution costs
memory requirements
avoid service hops
„
Getting rid of artifical OGSA-DQP evaluator layer
on top of OGSA-DAI results in
OGSA-DQP Meeting,
NeSC March. 2009
5
Austrian Grid
„
Medical application
pp
„
„
breath gas analysis
Focus on security issues:
„
„
Single Point of Access (SPOA) cloud
computing concept based on OGSA-DAI
and Nimbus
Fine-grained data access for row level
security
OGSA-DQP Meeting,
NeSC March. 2009
6
China‘s Railwayy Freight
g
Transportation Grid
„
„
One year visitor PhD student Yan Zhang @ ISC
Application characteristics:
„
„
„
„
„
dedicated network
data integration and
analysis
1 waybill record
around 4kb
150.000 per day
220 GB / year
OGSA-DQP Meeting,
NeSC March. 2009
7
Download