George Klington.A 1
What is a Parallel Database?
Parallel Database Management Systems
Architecture
Parallelism
– Intraquery parallelism
– Interquery parallelism
– Intraoperation parallelism
– Interoperation parallelism
Performance of a database system
[Figure: a DBMS with processors (P), memory (M), and databases (DB)]
Why parallel DBMS?
Improve performance:
Nearly abandoned 15 years ago; interest has been renewed because:
– Extremely large databases -- data collected from the Web
– Decision support queries -- costly on large data
– Hardware has become cheap
– Set-oriented nature of relational DBs lends itself to parallelization
Improve reliability and availability: when one processor goes down, the others can keep the system running
Shared Memory Architecture
Processors
Interconnection Network
Data Storage
Shared Disk Architecture
Memory
Processors
Interconnection Network
Data Storage
Shared Nothing Architecture
Interconnection Network
Memory
Data Storage
Provides linear speed-up
Provides linear scale-up
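The two properties claimed above can be made concrete with the usual textbook definitions: speed-up compares elapsed times for the same workload on a small and a larger system, and scale-up compares a small workload on a small system with a proportionally larger workload on a proportionally larger system. The following is a minimal sketch under those definitions; the function names, the tolerance parameter, and the timings are illustrative assumptions, not taken from the slides.

```python
def speedup(time_small_system: float, time_large_system: float) -> float:
    """Same workload on both systems: small-system time / large-system time."""
    return time_small_system / time_large_system


def scaleup(time_small_job_small_system: float,
            time_large_job_large_system: float) -> float:
    """Workload and system grow by the same factor: ratio of elapsed times."""
    return time_small_job_small_system / time_large_job_large_system


def is_linear_speedup(value: float, resource_factor: float,
                      tolerance: float = 0.05) -> bool:
    """Linear speed-up: N times the hardware gives about N times the speed."""
    return abs(value - resource_factor) <= tolerance * resource_factor


def is_linear_scaleup(value: float, tolerance: float = 0.05) -> bool:
    """Linear scale-up: elapsed time stays about constant, so the ratio is ~1."""
    return abs(value - 1.0) <= tolerance


# Hypothetical timings in seconds, for illustration only.
s = speedup(120.0, 30.0)    # 1 node vs. 4 nodes, same query
u = scaleup(120.0, 120.0)   # 1 node/1x data vs. 4 nodes/4x data
print(s, is_linear_speedup(s, resource_factor=4))
print(u, is_linear_scaleup(u))
```

In practice, startup cost, interference between processes, and data skew tend to push measured values below these ideals.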
Parallelism
Data Partitioning…
Round-robin partitioning
• Efficient for queries that access the entire set of data
Range partitioning
• Can lead to data skew
Hash partitioning
• Data is distributed evenly when the hash function spreads keys well
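The three partitioning schemes listed above can be sketched in a few lines. This is an illustrative sketch only: the function names, the `(id, age)` row layout, the range boundaries, and the use of `zlib.crc32` as the hash function are all assumptions made for the example, not details from the slides.

```python
import zlib
from bisect import bisect_right


def round_robin_partition(rows, n):
    """Send the i-th row to partition i mod n."""
    parts = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        parts[i % n].append(row)
    return parts


def range_partition(rows, key, boundaries):
    """Sorted boundaries split the key space into len(boundaries) + 1 ranges."""
    parts = [[] for _ in range(len(boundaries) + 1)]
    for row in rows:
        parts[bisect_right(boundaries, key(row))].append(row)
    return parts


def hash_partition(rows, key, n):
    """Send each row to partition hash(key) mod n (CRC32 is an assumed choice)."""
    parts = [[] for _ in range(n)]
    for row in rows:
        h = zlib.crc32(str(key(row)).encode("utf-8"))
        parts[h % n].append(row)
    return parts


# Hypothetical (id, age) rows; most ages fall into one range.
rows = [(1, 23), (2, 25), (3, 27), (4, 29), (5, 61), (6, 24)]

rr = round_robin_partition(rows, 3)
rg = range_partition(rows, key=lambda r: r[1], boundaries=[30, 60])
hs = hash_partition(rows, key=lambda r: r[0], n=3)

print([len(p) for p in rr])  # sizes: [2, 2, 2]
print([len(p) for p in rg])  # sizes: [5, 0, 1], an example of data skew
print([len(p) for p in hs])
```

Round-robin balances load but gives no way to locate a row by key; range and hash partitioning let a query on the partitioning key go straight to the relevant partition.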
Interquery vs. Intraquery parallelism
Intraoperation parallelism -- loading/projection
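As a rough illustration of intraoperation parallelism, a single projection can be applied to every partition of a relation at the same time and the partial results concatenated. The sketch below is an assumption-laden example: the function names and the sample rows are hypothetical, and a thread pool merely stands in for the separate processors a parallel DBMS would use.

```python
from concurrent.futures import ThreadPoolExecutor


def project(partition, columns):
    """Keep only the requested columns of each row in one partition."""
    return [{c: row[c] for c in columns} for row in partition]


def parallel_project(partitions, columns, max_workers=None):
    """Run the same projection on every partition concurrently."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the input partitions.
        results = list(pool.map(lambda p: project(p, columns), partitions))
    return [row for part in results for row in part]


# Hypothetical relation, already split into two partitions.
partitions = [
    [{"id": 1, "name": "Ann", "age": 31}, {"id": 2, "name": "Bob", "age": 45}],
    [{"id": 3, "name": "Cy", "age": 28}],
]
print(parallel_project(partitions, ["id", "name"]))
```

Projection parallelizes cleanly because each row is processed independently; operations such as joins or duplicate elimination need the data to be partitioned on the relevant attribute first.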
Interoperation parallelism
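One common form of interoperation parallelism is pipelining: different operators of the same query run at the same time, each consuming the output of the previous one as it is produced. The sketch below shows a scan feeding a selection through a queue. All names are hypothetical, and threads stand in for the separate processors of a real system.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of a stream


def scan(rows, out_q):
    """Producer operator: emit every row, then the end-of-stream marker."""
    for row in rows:
        out_q.put(row)
    out_q.put(_DONE)


def select(in_q, out_q, predicate):
    """Consumer/producer operator: forward only rows matching the predicate."""
    while True:
        row = in_q.get()
        if row is _DONE:
            out_q.put(_DONE)
            return
        if predicate(row):
            out_q.put(row)


def run_pipeline(rows, predicate):
    """Run scan and select concurrently and collect the final output."""
    q1, q2 = queue.Queue(), queue.Queue()
    stages = [
        threading.Thread(target=scan, args=(rows, q1)),
        threading.Thread(target=select, args=(q1, q2, predicate)),
    ]
    for t in stages:
        t.start()
    result = []
    while True:
        row = q2.get()
        if row is _DONE:
            break
        result.append(row)
    for t in stages:
        t.join()
    return result


print(run_pipeline(list(range(10)), lambda x: x % 2 == 0))  # [0, 2, 4, 6, 8]
```

The selection starts working as soon as the first row arrives, so no intermediate result has to be materialized in full.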
Thank You !!!