Presentation is loading. Please wait.

Presentation is loading. Please wait.

MPDS 2003 San Diego 1 Reducing Execution Overhead in a Data Stream Manager Don Carney Brown University Uğur ÇetintemelBrown University Mitch Cherniack.

Similar presentations


Presentation on theme: "MPDS 2003 San Diego 1 Reducing Execution Overhead in a Data Stream Manager Don Carney Brown University Uğur ÇetintemelBrown University Mitch Cherniack."— Presentation transcript:

1 MPDS 2003 San Diego 1 Reducing Execution Overhead in a Data Stream Manager Don Carney Brown University Uğur ÇetintemelBrown University Mitch Cherniack Brandeis University Alex Rasin Brown University Michael Stonebraker MIT Stan ZdonikBrown University

2 MPDS 2003 San Diego 2 Aurora from the Sky Queries...... App QoS...... App QoS.................. App QoS

3 MPDS 2003 San Diego 3 Aurora from the Sky...... App QoS...... App QoS.................. App QoS

4 MPDS 2003 San Diego 4 Runtime Operation Basic Architecture Scheduler QOS Monitor Box Processors...... Buffer Storage Manager Persistent Store … q1q1 … q2q2 … qiqi … q1q1 … qnqn...... … q2q2  ......  ......  Catalog Router inputs outputs

5 MPDS 2003 San Diego 5 Execution Model Traditional Thread-driven Execution Traditional Thread-driven Execution Thread per query or operatorThread per query or operator Resource management done by OSResource management done by OS Easy to program Easy to program Scalability problems Scalability problems State-based Execution State-based Execution Single scheduler thread maintains execution queueSingle scheduler thread maintains execution queue Small number of worker threads execute execution queue entriesSmall number of worker threads execute execution queue entries Enables application specific allocation of resourcesEnables application specific allocation of resources

6 MPDS 2003 San Diego 6 State-Based vs. Thread-Based

7 MPDS 2003 San Diego 7 Scheduling Two level scheduling Two level scheduling Inter-query scheduling (Which query?)Inter-query scheduling (Which query?) Intra-query scheduling (Operation order?)Intra-query scheduling (Operation order?) Batching Batching Tuple trainsTuple trains Fewer box executions -> fewer scheduling decisions Fewer box executions -> fewer scheduling decisions Also, better memory utilization Also, better memory utilization Superbox schedulingSuperbox scheduling Multiple boxes per decision -> fewer scheduling decisions Multiple boxes per decision -> fewer scheduling decisions Memory utilization: allocate for entire superbox at once Memory utilization: allocate for entire superbox at once State Monitoring (# tuples, latencies, etc) State Monitoring (# tuples, latencies, etc) Incremental and approximateIncremental and approximate

8 MPDS 2003 San Diego 8 Runtime Operation Scheduling: Minimizing Per Tuple Processing Overhead Train Scheduling: A B …xyz A (x)A (y)A (z)B (A (x))B (A (y))B (A (z)) = Scheduler Action AB …xyz B (A (x))B (A (y))B (A (z)) Box Trains: A B …xyz A (z, y, x) B (A (z), A (y), A (x)) Tuple Trains:

9 MPDS 2003 San Diego 9 Tuple Trains and Superboxes

10 MPDS 2003 San Diego 10 Overheads

11 MPDS 2003 San Diego 11 Overheads

12 MPDS 2003 San Diego 12 Other Issues Priority assignment Priority assignment Box Execution Order Box Execution Order QoS QoS

13 MPDS 2003 San Diego 13 Stay Tuned! SIGMOD Demo SIGMOD Demo VLDB ’03 paper “Operator Scheduling in a Data Stream Environment” VLDB ’03 paper “Operator Scheduling in a Data Stream Environment”

14 MPDS 2003 San Diego 14 A little closer...... App QoS...... App QoS............

15 MPDS 2003 San Diego 15 A little closer...... App QoS...... App QoS............

16 MPDS 2003 San Diego 16 Aurora from the Sky Query...... App QoS...... App QoS............ Query


Download ppt "MPDS 2003 San Diego 1 Reducing Execution Overhead in a Data Stream Manager Don Carney Brown University Uğur ÇetintemelBrown University Mitch Cherniack."

Similar presentations


Ads by Google