What I am doing Amol Deshpande. Selection Ordering  Given a set of selection predicates and correlations between them, find the optimal ordering : Not.

Slides:



Advertisements
Similar presentations
Query Optimization Reserves Sailors sid=sid bid=100 rating > 5 sname (Simple Nested Loops) Imperative query execution plan: SELECT S.sname FROM Reserves.
Advertisements

 Will help you gain knowledge in: ◦ Improving performance characteristics ◦ Reducing costs ◦ Understand regression analysis ◦ Understand relationships.
CS4432: Database Systems II
Databases and Information Systems 1 Prof. Dr. Stefan Böttcher Fakultät EIM, Institut für Informatik Universität Paderborn WS 2009 / 2010 Contents: selectivity.
CSE544 Database Statistics Tuesday, February 15 th, 2011 Dan Suciu , Winter
Representing and Querying Correlated Tuples in Probabilistic Databases
EXAMPLE 3 Construct a binomial distribution Sports Surveys
MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3.
CORE 1 Patterns in Chance. Daily Starter Begin Handout.
IntroductionAQP FamiliesComparisonNew IdeasConclusions Adaptive Query Processing in the Looking Glass Shivnath Babu (Stanford Univ.) Pedro Bizarro (Univ.
Navigation Jeremy Wyatt School of Computer Science University of Birmingham.
Exploiting Correlated Attributes in Acquisitional Query Processing Amol Deshpande University of Maryland Joint work with Carlos Sam
Flow Algorithms for Two Pipelined Filtering Problems Anne Condon, University of British Columbia Amol Deshpande, University of Maryland Lisa Hellerstein,
CMSC724: Database Management Systems Instructor: Amol Deshpande
Database System Concepts 5 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 14: Query Optimization.
Model-Driven Data Acquisition in Sensor Networks - Amol Deshpande et al., VLDB ‘04 Jisu Oh March 20, 2006 CS 580S Paper Presentation.
CS186 Final Review Query Optimization.
Dependency-Based Histogram Synopses for High-dimensional Data Amol Deshpande, UC Berkeley Minos Garofalakis, Bell Labs Rajeev Rastogi, Bell Labs.
SPS ITRT April Computation/ Estimation Number & Number Sense Measurement/ Geometry Probability/ Statistics Patterns, Functions, & Algebra
Efficient Query Evaluation over Temporally Correlated Probabilistic Streams Bhargav Kanagal, Amol Deshpande ΗΥ-562 Advanced Topics on Databases Αλέκα Σεληνιωτάκη.
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
IRDM WS Source: Arnold O. Allen, Probability, Statistics, and Queueing Theory with Computer Science Applications, Academic Press, 1990 Reference.
Using car4ams, the Bayesian AMS data-analysis code V. Palonen, P. Tikkanen, and J. Keinonen Department of Physics, Division of Materials Physics.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Query Evaluation Chapter 12: Overview.
General Statistics Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) Each has some error or uncertainty.
Access Path Selection in a Relational Database Management System Selinger et al.
Query Optimization. overview Histograms A histogram is a data structure maintained by a DBMS to approximate a data distribution Equiwidth vs equidepth.
EN : Adv. Storage and TP Systems Cost-Based Query Optimization.
CMSC424: Database Design Instructor: Amol Deshpande
computer
Lesley Charles November 23, 2009.
End-biased Samples for Join Cardinality Estimation Cristian Estan, Jeffrey F. Naughton Computer Sciences Department University of Wisconsin-Madison.
1 Beginning & Intermediate Algebra – Math 103 Math, Statistics & Physics.
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
Histograms for Selectivity Estimation
1 Relational Query Optimization Chapter Query Blocks: Units of Optimization  An SQL query is parsed into a collection of query blocks :  An SQL.
Optimal Bayes Classification
Aircraft Windshield Failures Statistical Methods for Reliability Engineering Professor Gutierrez-Miravete Erica Siegel December 4, 2008.
Statistics. A two-dimensional random variable with a uniform distribution.
CS 4432estimation - lecture 161 CS4432: Database Systems II Lecture #16 Query Processing : Estimating Sizes of Results Professor Elke A. Rundensteiner.
Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.
Robust Estimation With Sampling and Approximate Pre-Aggregation Author: Christopher Jermaine Presented by: Bill Eberle.
1 1 Slide Simulation Professor Ahmadi. 2 2 Slide Simulation Chapter Outline n Computer Simulation n Simulation Modeling n Random Variables and Pseudo-Random.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 4.
Where to Get Data? Run an Experiment Use Existing Data.
MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 4.
HASE: A Hybrid Approach to Selectivity Estimation for Conjunctive Queries Xiaohui Yu University of Toronto Joint work with Nick Koudas.
Chapter 3 Statistical Models or Quality Control Improvement.
Page The Effect of AWGN on the Accuracy of Time of Arrival Detection Rick Enns IEEE a July 2004.
Dept. Computer Science, Korea Univ. Intelligent Information System Lab. 1 In-Jeong Chung Intelligent Information System lab. Department.
MARCH 18, 2014 DATA ANALYSIS. WHAT TO DO WITH DATA Take a look at your data Histogram Descriptive statistics Mean, mode, range, standard deviation/standard.
Chiu Luk CS257 Database Systems Principles Spring 2009
Discrete Probability Distributions
S519: Evaluation of Information Systems
Ask 100 randomly chosen people the following 6 QUESTIONS…
AP Statistics: Chapter 7
Summarising and presenting data - Univariate analysis continued
Linear Regression.
Akshay Tomar Prateek Singh Lohchubh
Sam Madden MIT CSAIL With Amol Deshpande (UMD), Carlos Guestrin (CMU)
Managing batch processing Transient Azure SQL Warehouse Resource
Additional notes on random variables
Additional notes on random variables
Lecture 27: Optimizations
Advanced Mean Field Methods in Quantum Probabilistic Inference
Kabra and DeWitt presented by Zack Ives CSE 590DB, May 11, 1998
Optimizing Sequences of DB Operations
Elementary statistics, bluman
Lecture 27 Wednesday, December 5, 2001.
Continuous distribution curve.
Presentation transcript:

What I am doing Amol Deshpande

Selection Ordering  Given a set of selection predicates and correlations between them, find the optimal ordering : Not known if solvable  Relation to eddies Correlated predicates What kinds of probability distributions to learn ? Using such learned probability distributions  Extensions to join order optimization

Statistics for Query Optimization  Histograms : Multidimensional histograms  Using histograms (or other summary statistics) to compute intermediate result sizes optimally  Effect of more accurate statistics on the query plan chosen