PROJECT ON STATISTICS Viju Thomas Sridhar Srikanth A Viju Thomas Sridhar Srikanth A.

Slides:



Advertisements
Similar presentations
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Advertisements

2- 1 Chapter Two McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Biostatistics Unit 4 - Probability.
CHAPTER 2 ORGANIZING AND GRAPHING DATA. Opening Example.
QMS 6351 Statistics and Research Methods Chapter 2 Descriptive Statistics: Tabular and Graphical Methods Prof. Vera Adamchik.
Ch. 2: The Art of Presenting Data Data in raw form are usually not easy to use for decision making. Some type of organization is needed Table and Graph.
Chapter 3 Cumulative Test Sample 30 people spent on books ($)
Analysis of Research Data
CHAPTER 6 Statistical Analysis of Experimental Data
Descriptive statistics (Part I)
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
Probability and Statistics in Engineering Philip Bedient, Ph.D.
SECTION 12-1 Visual Displays of Data Slide
Frequency Distributions and Graphs
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
12.1 – Visual Displays of Data In statistics: A population includes all of the items of interest. A sample includes some of the items in the population.
2- 1 Chapter Two McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Histograms, Frequency Polygons Ogives
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Chapter 9 Statistics Section 9.1 Frequency Distributions; Measures of Central Tendency.
CHAPTER 1 Basic Statistics Statistics in Engineering
BY ANITHA.C ASHA V DEEPTHI.J SHALINI. OBJECTIVE: The main objective of our project is collection, classification, analysis and interpretation of data.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
AP Stats Chapter 1 Review. Q1: The midpoint of the data MeanMedianMode.
PROJECT ON STATISTICS Srikanth A Srikanth A. STATISTICS PROJECT REPORT STATISTICS PROJECT REPORT Goal The goal of doing this project was to empower ourselves.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
STATISTICS. Statistics * Statistics is the area of science that deals with collection, organization, analysis, and interpretation of data. * A collection.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Subbulakshmi Murugappan H/P:
Chapter 11 Data Descriptions and Probability Distributions Section 1 Graphing Data.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 11 Review Important Terms, Symbols, Concepts Sect Graphing Data Bar graphs, broken-line graphs,
Chapter Eight: Using Statistics to Answer Questions.
CHAPTER 1 Basic Statistics Statistics in Engineering
STATISTICS AND OPTIMIZATION Dr. Asawer A. Alwasiti.
FARAH ADIBAH ADNAN ENGINEERING MATHEMATICS INSTITUTE (IMK) C HAPTER 1 B ASIC S TATISTICS.
Probability Distributions, Discrete Random Variables
9.1 – Visual Displays of Data Objective – TSW construct visual displays of data. Chapter 1.
Histograms, Frequency Polygons, and Ogives. What is a histogram?  A graphic representation of the frequency distribution of a continuous variable. Rectangles.
Descriptive Statistics – Graphic Guidelines
Histograms, Frequency Polygons, and Ogives
Stat 101Dr SaMeH1 Statistics (Stat 101) Associate Professor of Environmental Eng. Civil Engineering Department Engineering College Almajma’ah University.
1 ES Chapter 3 ~ Normal Probability Distributions.
Statistics and Probability Theory Lecture 01 Fasih ur Rehman.
1Chapter : Organizing and Displaying Data Introduction: Statistics : Statistics is that area of study which is interested in learning how to collect, organize,
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Descriptive Statistics – Graphic Guidelines Pie charts – qualitative variables, nominal data, eg. ‘religion’ Bar charts – qualitative or quantitative variables,
Lesson #19: Other Graphs— But What Type Are They?.
How to change bad news to good one
ORGANIZING AND GRAPHING DATA
Chapter 12 Statistics 2012 Pearson Education, Inc.
Chapter 12 Statistics.
BUSINESS MATHEMATICS & STATISTICS.
Chapter 2: Methods for Describing Data Sets
ORGANIZING AND GRAPHING DATA
STATISTICS PROJECT Priya Mariam Simon Aparna Rajeev Sudhit Sethi
ORGANIZING AND GRAPHING DATA
STATS DAY First a few review questions.
Chapter 2 Descriptive Statistics: Tabular and Graphical Methods
Frequency Distributions and Graphs
Descriptive and Inferential
Statistics: The Interpretation of Data
Review for Exam 1 Ch 1-5 Ch 1-3 Descriptive Statistics
Probability and Statistics
Chapter Nine: Using Statistics to Answer Questions
Frequency Distribution and Graphs
Presentation transcript:

PROJECT ON STATISTICS Viju Thomas Sridhar Srikanth A Viju Thomas Sridhar Srikanth A

STATISTICS PROJECT REPORT STATISTICS PROJECT REPORT Goal The goal of doing this project is to empower ourselves and to get familiarized with the various statistical techniques used in data analysis. Thereby helping us to do various computations on a given set of data and to reach on various meaningful conclusions. So as to show an understanding in the basic concepts of statistics. In this project we have made an attempt to understand how different cars in the global market produced by various different auto makers vary from each other with respect to their engine capacity, horse power, mileage, transmission etc.

Data collection The manufacturers we considered were BMW VOLVO GENERAL MOTORS- CHEVEROLET MERCEDES NISSAN HONDA SUZUKI TOYOTA HYUNDAI FORD LEXUS

ATTRIBUTES CHOSEN Quantitative Attributes Chosen. Quantitative Attributes Chosen.  Engine Capacity (cc)  Brake Horse power (BHP)  Mileage (kilo meter/liter of fuel)  Top Speed (Kilometer/hour) Qualitative Attributes Chosen. Qualitative Attributes Chosen.  Gear Transmission (Automatic/ Manual/Both)  Segment (Sedan/SUV/MUV)  Fuel Type (Petrol/Diesel/Both )

ORGANIZED DATA Company/ Car Name Engine Capacity Horse Power Mileag e Top Speed Transmissio n Segme nt Fuel Type BMW 1. BMW 3 Series BothSEDANPetrol 2. BMW 5 Series BothSEDANPetrol 3. BMW 7 Series AutomaticSEDANpetrol 4. BMW X5 4.8i AutomaticSUVPetrol Volvo 1. Volvo V50 T AutomaticMUVPetrol 2. Volvo XC BothSUVBoth 3. Volvo S40 T ManualSEDANpetrol 4. Volvo XC AutomaticSUVBoth Chevrolet 1. Aveo ManualSEDANPetrol 2. Aveo U-VA ManualSEDANPetrol 3. Tavera ManualMUVBoth 4. Optra ManualSEDANBoth Mercedes Mercedes Benz E AutomaticSEDANDiesel Mercedes Benz SL Mercedes Benz SL AutomaticSEDANPetrol Mercedes Benz E Automatic WAGO N Petrol Mercedes Benz SLK AutomaticSEDANPetrol

Nissan Nissan Xterra SE AutomaticSUVPetrol Nissan Sentra ManualSUVBoth Nissan Quest AutomaticSEDANBoth Nissan Pathfinder AutomaticMUVBoth Honda Honda Civic Si ManualSEDANPetrol Honda CR-V AutomaticSUVPetrol Honda Element LX ManualSUVDiesel Honda Pilot EX AutomaticMUVPetrol Suzuki Suzuki SX manualSUVPetrol Suzuki XL automaticSUVPetrol Suzuki Aerio manualSEDANPetrol Suzuki Grand Vitara automaticSEDANPetrol

Toyota Toyota Highlander AutomaticSUVPetrol Toyota Camry ManualSEDANDiesel Toyota Corolla ManualSEDANPetrol Toyota Land Cruiser AutomaticSUVPetrol Hyundai Hyundai Accent ManualSEDANBoth Hyundai Elantra ManualSEDANBoth Hyundai Sonata ManualSEDANBoth Hyundai Sante Fe ManualSEDANBoth Ford Ford Fiesta ManualSEDANBoth Ford Mustang AutomaticSEDANBoth Ford Fusion ManualSUVBoth Lexus Lexus IS AutomaticSEDANPetrol Lexus LS AutomaticSUVPetrol Lexus ES AutomaticSEDANPetrol Lexus SC AutomaticSEDANPetrol

DATA ANALYSIS *Classes in 100's MidpointFrequency Cumulative freq < The above given table represents the frequency distribution of Engine Capacity measured in cubic capacity. Here the classes are chosen with class width of 600 units. With the first class starting from 0 to 1200 and going up to 6600 units The frequency distributions of the cars are done in respect to the above taken classes. Frequency Distribution of engine capacity

Measures of Central tendencies Mean Median Mode Standard Deviation Mean = Σfx/Σf, where f is the frequency and x is the midpoint of the class intervals. where: L = lower limit of the interval containing the median I = width of the interval containing the median N = total number of respondents F = cumulative frequency corresponding to the lower limit f = number of cases in the interval containing the median Mode = Lmo +(d1/(d1+d2))*w Where: LmoLower limit of the modal class d1frequency of the modal class minus the frequency of the class directly below it d2frequency of the modal class minus the frequency of the class directly above it wwidth of the modal class interval

Histogram From the histogram we can infer that the maximum number of cars in the data collected belong to the 4 th class i.e. with an engine capacity ranging between 2400 cc to 3000cc

The frequency polygon constructed helps us to sketch the distribution of the engine capacities of the cars much more clearly.

The ogive shown is constructed using the cumulative frequency. Here we are showing a less than ogive curve.If we take a point on the curve and connect it to the x- axis and then to the corresponding point on the y- axis. It helps us to infer the total number of cars that would lie below the corresponding class of engine capacity given in the x-axis.

Representation Of Frequency Distribution Of Qualitative Data Qualitative data if it has to be represented graphically, doing it on a pie- chart is the best way to do it. As this kind of representation clearly gives the reader an idea about what percentage of the data under study belongs to which category. Here in our data set we have taken totally three attributes which are qualitative. Out of which we have chosen the Fuel Type to be represented graphically. Fuel Type Frequency Petrol26 Diesel3 Both14

Probability Distribution of Transmission with respect to the Horse power Class of Horse power AutomaticManualBothtotal total

Find the probability that the selected car has an automatic gear system? Total number of cars with automatic gear system is =22 Total number of cars =43 Therefore, probability that a selected car has a gear system in it is = So there is a % chance that the selected car has an automatic gear system in it. Find the probality that a selected car with a manual gear system has a horse power of 175 bhp. Total number of cars with manual gear system = 18 Cars falling in the class with horse power of 175 bhp = 6 Hence probability that a selected car with a manual gear has a horse power Of 175= % chances are there that a selected car would have a manual gear system with 175 bhp.

Binomial Distribution Binomial Distribution  Success defined as picking a car which has mileage above 13 km/l. From the data set we can find the values of the following.  Success event: p =  Failure event: q = Probability of picking up 6 cars with mileage more than 13 kmpl in 10 trails from the data set.  No of trials: n = 10  Random variable x = 6  Probability of (X = x) = nCx * p x * q (n-x )  Therefore, P(X=6) = We can say that 6.8% of the time the selected random experiment is true.

Normal Distribution Normal Distribution Probability that a randomly selected car from the data set will have a top speed less than 220  Mean of Top speed =  Standard Deviation =38.70  x=220  μ =  σ =38.70  P (x <= 220) = % of the times a randomly selected car from the data will have a top speed less than % of the times a randomly selected car from the data will have a top speed less than 220.

APPLICATION OF CORRELATION

  From the graph it is observable that there is a high degree of positive correlation between the two attributes.   The correlation coefficient was found out to be Which means that as the engine capacity increases the horse power also increases. This conclusion led us to apply the concept of regression in the current aspect.   As a result of which we were able to get the regression equation- Y=13.927X   Here Y represents engine capacity and X represents the horse power.   Using this equation we can predict what the engine capacity will be for a given value of horse power.   Eg:- What will the engine capacity be for a car with an horse power of 600 BHP   Y=13.927X   Here X=600   Therefore Y= *   Hence the engine capacity=Y= cc   In turn the coefficient of determination was found to be R 2 =0.8377

THANK YOU THANK YOU