Comments on Analysis submitted by Chairman-CBSE Debasis Sengupta Indian Statistical Institute April 14, 2013.

Slides:



Advertisements
Similar presentations
Comparison of two Normalization Procedures and implications for students.
Advertisements

Normalization of Board Marks for Admission to Engineering Courses Findings and recommendations of the Joshi Committee.
Sampling Distributions
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 16 Mathematics of Normal Distributions 16.1Approximately Normal.
Averages and Variation
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 10: Hypothesis Testing
Estimates and sample sizes Chapter 6 Prof. Felix Apfaltrer Office:N763 Phone: Office hours: Tue, Thu 10am-11:30.
7-3 Estimating a Population Mean
BPS - 5th Ed. Chapter 31 The Normal Distributions.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Measures of Central Tendency
Comparison of two Normalization Procedures Results of some statistical analysis.
A P STATISTICS LESSON 9 – 1 ( DAY 1 ) SAMPLING DISTRIBUTIONS.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Basic Statistics Standard Scores and the Normal Distribution.
Introduction to Statistics Measures of Central Tendency.
Statistics Used In Special Education
Data Collection & Processing Hand Grip Strength P textbook.
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © Cengage Learning. All rights reserved. Averages and Variation 3.
16-1 Copyright  2010 McGraw-Hill Australia Pty Ltd PowerPoint slides to accompany Croucher, Introductory Mathematics and Statistics, 5e Chapter 16 The.
Chapter 7 Estimates and Sample Sizes
IIT BOMBAYIDP in Educational Technology * Paper Planning Template Resource – Paper-Planning-Template(SPT)Version 1.0, Dec 2013 Download from:
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
A study on Normalization of scores from various School Boards (After NTS Meeting of 26 th August, 2011 at IIT, Delhi) Indian Statistical Institute(ISI),
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
QUANTITATIVE RESEARCH AND BASIC STATISTICS. TODAYS AGENDA Progress, challenges and support needed Response to TAP Check-in, Warm-up responses and TAP.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
6.1 Inference for a Single Proportion  Statistical confidence  Confidence intervals  How confidence intervals behave.
Estimating a Population Mean:  Known
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 11.1 Estimating a Population Mean.
Introduction to Statistics Measures of Central Tendency and Dispersion.
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
Chapter 13 Sampling distributions
Normal Distributions (aka Bell Curves, Gaussians) Spring 2010.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
The Normal Approximation for Data. History The normal curve was discovered by Abraham de Moivre around Around 1870, the Belgian mathematician Adolph.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
The normal approximation for probability histograms.
The Law of Averages. What does the law of average say? We know that, from the definition of probability, in the long run the frequency of some event will.
Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Virtual University of Pakistan
Statistical analysis.
Lecture Slides Elementary Statistics Twelfth Edition
Descriptive Statistics
Statistical analysis.
5.3 The Central Limit Theorem
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Distribution of the Sample Means
Simulation-Based Approach for Comparing Two Means
8.1 Sampling Distributions
Overview and Basics of Hypothesis Testing
5.3 The Central Limit Theorem
Introduction to Statistics
Statistics: The Interpretation of Data
5.3 The Central Limit Theorem
MICE, MUSIC, & MAZES. MICE, MUSIC, & MAZES We have been running mice through mazes since we figured out they both started with M, which was quite a.
COMMON MISTAKES!.
Presentation transcript:

Comments on Analysis submitted by Chairman-CBSE Debasis Sengupta Indian Statistical Institute April 14, 2013

General remarks It is good to see some concrete analysis. Summary of findings was missing. I have summarized as I have understood. I do not have access to the 516,416 sets of matched data (about 48.6% of AIEEE 2012 candidates). I have done additional analysis from what I could get from the presentation. I used same definitions and notations. No contradiction with findings of Normalisation Committee.

A major issue: How the two procedures alter current scenario The NIT’s are concerned with top 3% of rank list. Chairman-CBSE has computed, for each state, as a percentage. ‘%age under procedure 1’ minus ‘%age under current scenario’ is called P1 – P0. P2 – P0 is similarly defined. Comparative plot is reproduced here from page 6 of circulated presentation.

Plot of P1–P0 and P2–P0 for 25 boards Main conclusion: Maximum deviation is only about 2%. No problem (on this aspect) with either procedure. Larger deviations for P1 – P0 is found in other percentile ranges, which do not matter to NIT’s. Board scores often have less than 0.5 correlation with AIEEE scores (see page 21 of report). A procedure giving 40% weight on board scores is expected to produce some changes. This fact was known to the normalisation committee from earlier analysis of over 2 lakh matched data (subset of present data), and included in the report (pages 22-23, F12-F21).

Another percentage Consider, for each state expressed as a percentage. This gives the composition of the top 5% students. This is a matter or direct relevance to NIT’s. P1 – P0 and P2 – P0 can be defined as before. Comparative plot is shown next.

Change in representation of boards in top 5%, for the two procedures Main conclusion: CBSE gains from Procedure 2; AP board and CBSE lose from Procedure 1. Under Procedure 1, – AP board share declines from 20.5% to 17.5%; CBSE share declines from 61.7% to 59.4%. – Both boards have above-par representation even after decline, as 10.8% of candidates in matched data set are from AP board; 39.6% from CBSE. Under Procedure 2, rise of CBSE share is from 61.7% to 65.7%, which is further above-par.

Tip of the iceberg This finding (increase in CBSE’s already large share among top 5%) is not new. Same phenomenon was observed from simulation study based on the 2 lakh matched cases available earlier. The present finding (though based on more data) is not necessarily the ultimate one, as the matched cases have different composition than the ‘set of all candidates’. In AIEEE 2012, 30% students were from CBSE; in matched part of data, 40% were from CBSE. Simulations indicate more dramatic increase (through Procedure 2) in CBSE’s share in top 1%, 0.1% etc.

Simulation conditions 10% Random samples chosen from AIEEE 2012 scores data. Presumed correlation between AIEEE and board scores: – Observed correlations for AS, CBSE, JK, MR, MZ, UK (available till Feb’13) – 0.4 for other boards (observed value for most boards). 100 separate simulation runs.

Actual performance of CBSE students in AIEEE % of all AIEEE 2012 candidates were CBSE students. However, this groups constitutes – 66 of the top 100 AIEEE rank holders, – 608 of the top 1,000 AIEEE rank holders, – 5,977 of the top 10,000 AIEEE rank holders, – 54,430 of the top 100,000 AIEEE rank holders. Slide 12 of Chairman-CBSE reports the corresponding values for the matched data set, which show similar trend. (Figure in that slide is correct, the numbers listed below are swapped.)

Share of boards in different percentile ranges of AIEEE 2012

Rest of the iceberg: How Procedure 1 or 2 would change the shares Procedure 1 reduces marginally the dominance of CBSE students in merit list, Procedure 2 increases drastically that dominance.

…and the numbers behind the charts Procedure 1: Share of boards in different percentile ranges of AIEEE 2012 Board CBSE15%18%21%24%26%29%33%38%44%53% MR17%16%14%13%12%11%10%8%7%5% AP7%8% 9% 10%13% BR10%9% 8% 7%6% 5%3% All other51%49%48%47%46%44%42%39%35%26% Procedure 2: Share of boards in different percentile ranges of AIEEE 2012 Board CBSE11%14%17%21%25%29%35%42%48%59% MR20%18%16%14%12%11%9%7%5%4% AP6% 7%8% 9% 10%11%13% BR11%10% 9%8%7%6%5%4%2% All other53%51%50%49%47%44%41%36%31%22% Current scenario: share of boards in different percentile ranges of AIEEE 2012 Board Overall CBSE14%17%20%22%26%28%33%39%46%54%30% MR17% 15%14%12%11%9%8%6%5%11% AP7%8% 9% 10%13%9% BR10%9% 8% 7%6%5%4%3%7% All other51%49%48% 46%45%42%38%33%25%43%

Findings from simulation Present scenarioProcedure 1Procedure 2 Top 0.01%65.1%61.9%82.7% Top 0.1%60.5%60.4%80.1% Top 1%60.0% 69.6% Top 10%54.4%53.4%58.8% Procedure 1 produces a marginal reduction in CBSE’s share at the top; 2% reduction at the top 0.01%. Procedure 2 brings a great windfall to CBSE students. CBSE students gain more and more from Procedure 2 as one goes up the merit list – share at top 1% increases by 10%, – share at top 0.1% increases by 20%. Table given in Slide 16 of Chairman-CBSE is in line with this finding. (Slide 15 erroneously shows CBSE’s share in top 0.01% under Procedure 1 as 88.2%. This goes against common sense and also against the trend of the adjacent column.)

How does Procedure 2 dramatically increase CBSE’s share of top-rankers? It does so by practically turning the board score into another AIEEE score. The composite score for each board has same mean as AIEEE scores from that board, but has much smaller spread. The set of scores from each board get more concentrated around their respective means. It is as if every student gets a second chance to perform at the same mean level. As a result, top CBSE students gain conspicuously.

Why does Procedure 1 have only a mild effect on CBSE’s share of top-rankers? For CBSE, the students who do well in AIEEE are generally those who do well in board exam. – This may be guessed from the higher rank correlation (see page 21 of report) of CBSE and AIEEE scores. – They mostly hold on to their position in merit list. For boards with less correlation with AIEEE, this is not the case. – Many AIEEE top-rankers from other boards move towards middle in terms of composite score. – If their board exams had been better aligned with AIEEE, they would have gained more from Procedure 1.

Will smaller boards gain unfairly from Procedure 1? Consider the percentage of candidates from a board making it to top 5%. – Plot in Slide 6 and Table in Slide 8 of Chairman-CBSE shows that the maximum change effected by Procedure 1 is 2%. – Table in slide 7 of Chairman-CBSE shows that these percentages are below 5% (par- score) for all small boards under Procedure 1. (In fact, all boards except AP, CBSE, CISC and KK have below par representation.) – If boards with less than 1000 matched samples are regarded as ‘small’ and the rest are ‘large’, the difference in mean changes in the two groups are statistically insignificant at the 5% level. Consider the percentage of the top 5% candidates coming from a board. – Slide 5 of this presentation shows that the changes effected by Procedure 1 are well below 2% for all boards except AP and CBSE. – ‘Small’ and ‘large’ boards have statistically insignificant difference of mean change. Answer to the question is NO.

Fate of students with common scores In 2012, five students (4 from CBSE, 1 from Maharashtra board) were tied at – AIEEE aggregate marks of 130, – board percentile of According to Procedure 1, they shared rank 20,352. According to Procedure 2 – CBSE students had shared rank 18,077, – Maharashtra board students had rank 34,175. The differential treatment is entirely due to relatively poor performance of other students of Maharashtra board in AIEEE. This goes against the principle of fairness. Chairman-CBSE has tried to give a counter-example (Slide 11), involving 19 matched scores tied exactly as above. It is claimed that Procedure 2 would preserve the equivalence but Procedure 1 would not. This is impossible, as equal percentiles are treated equally by Procedure 1.

Assumptions Chairman-CBSE mentions twice in his message: Procedure 1 assumes same merit distribution across boards, Procedure 2 does not. This is a half truth. The other half is: Procedure 2 assumes that the disparity of merit distributions across boards can be precisely measured (and hence adjusted for) by the AIEEE/JEE-Main score. In reality, AIEEE/JEE-Main performance disparity across boards can result from – Difference in merit/ability distributions, – Difference of board examination pattern with AIEEE (rank correlations indicate this), – Lack of instruction in English and Hindi (only available languages for AIEEE/JEE-Main), – Less access to coaching, – Load of an extra subject in board (for some boards). Adjusting only for one factor is impossible – in presence of confounding factors. Disparity may worsen after adjustment. AIEEE performance patterns are VERY disparate. If that is used to calibrate, 80% of MH board would be deemed equivalent to 50% of CBSE. The implied assumption of LARGE merit disparity among large boards is risky.

How students with same AIEEE score move under two procedures Slide 10 of Chairman-CBSE shows the movements of ranks of candidates tied at AIEEE aggregate marks 130 (rank 27493). Claim : Procedure 1 shows greater movement of rank than Procedure 2. Fact : Barring stray cases, the range of movements differ only minimally. In Slide 10, a large movement of rank of a UP candidate through Procedure 1 has been marked with an arrow. Greater movement can be seen in the case of some CBSE candidates for Procedure 2. The other marked case is not too far. The table given in Slide 13 gives an overall picture of rank movements. However, it is hard to draw conclusions from a large table. The normalization committee had looked at scatter plots of the ranks for a pictorial overview (Figure 3.5, page 23). The scatter for Procedure 2 was found to be marginally less dispersed. As explained in the report, this may be attributed to the board-specific distortions inflicted on board percentiles by Procedure 2, which bring them closer to AIEEE score. The low rank correlation between AIEEE percentiles (restricted to boards) and board percentiles, mentioned in page 21 of the report, imply that large movements in some cases are normal – if board percentiles are indeed given 40% importance.

In summary… Barring some errors (e.g., in slides 11, 12 and 15), the computations reported by Chairman- CBSE make sense. These computations corroborate the findings of the normalisation committee led by Prof. S.K. Joshi.