Shuang Wu REU-DIMACS, 2010 Mentor: James Abello.  Project description  Our research project Input: time data recorded from the ‘Name That Cluster’ web.

Slides:



Advertisements
Similar presentations
Four girls soccer teams took a random sample of players regarding the number of goals scored per game. The results are below. Use a significance level.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Inference for Regression
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Regression Analysis Notes. What is a simple linear relation? When one variable is associated with another variable in such a way that two numbers completely.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Ch11 Curve Fitting Dr. Deshi Ye
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
Copyright ©2011 Brooks/Cole, Cengage Learning Analysis of Variance Chapter 16 1.
The art and science of measuring people l Reliability l Validity l Operationalizing.
Stat 217 – Day 24 Analysis of Variance Have yesterday’s handout handy.
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
The research process Psych 231: Research Methods in Psychology.
Analysis of Variance. Experimental Design u Investigator controls one or more independent variables –Called treatment variables or factors –Contain two.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
PSY 307 – Statistics for the Behavioral Sciences
Lecture 9: One Way ANOVA Between Subjects
Chapter Topics Types of Regression Models
Two Groups Too Many? Try Analysis of Variance (ANOVA)
Lecture 23 Multiple Regression (Sections )
Chapter 2 Simple Comparative Experiments
Linear Regression Example Data
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Regression and Correlation Methods Judy Zhong Ph.D.
Hypothesis Testing in Linear Regression Analysis
Means Tests Hypothesis Testing Assumptions Testing (Normality)
Statistical Analysis Statistical Analysis
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Anthony Greene1 Correlation The Association Between Variables.
More About Significance Tests
Ms. Khatijahhusna Abd Rani School of Electrical System Engineering Sem II 2014/2015.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
PY 603 – Advanced Statistics II TR 12:30-1:45pm 232 Gordon Palmer Hall Jamie DeCoster.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Multiple Regression BPS chapter 28 © 2006 W.H. Freeman and Company.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.3 Two-Way ANOVA.
Text Exercise 12.2 (a) (b) (c) Construct the completed ANOVA table below. Answer this part by indicating what the f test statistic value is, what the appropriate.
2 sample interval proportions sample Shown with two examples.
PCB 3043L - General Ecology Data Analysis.
Inferential Statistics. Explore relationships between variables Test hypotheses –Research hypothesis: a statement of the relationship between variables.
Experimental Statistics - week 3
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
Term Project Math 1040-SU13-Intro to Stats SLCC McGrade-Group 4.
Analysis of Variance ANOVA - method used to test the equality of three or more population means Null Hypothesis - H 0 : μ 1 = μ 2 = μ 3 = μ k Alternative.
MATH 1040 – FINAL PROJECT. Research Question  For SLCC (Utah) Summer 2013 students, is the number of concurrent class credits a student takes in high.
Shuang Wu REU-DIMACS, 2010 Mentor: James Abello. Project description Our research project Input: time data recorded from the ‘Name That Cluster’ web page.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Chapter 13 Simple Linear Regression
Lecture Slides Elementary Statistics Twelfth Edition
CHAPTER 29: Multiple Regression*
PENGOLAHAN DAN PENYAJIAN
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Shuang Wu REU-DIMACS, 2010 Mentor: James Abello

 Project description  Our research project Input: time data recorded from the ‘Name That Cluster’ web page. Output: statistic results of participants’ different behaviors when using three interfaces.  Collected Statistics  Conclusions

 For a pre-computed collection of search engine queries, users select for each query one out of three interfaces: Textual, Graphical and Hybrid.  The evaluation process consists of exploring clusters associated with each query, naming the correspond clusters, selecting Clusters Ratings. (ClusterFitRatings and Cluster Name Ratings).  The ClusterFitRatings are on a scale from -1 to 4 and Name Ratings are on a scale from -1 to 2. Note: -1 means that participants didn’t give a rating.  The collected statistics are: Exploration Times, Naming Times, Cluster Rating Times, Name Rating Times, ClusterFitRatings, and Name Ratings.

The raw data collected online is:  Userid  QueryString: the evaluated query  ClusterNum: the evaluated cluster in that query  Name: name/description/summary given to the cluster  Timestamp: server data/time at which the evaluation was written on the data base

 440 clusters were evaluated in the Textual interface, and 338 clusters were evaluated in the Graphical interface, another 378 clusters were evaluated in the Hybrid interface.  We used the Exploration Time, and the Evaluation Time = sum of Naming time, ClusterFitRating time and Name Rating time in the following analysis.  Notation: Ex(T),Ex(G),Ex(H) denote Exploration time per interface; T,G,H denote Evaluation time per interface; NT(T), NT(G), NT(H) denote Naming time per interface.

Dealt with the outliers Note: We treated data with 3.5 standard deviations from mean as outliers.

TWO SAMPLE T-TEST ANOVA F-TEST Test for the difference in means of two samples. Null Hypothesis: there is no difference in two means. vs. Alternative Hypothesis: a mean of the first sample is larger/smaller than a mean of the second sample. Reject a Null Hypothesis if P-value is less than.05. Test for the difference in means for three or more samples. Null Hypothesis: all means are equal. vs. Alternative Hypothesis: at least one of the means are different. Reject a Null Hypothesis if P-value is less than.05.

Statistics Results After a series of T-tests and ANOVA F-tests we got the following results.  Exploration time: There is no difference on the average of Exploration times per interface.  Name time: The Textual interface has the larger Naming time mean.  Evaluation time: The Graphical interface has a larger mean of Evaluation time than the Textual and Hybrid interfaces.

We wanted to see if there was a relationship between ClusterFitRatings and Evaluation times or Naming times for the cluster collection.

We also wanted to see if there was a relationship between Name Ratings and Evaluation times or Naming times for the cluster collection.

Statistics Results According to the results from the four pages and a regression test: test for the linear relationship between a response variable and a explanatory variable, we got the following observations.  When participants gave ClusterFitRating=4, they had the shorter mean of Evaluation time and Naming time than the other Cluster FirRatings in all interfaces.  Users either had the shorter mean of Evaluation time and Naming time when they gave a Name Rating=-1 or 2 than when they gave other ratings or there was no significant time difference in all interfaces.  There are linear correlations between ClusterFitRatings and Name Ratings in all interfaces.

We wanted to see if there was a per query variation of task time among the three interfaces. In order to do this, we grouped the queries that were evaluated with the three interfaces by different users. There was 16 queries that were evaluated by different users with the 3 interfaces. For each such query we tested for the difference.

Statistics Results According to the results from the last two pages, we got the following observations for these 16 queries user triples.  The Textual interface has the shorter mean of Evaluation time and Exploration time than the Graphical and Hybrid interfaces.  The difference in the average of Evaluation time and Exploration time between the Textual and the Graphical interfaces is larger than the one between the Textual and the Hybrid and the Graphical and the Hybrid.

To see if there is an interface with the shortest Exploration time and Evaluation time for these 16 qualified queries. We found the minimum number of triples over all such queries, in order to best deal with the leftovers (a remaining data after grouping in triples). After a consideration of the number of triples per query and the outliers of these data, we set this minimum number as five.

This is a part of table with five randomly selected triples from each query.

Statistics Results After a series of T-tests, ANOVA F-tests and regression tests, we got the following results for this tripled set of 16 queries  There is no difference in the means of Exploration times and Evaluation times for each interface.  There exist linear correlations between Exploration times and Evaluation times in the Graphical and the Hybrid interface, but not in the Textual interface.

 The Textual interface has the larger mean of Naming time.  The Graphical interface has the larger mean of Evaluation time.  Participants give the highest ClusterFitRating have the shorter mean of Evaluation time and Naming time in all interfaces.  There exists a linear correlation between ClusterFitRatings and Name Ratings in all interface; and a linear correlation between Exploration times and Evaluation times in the Graphical and Hybrid interfaces.

 Name Than Cluster online survey,  J. Abello, J, Schulz, H, Gaudin, B, and Tominski, C (2007). Name That Cluster - Text vs. Graphics, IEEE InfoVis Conference, Sacramento, November  Ramsey, Fred L, The statistical sleuth : a course in methods of data analysis, Duxbury/Thomson Learning, 2002

THE END