Browser Evaluation Test …A Trial Run Pierre Wellner & Mike Flynn, IDIAP Fribourg Nov 26, 2004 Mike Flynn, Pierre Wellner IDIAP Simon Tucker, Steve Whittaker.

Slides:



Advertisements
Similar presentations
MIDYIS – A BRIEF INTRODUCTION
Advertisements

Please close your laptops
Second Quarterly Assessment 8th Grade Science
7 th Grade Quarterly Assessment TWO. In any physical or chemical process, what two quantities are always conserved? A. matter and total energy B. light.
Measurement: errors, accuracy, and precision
5.1 Accumulated Changes Example 1: An objects travels with a velocity of 15 mph. What is the distance traveled after 4 hours t v Distance = area.
1 The Model of Production Possibilities This is a basic model designed to highlight the impact of scarcity on an economic system.
Today’s quiz on 8.2 B Graphing Worksheet 2 will be given at the end of class. You will have 12 minutes to complete this quiz, which will consist of one.
© English Language Testing Ltd Taking the Password Skills Test.
Simon Tucker NLP Presentation Efficient user-centred access to multimedia meeting content Simon Tucker and Steve Whittaker University.
USING TI-84 How to use the calculator to find the equation of a line. Lesson Six.
Dyer Junior High School
The Paired-Samples t Test Chapter 10. Paired-Samples t Test >Two sample means and a within-groups design >The major difference in the paired- samples.
1 Work Sampling Can provide information about men and machines in less time and lower cost. It has three main uses: 1.Activity and delay sampling To measure.
Inference for Regression BPS chapter 24 © 2006 W.H. Freeman and Company.
Chapter 7 Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements.
Chapter 1 Science & Technology. Science: (and technology)  has help societies throughout history to advance and even helped many thrive above other cultures.
The Browser Evaluation Test A Proposal Pierre Wellner, Mike Flynn IDIAP, September 2003.
Temporal Compression Of Speech: An Evaluation IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 4, MAY 2008 Simon Tucker and Steve.
A problem is a doubtful or difficult question; a matter of inquiry, discussion, or thought; a question that exercises the mind (Oxford English Dictionary)
5.8 Graphing Absolute Value Functions
Experiment Type your project title here Your name.
QUESTION Your name Your teacher’s name Your school.
Journal Question 05 Nov 2012 Answer the following three questions on the index card: Put your name and hour on top. 1.A force is a __________ or a ____________.
SCIENTIFIC METHOD. A researcher must follow scientific method for research to be considered valid. The following slides will discuss the procedure for.
Outline Lecture 6 1. Two kinds of random variables a. Discrete random variables b. Continuous random variables 2. Symmetric distributions 3. Normal distributions.
Scientific Method A blueprint for experiment success.
© 2010 Pearson Prentice Hall. All rights reserved 7-1.
1.Use two different brands of chocolate and break into squares. Put one type on a plate labelled A. The other on a plate labelled B. 2.The students work.
40 Minutes Left.
1 Lecture 6 Outline 1. Two kinds of random variables a. Discrete random variables b. Continuous random variables 2. Symmetric distributions 3. Normal distributions.
PROJECT TITLE Your name | Your teacher’s name | John W. Dodd Middle School.
WHITEBOARD PRACTICE FINDING THE MISSING ANGLE IN AN ANGLE PAIR.
Ferret A New Meeting Browser Mike Flynn, Maël Guillemot, Pierre Wellner IDIAP, January 2004.
Machine Learning in Practice Lecture 9 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
© English Language Testing Ltd Taking the Password Skills Test.
Science Fair Project Type your project title in place of the title Your name(s) Your teacher’s name(s) Your school.
Chapter 2:.  Come up to board and write the number of different types of social media YOU have used TODAY; write anywhere; no need to organize in any.
Answer these quick fire questions on your mini whiteboards.
© English Language Testing Ltd Taking the Password Skills Test.
Question Catch The teacher throws a bean bag when asking questions. This makes questioning a kinaesthetic activity and can engage pupils who don’t normally.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Data measurement, probability and Spearman’s Rho
Continuous Distributions
The SAT vs. ACT Scholastic Aptitude Test American College Testing
DISPLAYING AND ANALYZING DATA
Frequency Tables Histograms
In this section you will:
Simon Tucker and Steve Whittaker University of Sheffield
Steve Whittaker, Rachel Laban and Simon Tucker University of Sheffield
Project IDEA Name Name of the speaker Entity.
Science Fair Project Title of Project Student Name(s) Teacher
A blueprint for experiment success.
A blueprint for experiment success.
Steps of the Scientific Method
6. Records the stimuli being manipulated using a 10-s MTS method for 5 minutes The data sheet on this slide is an example of how you will be recording.
The velocity is constant and the distance is:
A blueprint for experiment success.
A __________ for experiment success.
A blueprint for experiment success.
Using communication symbols
Graphing Calculator Lesson
A blueprint for experiment success.
Section 3.6A Calculus AP/Dual, Revised ©2018
2.4 Rates of Change & Tangent Lines
Type your project title here Your name Your teacher’s name Your school
6-3 and 6-4 Quiz Review Pages , #24, 30, 32, 38, 39, 42, 45–46
The velocity is constant and the distance is:
Type your project title here Your name Your teacher’s name Your school
A blueprint for experiment success.
Presentation transcript:

Browser Evaluation Test …A Trial Run Pierre Wellner & Mike Flynn, IDIAP Fribourg Nov 26, 2004 Mike Flynn, Pierre Wellner IDIAP Simon Tucker, Steve Whittaker University of Sheffield

Outline Reminder of BET Trial Run Results Analysis Future work

Reminder What is a Browser for? “Browsing a meeting recording is an attempt to find a maximum number of observations of interest in a minimum amount of time.” “Observations of Interest” –Pairs of complementary statements about the meeting –Of interest to… the participants, or to people who missed the meeting. Observers –Unlimited access –No time limit actually 4½ x meeting time (on average) Subjects –Answer as many Questions as possible –Time limit: ½ meeting time –Questions are observation pairs, without indication

The BET Process

Trial Run: Observers Needed native English speakers –University of Sheffield –Students, researchers, lecturers Meetings1 x 44 minutes Observers6 Observations294 (only 255 used)

Observer’s Screen Shot

Observations… about the observations Examples: Agnes thinks having the sofa along the whiteboard is a good idea. Agnes thinks the sofa will be in the way if under the whiteboard. Martin wants to put the coffee machine along the left wall. Martin wants to put the coffee machine along the right wall. Mainly about what was said, not done Participants names all in top ten words –Others: the, of, to, at, is, that 283/294 (83%) use participant by name Observation density…

Observation Density Graph

Trial Run: Subjects 11f + 13m = 24 total University of Sheffield Three conditions: “Guess”- no media whatsoever “Base”- same media as Observers “F 1 ”- Ferret with Brno ASR transcript + slides + speaker segmentations

Guess Condition Screen Shot

Base Condition Screen Shot

F1 Condition Screen Shot

Results: Guess Condition SubjectAnswersCorrectIncorrectScore A % A % A % Total %

Results: Base Condition SubjectAnswersCorrectIncorrectScore B % B % B % B % B552340% B631233% B % B854180% B983537% B % B % Base Total %

Results: F 1 Condition SubjectAnswersCorrectIncorrectScore C % C263350% C % C % C % C % C % C % C % C % F 1 Total %

Details Scores by time Media time-difference Speed versus accuracy

Results by time, overlaid Scores by Time

Media time difference histogram Proximity of Answers to Questions

Speed versus Accuracy graph Speed versus Accuracy

BET scores ConditionSpeedAccuracy Guess % Base % F %

Future work AMI recording 100 hour corpus More observations More subjects –reduce confidence interval (~18% wide) Design, test & compare browser improvements