Classifications and CASCOT Ritva Ellison Institute for Employment Research University of Warwick.

Slides:



Advertisements
Similar presentations
The development of Cascot: Computer Aided Structured Coding Tool
Advertisements

Database Basics. What is Access? Database management system Computer-based equivalent of a manual database Makes it easy to organize and update information.
1 ADVANCED MICROSOFT POWERPOINT Lesson 5 – Using Advanced Text Features Microsoft Office 2003: Advanced.
Copyright © 2008 Pearson Prentice Hall. All rights reserved. 1 1 Committed to Shaping the Next Generation of IT Experts. Chapter 2: Relational Databases.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 2 1 Microsoft Office Access 2003 Tutorial 2 – Creating And Maintaining A.
Creating And Maintaining A Database. 2 Learn the guidelines for designing databases When designing a database, first try to think of all the fields of.
FIRST COURSE Access Tutorial 2 Building a Database and Defining Table Relationships.
CASCOT International version 5 User Guide Peter Elias, Margaret Birch and Ritva Ellison Institute for Employment Research University of Warwick December.
Tutorial 11: Connecting to External Data
Reference Manager Making your life easier! Updated September 2007.
V Slide 1V 1.0Slide 1 FMP/Staff Cost FMP – Staff Cost Account Clerk School Principal FMP Common Setup Budgeting Bookkeeping Staff Cost Capital.
Nextgen Bank Reconciliation Resource Bank Reconciliation Menu Financial Management Bank Reconciliation –Import Bank Statements –Reconcile Bank Accounts.
Multi-language CASCOT Margaret Birch and Ritva Ellison Institute for Employment Research.
COMPREHENSIVE Excel Tutorial 8 Developing an Excel Application.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall 1 1. Chapter 2: Relational Databases and Multi-Table Queries Exploring Microsoft Office.
Scheduling Instructions- Elementary Working in the Mainframe to schedule students for the Esembler Grade Book Program If you wish to view a printable version.
CPSC 203 Introduction to Computers T59 & T64 By Jie (Jeff) Gao.
Copyright © 2008 Pearson Prentice Hall. All rights reserved.1 1 Committed to Shaping the Next Generation of IT Experts. Chapter 2: Relational Databases.
Database Applications – Microsoft Access Lesson 9 Designing Special Queries.
CASCOT for EurOccupations Demonstration of the software English, Dutch, French Manual coding Linking to EurOccupations database Automated coding Specific.
CASCOT AND THE CODING OF OCCUPATIONS IN EUROPEAN SURVEYS Demonstration of CASCOT Presentation for the InGRID Workshop Amsterdam, February 2014 Ritva.
® Microsoft Office 2013 Access Building a Database and Defining Table Relationships.
The Use of Administrative Sources for Statistical Purposes Matching and Integrating Data from Different Sources.
Slideshow Slideshow 1B Creating the Chart of Accounts and Batch Processing in the General Ledger.
Lesson 2.  To help ensure accurate data, rules that check entries against specified values can be applied to a field. A validation rule is applied to.
UNIT 7: Using Excel in the Law Office. This Week’s Assignment You should be working on your three-part assignment Part 1 deals with the things you learned.
1 By: Nour Hilal. Microsoft Access is a database software where data is stored in one or more Tables. A Database is a group of related Tables. Access.
XP New Perspectives on Microsoft Access 2002 Tutorial 21 Microsoft Access Tutorial 2 – Creating And Maintaining A Database.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 2 1 Microsoft Office Access 2003 Tutorial 2 – Creating And Maintaining A.
Slideshow 2 Setting Up Bank Services, Tax Services and Schedule Codes.
Chapter 17 Creating a Database.
Press Esc to Exit ©2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in.
® Microsoft Office 2010 Building a Database and Defining Table Relationships.
Create Lists in Millennium Jenny Schmidt SWITCH Library Consortium.
Page 1 Non-Payroll Cost Transfer Enhancements Last update January 24, 2008 What are the some of the new enhancements of the Non-Payroll Cost Transfer?
The Price Index Processor System PIPS Niall O’Hanlon.
July 2013 Exams Officers Network Meeting – Part 2.
Copyright 2007, Paradigm Publishing Inc. ACCESS 2007 Chapter 3 BACKNEXTEND 3-1 LINKS TO OBJECTIVES Modify a Table – Add, Delete, Move Fields Modify a Table.
CASCOT and its coding rules Presentation for DASISH Workshop Venice, April 2014 Ritva Ellison Institute for Employment Research.
Input Design Lecture 11 1 BTEC HNC Systems Support Castle College 2007/8.
Paolo Valente - UNECE Statistical Division Slide 1 Technology for census data coding, editing and imputation Paolo Valente (UNECE) UNECE Workshop on Census.
XP New Perspectives on Microsoft Access 2002 Tutorial 31 Microsoft Access 2002 Tutorial 3 – Querying a Database.
IN THE NAME OF GOD. Reference Citing Software.
Autoentry and Autocoder Efficiently creating and coding people records from resumes.
CPSC 203 Introduction to Computers T97 By Jie (Jeff) Gao.
PestPac Software. Pay On Commission Commission can be paid on Production, Receipt, or Up-Front. Production: Commission will be paid when work is completed/an.
MSOffice Access Microsoft® Office 2010: Illustrated Introductory 1 Part 1 ® Database & Table.
Midwest alio ® Conference November 13-14, 2013 Fixed Assets Michael Williams.
Basic Navigation in Oracle R12 BY: Muhammad Irfan.
Nextgen Bank Reconciliation Resource Bank Reconciliation Menu Financial Management Bank Reconciliation –Import Bank Statements –Reconcile Bank Accounts.
CASCOT Editor Ritva Ellison Institute for Employment Research University of Warwick.
Excel Tutorial 8 Developing an Excel Application
Access Tutorial 2 Building a Database and Defining Table Relationships
Compatible with the latest browsers; Chrome, Safari, Firefox, Opera and Internet Explorer 9 and above.
Computer Fundamentals
Access Maintaining and Querying a Database
Lesson 3: Epic Appointment Scheduling Referrals
ALEPH Version 22 Beginning Cataloging
Creating and Modifying Queries
NextGen Trustee General Ledger Accounting
Benchmark Series Microsoft Word 2016 Level 2
Coding occupations The new coding process Sue Westerman, Marc Houben.
Lesson 3: Epic Appointment Scheduling Referrals
Lesson 3: Epic Appointment Scheduling Referrals
IBM SCPM PIT Data Download/Upload
COURSE OBJECTIVES Review Case Comparison
Standard Operating Procedure
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
Presentation transcript:

Classifications and CASCOT Ritva Ellison Institute for Employment Research University of Warwick

Coding text to a classification Coding is the process of categorising the range of all possible answers to a pre-defined set of categories. The full set of categories is termed a classification. Examples are: –SOC 2010 (UK Standard Occupational Classification 2010) –ISCO 08 (International Standard Classification of Occupations 2008) –SIC 2007 (UK Standard Industrial Classification 2007) Three parts to a classification: the structure, the index and the classification rules

In total 369 Unit Groups Structure

In total 27,738 index entries Index

Rules

Manual coding procedures Manual methods –code books; –temporary labour; –query resolution systems. No standardised approach, major variations between institutions, companies, etc. in quality of coding. Time-consuming, expensive.

Computer Assisted Structured Coding Tool CASCOT

Development of software CASOC: Pascal/C++ text coding software for DOS 1993 – CASCOT: Java text coding software for any operating system. CASOC was ad hoc development, funded from sales revenue. CASCOT initially funded by ESRC, now funded via sales income.

Occupational coding in practice Quality of coding reflects quality of text available for coding. Need rules which specify how to deal problems such as ambiguous job titles (e.g. engineer, teacher). Need to be aware that machine coding of text can introduce bias. Need to establish ‘trade off’ between accuracy and cost.

Coding with Cascot Cascot will provide: A list of recommendations. Code, title, best matching index entry, and certainty score Certainty Score Approximates the probability that the recommended code is correct. This is represented by a number in the range People never 100% right. Computer can’t be 100% right.

catering manager Press enter, or click ‘Code’ button Type job title

Recommendations Table

Classification Structure

Index Entries

Output

Best recommendation selected automatically Select another by clicking a different line OR: Change selection via structure

Accept the selection

Large scale coding An experienced person can code manually about 100 occupations/hour maintaining a good level of quality What if they need to code a file of 100,000 occupational texts? Use input/output files and automated coding

Using files Instead of typing every job title in, we can read them from a file. Rather than having to copy the code produced by Cascot we can have Cascot save the codes (and other output) to a file.

Example: Using Files Input file (tab delimited).

Choose Output Items Click Edit

Available Items

Example: Using Files Output file (Output items = “Input Record, Code, Title, Score”)

Automated coding Rather than choosing manually the best recommendation every time we can automate the process Automation options –fully manual –semi-automatic, select the certainty level (manual coding when score is below the level) –fully automatic

A fully automated run How good is this? Example: –19,087 unique job titles –Coded fully automatically, sorted descending by certainty score –Selections from output file shown –Wrong codes coloured with light orange

How do I make use of the comparison score? Tests with large datasets give us an indication of the accuracy of the coding done by Cascot. Above 80: coding takes place automatically : reasonably sure that the code shown is the correct one to allocate – but need to look at the alternatives shown with slightly lower comparison scores and check against the additional information you have for coding : there is some ambiguity. Careful consideration of all relevant information is required. 39 and lower: Cascot is struggling to find an appropriate match. More information is necessary before coding can be concluded.

Making use of additional information Click to see input record