CAPTCHA What humans can do, But computers can not.

Slides:



Advertisements
Similar presentations
Don’t Type it! OCR it! How to use an online OCR..
Advertisements

17 HTML, Scripting, and Interactivity Section 17.1 Add an audio file using HTML Create a form using HTML Add text boxes using HTML Add radio buttons and.
Collecting data Chapter 6. What is data? Data is raw facts and figures. In order to process data it has to be collected. The method of collecting data.
COMPUTER MALWARE FINAL PROJECT PROPOSAL THE WAR AGAINST CAPTCHA WITH IMPLEMENTATION OF THE WORLDS MOST ACCURATE CAPTCHA BREAKER By Huy Truong & Kathleen.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Review of AI from Chapter 3. Journal May 13  What advantages and disadvantages do you see with using Expert Systems in real world applications like business,
CAPTCHA: Using Hard AI Problems for Security 12 Jun 2007 Ohad Barak (a.k.a. jo) Luis Von Ahn, EuroCrypt 2003.
CAPTCHA Completely Automated Public Turing test to tell Computers and Humans Apart A Computer Program that can generate and grade test that: Most Humans.
CAPTCHA Presented by: Sari Louis SPAM Group: Marc Gagnon, Sari Louis, Steve White University of Illinois Spring 2006.
Input to the Computer * Input * Keyboard * Pointing Devices
Breaking an Animated CAPTCHA Scheme
Extraction of text data and hyperlink structure from scanned images of mathematical journals Ann Arbor, March 19, 2002 Masakazu Suzuki (Kyushu University)
Information Retrieval in Practice
CAPTCHA Presented By Sayani Chandra (Roll )
Telling Humans and Computers Apart (Automatically) Or How Lazy Cryptographers do AI Luis von Ahn The Aladdin Center Carnegie Mellon University.
CAPTCHA Prabhakar Verma “08MC30”.
CAPTCHA & THE ESP GAME SHAH JAYESH CS575SPRING 2008.
Computer Vision Group University of California Berkeley Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA Greg Mori and Jitendra Malik.
Human Computation CSC4170 Web Intelligence and Social Computing Tutorial 7 Tutor: Tom Chao Zhou
S OFTWARE AND M ULTIMEDIA Chapter 6 Created by S. Cox.
Glencoe Digital Communication Tools Create a Web Page with HTML Chapter Contents Lesson 4.1Lesson 4.1 Get Started with HTML (85) Lesson 4.2Lesson 4.2 Format.
Overview of Search Engines
Using Multimedia on the Web
ITM352 PHP and Dynamic Web Pages: Server Side Processing.
IOTA Improved Design and Implementation of a Modular and Extensible Website Framework Andrew Hamilton – TJHSST Computer Systems Lab Abstract.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
Matthias Neubauer CAPTCHA What humans can do, But computers can not.
CAPTCHA 1 Are you Human? (Sorry, I had to ask). CAPTCHA 2 Agenda What is CAPTCHA? Types of CAPTCHA Where to use CAPTCHAs? Guidelines when making a CAPTCHA.
intelligence study and design of intelligent agentsis the intelligence of machines and the branch of computer science that aims to create it. AI textbooks.
Part 2  Access Control 1 CAPTCHA Part 2  Access Control 2 Turing Test Proposed by Alan Turing in 1950 Human asks questions to another human and a computer,
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
Analyzing CAPTCHAs May 1, 2009 Kyle Anderson Michelle Krause Matthew Turner.
Exploration Seminar 3 Human Computation Roy McElmurry.
Section 17.1 Add an audio file using HTML Create a form using HTML Add text boxes using HTML Add radio buttons and check boxes using HTML Add a pull-down.
Files, File Format Folders Paths, URL Absolute Addresses Relative Addresses © Ms. Masihi.
Introduction to Interactive Media Interactive Media Components: Text.
IMAGINATION: A Robust Image-based CAPTCHA Generation System Ritendra Datta, Jia Li, and James Z. Wang The Pennsylvania State University – University Park.
Preventing Automated Use of STMP Reservation System Using CAPTCHA.
Web Design (1) Terminology. Coding ‘languages’ (1) HTML - Hypertext Markup Language - describes the content of a web page CSS - Cascading Style Sheets.
How the Web Works Building a Website – Lesson 1. How People Access the Web Browsers People access websites using software called a web browser. To view.
Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.
Designing Human Friendly Human Interaction Proofs (HIPs) Kumar Chellapilla, Kevin Larson, Patrice Simard and Mary Czerwinski Microsoft Research Presented.
COMP135/COMP535 Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 2 Lecture 2 – Digital Representations.
Application Layer Attack. DDoS DDoS – Distributed Denial of Service Why would any one want to do this? In some cases, for bringing down service of competitors,
JavaScript 101 Introduction to Programming. Topics What is programming? The common elements found in most programming languages Introduction to JavaScript.
1 Lesson 14 Sharing Documents Computer Literacy BASICS: A Comprehensive Guide to IC 3, 4 th Edition Morrison / Wells.
CAP Malware and Software Vulnerability Analysis Term Project Proposal - Spring 2009 Professor: Dr. Zou Team members: Andrew Mantel & Peter Matthews.
By: Steven Baker.  What is a CAPTCHA?  History of CAPTCHA  Applications of CAPTCHAs  Accessibility  Examples of CAPTCHAs  reCAPTCHA  Vulnerabilities.
Separating man from machine since 2000….. ?. Agenda  Definition  History  Need  Types  Constructing CAPTCHAs  Breaking CAPTCHAs  Applications 
Text2PTO: Modernizing Patent Application Filing A Proposal for Submitting Text Applications to the USPTO.
Usability of CAPTCHAs Or usability issues in CAPTCHA design Authors: Jeff Yan and Ahmad Salah El Ahmad Presented By: Kim Giglia CSC /19/2008.
Windows Vista Configuration MCTS : Internet Explorer 7.0.
XP Creating Web Pages with Microsoft Office
Your Interactive Guide to the Digital World Discovering Computers 2012 Chapter 13 Computer Programs and Programming Languages.
SUBMITTED TO:-SUBMITTED BY:- Ms.Kavita KhannaShruty Ahuja H.O.D(CSE DEPARTMENT)02/MT/10 PDM,BAHADURGARHCE(2 ND SEM)
Billy Vivian Dr. Oblitey COSC  What is CAPTCHA?  History  Uses  Artificial Intelligence Relationship  reCAPTCHA  Works Cited.
CAPTCHA Presented by: Md.R ahim 08B21A Agenda Definition Background Motivation Applications Types of CAPTCHAs Breaking CAPTCHAs Proposed Approach.
مباني امنيت شبكه CAPTCHA)) به نام خدا مدرس: شهرزاد گلستانی Website:
SANDEEP MEHTA (ECE, IV Year). CAPTCHA Completely Automated Public Turing test to tell Computers and Humans Apart Invented at CMU by Luis von Ahn, Manuel.
THE ESP GAME, AND OTHER STUFF
Internet Econ: Google/Facebook POV
3.6 Fundamentals of cyber security
Section 17.1 Section 17.2 Add an audio file using HTML
Are you Human?.
Breaking Visual CAPTCHAs with Naïve Pattern Recognition Algorithms
Web Programming Week 11 Old Dominion University
A novel probabilistic language-based CAPTCHA system
Analyzing CAPTCHAs.
Fighting the WebBots A webbot is a program that visits web sites for all kinds of purposes. For example, Google webbots make copies of all web sites for.
Presented By Vibhute J.B. Class : M.Sc. (CS)
Presentation transcript:

CAPTCHA What humans can do, But computers can not.

CAPTCHA, the Acronym Completely Automated Public Turing Test to Tell Computers and Humans Apart

Completely--- Whole Automated--- made by machine Public--- universally known also easy for hackers to break it Turing Test to Tell--- test presented by Alan Turing Computers and Humans Apart CAPTCHA – literal meaning

CAPTCHA Origins 1997: Andrei Broder at AltaVista wanted to prevent bots from automatically submitting sites for indexing He decided to add a test to the submission page He reversed Brother scanner OCR optimization techniques 2000: Luis von Ahn, Manuel Blum & John Langford at CMU coined term CAPTCHA

CAPTCHA: Deciding Human or Bot? A puzzle or problem that is easy for humans to solve and very difficult for computers If the puzzle is solved correctly, you are considered human and can continue

Basic two types Printed CAPTCHA H-CAPTCHA

Printed CAPTCHA  Printed CAPTCHA is difficult to break  Lots of algorithms are available to generate these  Humans cannot identify these very easily  Two major types are there viz. Baffle text,Pessimal print.

Baffle Text image Developed by Monica Chew and Henry Baird Uses pronounceable English characters with masking that are not present in English dictionary

Pessimal Print Image Developed by Allison Coates and Henry Baird and Richard Fateman Uses the degradation model simulating physical defects caused by printing and scanning of printed text

Handwritten CAPTCHA less frequently used because human can easily identify the handwriting rather than text images Use of transformations by adding lines,arcs,circles etc.

Example showing H-CAPTCHA

Types of Printed CAPTCHA GIMPY BONGO PIX KittenAuth Face Recognition Audio Logic Puzzles

GIMPY Randomly chooses 7 words from a dictionary Distorts the words using a variety of techniques Human must correctly type 3 of the words to pass the test In the real world, most applications only test for a single word (EZ-Gimpy)

GIMPY Examples EZ-GYMPY R-GIMPY

BONGO A visual recognition problem Two sets of shapes with a distinguishing characteristic Must choose which set the shape belongs to

PIX A database of labeled images of recognizable objects Randomly chooses an object and displays N pictures of it Must correctly identify the object Pictures are distorted

KittenAuth “The Cutest Human Test” A 3x3 matrix of cute animals Choose the 3 kittens Strategy is to use animals that look similar to kittens

Face Recognition CAPTCHA

Audio CAPTCHA Pick a word or a sequence of numbers at random Render them into an audio clip using a TTS software Distort the audio clip Ask the user to identify and type the word or numbers

Logic Puzzles Easy trivia questions Example: Which of the following is a bird? Elephant, Tiger or Robin,Cons Difficult to create a big enough database of these questions Difficult for ESL users / international users

Most text based CAPTCHAs have been broken by software OCR Segmentation Other CAPTCHAs were broken by streaming the tests for unsuspecting users to solve. Breaking CAPTCHA

Uses of CAPTCHA Online polls Free services Search engine bots Prevention to Worms and spams Preventing dictionary attack etc.

Properties  CAPTCHA should be automatically generated and graded  Test can be taken quickly and easily by human users  Test will accept virtually all human users and reject software agents  Test will resist automatic attack for many years despite the technology advances and prior knowledge of algorithms

Free Registration Hotmail Registration Yahoo! Registration

Final Thoughts  They are crucial to preventing bot attacks Hopefully, they will become more user-friendly to people with disabilities (visual, mental) CAPTCHA’s are mainly produced from AJAX and PHP technology Various algorithms are present Use of XML

Different CAPTCHA’s

PHP PHP – originally known as Personal Home Page It’s a Hypertext Preprocessor It is a scripting lang. Used to create dynamic web pages. With syntax from C,JAVA,perl etc PHP code is embedded within HTML pages for server side execution.

OCR (Optical Character Recognition) The machine recognition of printed characters. OCR systems can recognize many different OCR fonts, as well as typewriter and computer-printed characters. Advanced OCR systems can recognize hand printing. When a text document is scanned into the computer, it is turned into a bitmap, which is a picture of the text. OCR software analyzes the light and dark areas of the bitmap in order to identify each alphabetic letter and numeric digit. When it recognizes a character, it converts it into ASCII text. Hand printing is much more difficult to analyze than machine-printed characters. Old, worn and smudged documents are also difficult. Scanning documents and processing them with OCR is sometimes as much an art as it is a science.

OCR

Segmentation It is nothing but Image Processing Pixel based Segmentation Model based Segmentation Multi-scale Segmentation Semi-automatic Segmentation

Validators Types of validators : 1)Mark up : checks web documents in format like HTML,XHTML etc. 2)Link validator : checks hyperlinks,useful to find broken links 3)CSS validator : checks stylesheet 4)RDF validator : checks RDF documents 5)Feed validator 6)P3P validator : related to protocols Etc.

Session Management Process of keeping tracks of user’s activity across the sessions of interaction of user with comp sys. When user opens some web pages and does not do anything on that, session gets xpired. E.g : score watch on web site So after certain time when user re-login to the page then previously xpired session gets restored. E.g: if user opened yahoo acc in two windows, and after some time he\ she logged off from one window.then user cannot use same acc from other window, session gets xpired. User have to re-login to acc.

Session Management There are types : 1)Desktop management 2)Browser management Mainly useful for web applications