PDF Accessibility with Python Anand B Pillai. A few terms ● Accessibility – *“Accessibility is a general term used to describe the degree to which a product,

Slides:

Advertisements

Similar presentations

WCAG 2 Compliance With PDF

Advertisements

MMDE5011 – INTERACTIVE MEDIA PRACTICE 1 WEEK 1: INTRODUCTION TO HTML5

Basic Accessible PDF Document Training Adobe Acrobat Professional 11.

Workshop 1 Basics of Web Accessibility Web Content Accessibility Project Funded by BCcampus Natasha Boskic, Kirsten Bole, Nathan Hapke University of British.

Accessible PDF Creation using Adobe Acrobat Professional 11.

Introduction to Web Accessibility. What is Web Accessibility Web accessibility means that people with disabilities can use the Web Disabilities including.

Web Accessibility Tests Using the Firefox Browser ACCESS to Postsecondary Education through Universal Design for Learning.

CM143 - Web Week 2 Basic HTML. Links and Image Tags.

The W3C Web Accessibility Initiative (WAI) Inclusive learning through technology Damien French.

PDFs & Dorsetforyou.com Laura Hall Senior Website Officer

George Irwin Syracuse University.  Definitions  Creating PDF  Retrofitting PDF documents  Assistive technology and PDF  Resources.

Accessibility Tools in Microsoft Office 2010 and 2013 ADA Conference 2014 Norah Sinclair Tessa Greenleaf.

Web Accessibility. Ensuring people of all abilities have equal access to web content Disability Discrimination Act – Web Access Advisory notes 2010 Required.

Dhananjay Bhole, Coordinator, Accessibility Research Group, Department of Education and Extension, University of Pune.

Creating and publishing accessible course materials Practical advise you can replicate.

Accessible Word Document Training Microsoft Word 2010.

 What is web accessibility? ture=relatedhttp://

Understanding End User Role in PDF Accessibility Brad Hodges, AFB Consulting Pete De Vasto, Adobe Systems.

CMPT241 Web Programming More HTML. Homework 1 Summary of what you have learned Link to the homework on the home page The page needs to be representable.

Creating a Simple Page: HTML Overview

Week 2 Web Site Design Principles. 2 Design for the Computer Medium Craft the look and feel Make your design portable Design for low bandwidth Plan for.

The Internet Writer’s Handbook 2/e Web Accessibility Writing for the Web.

WEB DESIGN USING DREAMWEAVER. The World Wide Web –A Web site is a group of related files organized around a common topic –A Web page is a single file.

Measuring PDF Accessibility PDF a11y Testing for the Rest of a11yCamp DC 2013 Deque Systems, Inc.

Learning Web Design: Chapter 4. HTML  Hypertext Markup Language (HTML)  Uses tags to tell the browser the start and end of a certain kind of formatting.

Web Accessiblity Carol Gordon SIU Medical Library.

Measuring PDF Accessibility PDF a11y Testing for the Rest of a11yBOS 2013 Deque Systems, Inc.

Design and Construction of Accessible Web Sites Michael Burks Chairman Internet Society SIG For Internet Accessibility for People with Disabilities June.

Introduction to HTML Tutorial 1 eXtensible Markup Language (XML)

Accessibility of online instructional tools and documents Terrill Thompson Technology Accessibility

Measuring PDF Accessibility PDF a11y Testing for the Rest of a11yCamp TO 2013 Deque Systems, Inc.

Use CSS to Implement a Reusable Design Selecting a Dreamweaver CSS Starter Layout is the easiest way to create a page with a CSS layout You can access.

A centre of expertise in digital information managementwww.ukoln.ac.uk Accessibility and Usability For Web Sites: An Introduction to Web Accessibility.

Accessible Word Document Creation Using Microsoft Word 2010.

Accessible PDF Creation using Adobe Acrobat Professional 11.

Making the Most of PDFs PDF (portable document format) is a file format developed by Adobe Systems. PDFs make it possible to send documents with original.

2 |2 | Overview of the presentation What is disability? What is the global situation for persons with disabilities? What is accessibility? What is ICT.

Creating Inclusive Classrooms in Online Courses using Universal Design for Learning (UDL) Principles Pamela T. Dunning, Ph.D. Troy University

PDF Accessibility with Python Anand B Pillai. A few terms Accessibility – *“Accessibility is a general term used to describe the degree to which a product,

Instructional Technology and Accessibility

4.01 How Web Pages Work.

County Website Content Management System

Creating Accessible PDF’s for the Web

Getting Started with CSS

Web & accessibility resources

Section 508 At long last, two of the most looming accessibility questions in the United States have been answered.

Accessibility with Lectora Inspire 16

Making the Web Accessible to Impaired Users

Pamela T. Dunning, Ph.D. Troy University

XML QUESTIONS AND ANSWERS

Documentation Generators

Information Architecture and Design I

Best Practices and Lessons Learned

Introduction to Web Accessibility

Objective % Explain concepts used to create websites.

Universally Designed Course Materials

Context Is Everything Meaningful Alternative Text

Web Content Accessibility Beata M. Ofianewska (DG COMM) 7 December 2006 December 2006 COMM C2.

Web Programming– UFCFB Lecture 3

Lakeshore Public Schools

Information Architecture and Design I

Why use Web Standards?.

Structuring Content in a Web Document

Introduction to Web Accessibility

Lars Ballieu Christensen Advisor, Ph.D., M.Sc. Tanja Stevns

Change how your event is seen

Web Standards and Accessible Design.

Lars Ballieu Christensen Advisor, Ph.D., M.Sc. Tanja Stevns

4.01 How Web Pages Work.

And Mobile Web Browsers

Presentation transcript:

PDF Accessibility with Python Anand B Pillai

A few terms ● Accessibility – *“Accessibility is a general term used to describe the degree to which a product, device, service, or environment is accessible by as many people as possible.” ● Web Accessibility - *“Web accessibility refers to the inclusive practice of making websites usable by people of all abilities and disabilities.” ● Document Accessibility – Accessibility principles applied to documents such as PDF, Word, Openoffice etc. *definitions from Wikipedia

Accessible

Not Accessibile

Web/Document Accessibility ● Accessibility techniques help disabled users to interpret web pages or documents with the help of technologies such as screen readers. ● For this, web sites/documents need to be written in keeping with accessibility guidelines. ● Web content accessibility guidelines – WCAG 1.0 (earlier) and WCAG 2.0 ● Document accessibility – No “official” guidelines, but general guidelines and techniques available.

PDF ● Rapid growth on the web ● In creasing use by governments, banks and other agents. – Example: Mobile Bills, Bank Statements, IT returns etc. ● In India, the usage is just taking off now ● In western countries, a lot of e-governance transactions use PDF documents by default.

PDF and Accessibility ● Very easy to create inaccessible PDF! ● Before Acrobat 5 (2001), PDF was not very accessible ● Acrobat 5 and later introduced ability to “tag” content like HTML documents, which greatly improved accessibility ● W3C doesn't recognize PDF as a standard format since it requires a browser plug-in. So WCAG guidelines don't consider PDF as fully accessible yet.

Using Acrobat for quick accessibility check Go to Document->Accessibility Quick Check

5 ways of creating inaccessible PDF ! ● Scanned PDF ● Embedding multimedia such as video or audio files ● Embedding interactive forms ● Disabling access to PDF structure to accessibility technologies (screen readers etc) using encryption ● Multi-columned pages

Scanned PDF =

Checking scanned PDF accessibility in Acrobat

Why scanned PDF is Evil ● Scanned PDF is one big raster image – a big binary blob ● One loses all structure in the original scanned document ● Assistive technologies completely fail on scanned PDF documents since there is no meta or structure information to process ● If you use scanned PDF, you are creating accessibility barriers for the disabled who might use your documents

Other PDF Evils ● Multiple columns – Makes it very difficult for screen readers to process the document (tends to read text on two columns as a single line) ● Interactive Forms – Forms are meant for HTML pages, not PDF documents. Defer from using them unless there is a clearly defined need. ● Not defining natural language – Define a natural language for the document. Otherwise screen readers could use wrong speech engines. (Egs: English engine for spanish document) ● No document title – Defining a meaningful title for the document might seem like a small thing, but for the visually disabled, this is a major barrier to accessibility

Python and PDF ● A handful of open source libraries ● PyPDF - Pretty good PDF parser and writer, very extensible (last rel, 1.12, Sep 2008) ● PDFMiner- Robust PDF parser, well maintained (last rel Aug 2010) ● Reportlab Professional PDF reporting toolkit

Egovmon.no ● A project based in Norway to measure e- governance indicators in the areas of Accessibility, Transparency, Efficiency & Impact funded by Research Council of Norway. ● Part of the project is an onlne PDF accessibility evaluator web service ● PDF web accessiblity module (WAM) is written in Python using pyPdf as the back-end. ●

PDF WAM Checks ● Tests a PDF document for the following – Valid document title – Natural language definition – Presence of tags (document structure) – Multiple columns present or not – Consistent document structure (headers in correct order etc) – Embedded multimedia – Interactive forms – Bookmarks – Scanned PDF – Document permissions (encryption etc)

PDF WAM ● Provides a SOAP web-service at port 8893 for evaluating PDF URLs or content ● Returns a Python dictionary of results after processing the PDF which is processed by the front-end to display accessibility data.

PDF WAM Output (Server Log) Evaluating: #Pages => 23 Producer=> Adobe PDF Scan Library Creator=> "PFU ScanSnap Manager" Title=> (None) Version=> 1.3 Has structure tree=> False Has forms=> False Has bookmarks=> False Scan check: found scan producer! Warning: document has no headers! Processed in 0.05 seconds {'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.5.1': {(0, 1): 0}, 'EIAO.A PDF.8.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.9.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): '1.3'}, 'EIAO.A PDF.2.1': {(0, 1): u'"PFU ScanSnap Manager"'}, 'EIAO.A PDF.7.1': {(0, 1): 0}, 'EIAO.A PDF.6.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.3.1': {(0, 1): u'Adobe PDF Scan Library 1.0.0'}, 'EIAO.A PDF.1.1': {(0, 1): 1}}

Source Code ● Open-source, released under GNU GPL ● Subversion ● Compatible with Python <=2.6.x ● pyPDf is packaged along, so no need to download it separately. ● Provides a command line checker called “pdfchecker.py”

Links ● Web AIM, defining PDF accesibility: ● Creating accessible PDF files: ● Egovmon : ● Egovmon PDF accessibility checker: ● A list apart – Facts and opinions about PDF accessibility:

Questions ? Thank you!