PDF Accessibility with Python Anand B Pillai. A few terms Accessibility – *“Accessibility is a general term used to describe the degree to which a product,

Slides:



Advertisements
Similar presentations
WCAG 2 Compliance With PDF
Advertisements

Join the A-Team ACCESSIBILITY-TEAM. Session 1 EVALUATE ACCESSIBILITY OF HTML FILES AND SET THE PATH TO A D2L STYLE SHEET.
Introduction to Web Accessibility. What is Web Accessibility Web accessibility means that people with disabilities can use the Web Disabilities including.
A Web Accessibility Primer: Usability for Everyone XX Presenter Name Presenter Title Presenter Contact Office of Web Communications.
Lab: Making PDF documents truly accessible Mireia Ribera, Universitat de Barcelona Friday, Nov , 2:15 - 4:15 p.m. 12th Annual Accessing Higher.
Web Accessibility Tests Using the Firefox Browser ACCESS to Postsecondary Education through Universal Design for Learning.
CM143 - Web Week 2 Basic HTML. Links and Image Tags.
The W3C Web Accessibility Initiative (WAI) Inclusive learning through technology Damien French.
Topics in this presentation: The Web and how it works Difference between Web pages and web sites Web browsers and Web servers HTML purpose and structure.
George Irwin Syracuse University.  Definitions  Creating PDF  Retrofitting PDF documents  Assistive technology and PDF  Resources.
Accessibility Tools in Microsoft Office 2010 and 2013 ADA Conference 2014 Norah Sinclair Tessa Greenleaf.
Web Accessibility. Ensuring people of all abilities have equal access to web content Disability Discrimination Act – Web Access Advisory notes 2010 Required.
Accessibility of online instructional tools and documents Terrill Thompson ATUS Technology Accessibility Consultant x 2136
Creating and publishing accessible course materials Practical advise you can replicate.
Understanding End User Role in PDF Accessibility Brad Hodges, AFB Consulting Pete De Vasto, Adobe Systems.
Creating a Simple Page: HTML Overview
The Internet Writer’s Handbook 2/e Web Accessibility Writing for the Web.
WEB DESIGN USING DREAMWEAVER. The World Wide Web –A Web site is a group of related files organized around a common topic –A Web page is a single file.
Introduction to HTML5. History of HTML HTML first published – Tim Berners-Lee HTML 2.0 HTML 3.2 HTML 4.01 XHTML 1.0 XHTML 2.0.
Learning Web Design: Chapter 4. HTML  Hypertext Markup Language (HTML)  Uses tags to tell the browser the start and end of a certain kind of formatting.
Web Accessiblity Carol Gordon SIU Medical Library.
Measuring PDF Accessibility PDF a11y Testing for the Rest of a11yBOS 2013 Deque Systems, Inc.
Accessibility of online instructional tools and documents Terrill Thompson Technology Accessibility
XHTML1 Images N100 Building a Simple Web Page. XHTML2 The Element The src attribute specifies the filename of an image file To include the src attribute.
Using Plug-Ins Adding Multimedia to an HTML Document.
Lab: Making PDF documents truly accessible Mireia Ribera, Universitat de Barcelona Friday, Nov , 2:15 - 4:15 p.m. 12th Annual Accessing Higher.
A centre of expertise in digital information managementwww.ukoln.ac.uk Accessibility and Usability For Web Sites: An Introduction to Web Accessibility.
CS 200 Multimedia Objects in Web Pages. MultiMedia Objects Three primary types of multimedia objects  Audio  Video Includes Flash Objects  Images.
Iframe.
PDF Accessibility with Python Anand B Pillai. A few terms ● Accessibility – *“Accessibility is a general term used to describe the degree to which a product,
HTML Structure & syntax
Instructional Technology and Accessibility
4.01 How Web Pages Work.
Making Your Website Accessible
The Role of Tool Support in Public Policies and Accessibility
Python Data Analytics & Visualization
County Website Content Management System
How do Web Applications Work?
Creating Accessible PDF’s for the Web
Web & accessibility resources
Section 508 At long last, two of the most looming accessibility questions in the United States have been answered.
4.01B Authoring Languages and Web Authoring Software
Accessibility with Lectora Inspire 16
Making the Web Accessible to Impaired Users
Everyday inclusive Web design: an activity perspective
Screen Reader Testing and Website Support for Beginners
Documentation Generators
Information Architecture and Design I
Screen Reader Testing and Website Support for Beginners
Introduction to Web Accessibility
Creating ADA Compliant Resources
Objective % Explain concepts used to create websites.
Shawn and Rebeca Who are we?. Accessibility, the final frontier, these are our voyages into best practices...
Universally Designed Course Materials
Context Is Everything Meaningful Alternative Text
4.01B Authoring Languages and Web Authoring Software
Web Programming– UFCFB Lecture 3
Lakeshore Public Schools
Managing a Web Server and Files
What are Cascading Stylesheets (CSS)?
Information Architecture and Design I
ADA Compliant Website & Documents
Structuring Content in a Web Document
Introduction to Web Accessibility
Change how your event is seen
Web Standards and Accessible Design.
Lars Ballieu Christensen Advisor, Ph.D., M.Sc. Tanja Stevns
4.01 How Web Pages Work.
4.01 How Web Pages Work.
And Mobile Web Browsers
Presentation transcript:

PDF Accessibility with Python Anand B Pillai

A few terms Accessibility – *“Accessibility is a general term used to describe the degree to which a product, device, service, or environment is accessible by as many people as possible.” Web Accessibility - *“Web accessibility refers to the inclusive practice of making websites usable by people of all abilities and disabilities.” Document Accessibility – Accessibility principles applied to documents such as PDF, Word, Openoffice etc. *definitions from Wikipedia

Accessible

Not Accessibile

Web/Document Accessibility Accessibility techniques help disabled users to interpret web pages or documents with the help of technologies such as screen readers. For this, web sites/documents need to be written in keeping with accessibility guidelines. Web content accessibility guidelines – WCAG 1.0 (earlier) and WCAG 2.0 Document accessibility – No “official” guidelines, but general guidelines and techniques available.

PDF Rapid growth on the web In creasing use by governments, banks and other agents. Example: Mobile Bills, Bank Statements, IT returns etc. In India, the usage is just taking off now In western countries, a lot of e-governance transactions use PDF documents by default.

PDF and Accessibility Very easy to create inaccessible PDF! Before Acrobat 5 (2001), PDF was not very accessible Acrobat 5 and later introduced ability to “tag” content like HTML documents, which greatly improved accessibility W3C doesn't recognize PDF as a standard format since it requires a browser plug-in. So WCAG guidelines don't consider PDF as fully accessible yet.

5 ways of creating inaccessible PDF Scanned PDF Embedding multimedia such as video or audio files Embedding interactive forms Disabling access to PDF structure to accessibility technologies (screen readers etc) using encryption Multi-columned pages

Scanned PDF =

Why scanned PDF is Evil Scanned PDF is one big raster image – a big binary blob One loses all structure in the original scanned document Assistive technolgies completely fail on scanned PDF documents since there is no meta or structure information to process If you use scanned PDF, you are creating accessibility barries for the disabled who might use your documents

Python and PDF Not much support but a few open source libraries PyPDF - Pretty good PDF parser and writer, very extensible (last rel, 1.12, Sep 2008) PDFMiner - dex.html Robust PDF parser, well maintained (last rel Aug 2010) Reportlab - Professional PDF reporting toolkit

Egovmon.no A project based in Norway to measure e-governance indicators in the areas of Accessibility, Transparency, Efficiency & Impact funded by Research Council of Norway. Part of the project is an onlne PDF accessibility evaluator web service PDF web accessiblity module (WAM) is written in Python using pyPdf.

PDF WAM Provides a SOAP web-service at port 8893 for evaluating PDF URLs or content Returns a Python dictionary of results after processing the PDF which is processed by the front- end to display accessibility data.

PDF WAM Output Evaluating: pdf #Pages => 23 Producer=> Adobe PDF Scan Library Creator=> "PFU ScanSnap Manager" Title=> (None) Version=> 1.3 Has structure tree=> False Has forms=> False Has bookmarks=> False Scan check: found scan producer! Warning: document has no headers! Processed in 0.05 seconds {'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.5.1': {(0, 1): 0}, 'EIAO.A PDF.8.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.9.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): '1.3'}, 'EIAO.A PDF.2.1': {(0, 1): u'"PFU ScanSnap Manager"'}, 'EIAO.A PDF.7.1': {(0, 1): 0}, 'EIAO.A PDF.6.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 0}, 'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.1.1': {(0, 1): 1}, 'EIAO.A PDF.3.1': {(0, 1): u'Adobe PDF Scan Library 1.0.0'}, 'EIAO.A PDF.1.1': {(0, 1): 1}}

Source Code Open-source, released under GNU GPL Subversion Compatible with Python <=2.6.x pyPDf is packaged along, so no need to download it separately. Provides a command line checker called “pdfchecker.py”

Links Web AIM, defining PDF accesibility: Creating accessible PDF files: Egovmon : Egovmon PDF accessibility checker: A list apart – Facts and opinions about PDF accessibility:

Questions ? Thank you!