Software Applications for Cultural Diversity Rod Davis, SIL International

About SIL International
faith-based International NGO
language-based development
research, translation and literacy
active in 20% of the world’s languages

SIL Language Software Development
53 software development personnel
Member of the Unicode Consortium and very active on behalf of minority languages
60 software titles to support the work of language fieldworkers

One of the Biggest Challenges: Dealing with Complex Scripts
Encoding
  – Transition to Unicode
Input (Keyboarding)
  – Complexity
  – Extensibility
Type Design
  – Unicode based
  – Smart-font compiler technology
Rendering
  – Diacritic placement, contextual shaping, ligatures, reordering/splitting, bi-directionality

Solutions in Dealing with Complex Scripts
Encoding
  – TECkit, a utility to convert legacy encodings to Unicode
  – SIL resource site with information, tutorials, utilities, etc. for making the transition to Unicode
Input
  – Keyman (Windows)
  – KMFL (a Linux ‘Keyman’ still under development)
  – IMEs (Input Method Editors)
Type Design
  – Graphite Compiler adds Graphite rendering tables to a TrueType font, giving it ‘smart-font’ capability
Rendering
  – Graphite rendering engine: can handle basic display of any complex script in use today
NOTE: These technologies can benefit not only ‘complex scripts’, but also orthographies with special diacritics or a few non-Roman characters.

What is a Smart Font?
A font containing data describing how the glyphs are to be displayed.
The smart font data is in the form of tables inside the font file itself.
A rendering engine interprets the font tables to appropriately render the glyphs.
In contrast, a ‘dumb font’ has only a direct correspondence between the data characters and the displayed glyphs.
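The difference between a dumb font and a smart font can be sketched in a few lines. In this toy model, a dumb font is a one-to-one character-to-glyph table, and a smart font adds substitution rules that a rendering engine applies to the glyph sequence. The glyph names, the rule table, and both functions are assumptions made for illustration; they do not reflect the actual table layout of Graphite or any other font technology.

```python
# Toy character-to-glyph table (the 'dumb font' part). Glyph names are invented.
CMAP = {"f": "f", "i": "i", "n": "n", "\u0301": "acutecomb"}

# Toy 'smart font' rules: a glyph sequence and what replaces it.
SMART_RULES = [
    (("f", "i"), ("f_i",)),                            # ligature
    (("i", "acutecomb"), ("dotlessi", "acutecomb")),   # contextual shaping
]


def dumb_render(text):
    """One glyph per character, with no knowledge of context."""
    return [CMAP[ch] for ch in text]


def smart_render(text):
    """Map characters to glyphs, then apply the rules left to right."""
    glyphs = dumb_render(text)
    out = []
    i = 0
    while i < len(glyphs):
        for pattern, replacement in SMART_RULES:
            if tuple(glyphs[i:i + len(pattern)]) == pattern:
                out.extend(replacement)
                i += len(pattern)
                break
        else:
            # No rule matched at this position: keep the glyph unchanged.
            out.append(glyphs[i])
            i += 1
    return out


if __name__ == "__main__":
    print(dumb_render("fin"))
    print(smart_render("fin"))
```

The point of the sketch is that the rules live in data alongside the glyph table, so new behavior can be added without changing the rendering code.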

What is Graphite?
A package, developed by SIL International, that can be embedded into other applications, adding "smart font behavior" to that application.
This package includes the following:
  – A programming language (GDL) for specifying the font's behavior
  – A compiler for building the smart font
  – A rendering engine for displaying text using the smart font
Note that it is open-source:

What Makes Graphite Special?
It differs from other complex script technologies:
  – Unlike OpenType, it does NOT assume that script-specific information is incorporated at the application or operating system level (as in Uniscribe).
  – It is extensible, and provides support for characters in Unicode’s Private Use Area ranges … unlike Uniscribe, which handles only script behaviors already part of the Unicode standard.
  – It is an open-source solution versus proprietary.
  – It can handle basic display of any complex script in use today.
  – The following slides demonstrate the types of rendering difficulties that Graphite was designed to handle:
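For readers unfamiliar with the Private Use Area, the sketch below lists the three ranges the Unicode Standard reserves for private use and flags any such characters in a string. The function names are hypothetical; the code only identifies private-use code points and says nothing about how a renderer treats them.

```python
# Private-use ranges defined by the Unicode Standard.
PRIVATE_USE_RANGES = (
    (0xE000, 0xF8FF),        # Private Use Area in the Basic Multilingual Plane
    (0xF0000, 0xFFFFD),      # Supplementary Private Use Area-A (plane 15)
    (0x100000, 0x10FFFD),    # Supplementary Private Use Area-B (plane 16)
)


def is_private_use(ch: str) -> bool:
    """Return True if the single character ch is a private-use code point."""
    code_point = ord(ch)
    return any(low <= code_point <= high for low, high in PRIVATE_USE_RANGES)


def private_use_chars(text: str):
    """List the private-use code points found in text, in U+XXXX form."""
    return [f"U+{ord(ch):04X}" for ch in text if is_private_use(ch)]


if __name__ == "__main__":
    print(private_use_chars("a\ue000b"))
```

A check like this is useful when auditing data: characters in these ranges have no standard meaning, so they display correctly only with a font and rendering setup that agrees on the same private assignments.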

SIL’s Work with UNESCO
In 2003, SIL International and UNESCO engaged in a cooperative project as part of UNESCO’s Initiative effort.
Goal: Enable the development of complex script support in information and communication technologies (ICTs).
Project SILA: A Graphite-enabled version of Mozilla
  – Project Goal: To enable minority language communities to publish on the Internet …
Graphite-enabled Edit Control Version 0.9
  – basic Graphite edit control for data input applications
  – SDK for Windows 2000/XP, developer’s guide and well-commented source code
Modified version of WorldPad -- Graphite-Enabled Text Processor
  – Simple text processor for Windows 2000/XP

Graphite-enabled Mozilla
Contrast Internet Explorer on the left with Graphite-enabled Mozilla on the right. The Graphite-enabled version of Mozilla displays the multiple diacritics separately as intended.

Graphite-enabled Mozilla

Graphite-enabled Mozilla Instant Messaging

Graphite-enabled WorldPad

Lessons learned
We are participating in or have created several open source development efforts. We have learned that…
  – FLOSS (Free/Libre Open Source Software) Linux is very attractive to the low-income groups we work with.
  – Setup and maintenance of an open source OS and software is too complex for most of our end users.
  – Open source development is complex.
  – Getting support for complex minority scripts into the “core” builds of open source software is hard.
  – Localization of software is complex and time-consuming.
  – Despite the difficulties, the benefits are great and we are heading in this direction on a number of fronts.

What is Keyman?
Keyman allows you to enter text in Windows®-based applications in other languages without changing your physical keyboard (or system keyboard).
It does this by remapping the character keys according to the font for the language you wish to use.
It works with both ANSI and Unicode. It is in wide use and well tested.
Keyman 6.1 is a commercial program available at a reasonable price:
The development tool called ‘Keyman Developer’ is sold separately, and uses a rule-based programming language.
Note: There are some issues with very large character sets that are handled better with IMEs (Input Method Editors). However, the IMEs are more complicated and are not really user modifiable or extensible.
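The idea of rule-based key remapping can be illustrated with a small sketch. Each rule pairs a context (text already typed) and a key with the output that replaces the context. The rule set, the names RULES and type_keys, and the matching strategy are all assumptions for illustration; this is not the Keyman keyboard language or its actual behavior.

```python
# Hypothetical rules: (context already typed, key pressed) -> replacement text.
RULES = {
    ("n", ";"): "\u014B",   # n followed by ; becomes eng
    ("o", ";"): "\u0254",   # o followed by ; becomes open o
    ("", "'"): "\u0301",    # apostrophe alone types a combining acute accent
}

# Try rules with longer contexts first so the most specific rule wins.
ORDERED_RULES = sorted(RULES.items(), key=lambda item: -len(item[0][0]))


def type_keys(keys: str) -> str:
    """Simulate typing the given keys and return the resulting text."""
    buffer = ""
    for key in keys:
        for (context, rule_key), output in ORDERED_RULES:
            if key == rule_key and buffer.endswith(context):
                # Replace the matched context with the rule's output.
                buffer = buffer[: len(buffer) - len(context)] + output
                break
        else:
            # No rule applies: the key produces itself.
            buffer += key
    return buffer


if __name__ == "__main__":
    print(type_keys("n;o;"))
```

Because the rules are plain data, a user can extend the layout by adding entries, which is the kind of extensibility the slide contrasts with IMEs.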

Technical overview of Keyman
[Diagram labels: TIKE Compiler; Keyman Program (.KMN); Keyman Executable (.KMX); Driver; Configuration Program; Operating system; Application software; keystrokes; characters]

SIL Unicode-based Fonts
Doulos SIL 4.0: A linguist’s general use font … This is a comprehensive inventory of glyphs needed for almost any Roman- or Cyrillic-based writing system, whether used for phonetic or orthographic needs. Status: Released but not a ‘full-family’ font (regular typeface only)
Gentium is a typeface family designed to enable the diverse ethnic groups around the world who use the Latin script to produce readable, high-quality publications. It supports a wide range of Latin-based alphabets and includes glyphs that correspond to all the Latin ranges of Unicode. Status: Released and in wide use; Linux installation also released.
Additional Unicode-based fonts, including one specifically tuned for literacy use, are under development.
(Just remember )

Language and Culture tools
SIL International has also developed software tools that serve the following needs:
Lexical database programs that facilitate:
  – Grammatical/morphological analysis
  – Dictionary creation and printing
  – Web publication of lexicons
  – Entry and analysis of cultural data
Language Survey tools for doing statistical comparison of related languages/dialects
Speech Tools for doing phonetic analysis of speech (including tonal analysis)
An extensive CD-ROM based reference tool that covers all aspects of language and cultural field work

FieldWorks Data Notebook for analyzing cultural data

ToolBox for language data management

Lexique Pro for publishing interactive lexicons

WordSurv and PalmSurv to compare language survey data

Speech Tools for analyzing speech data See

LinguaLinks Library for electronic reference materials to support language and culture fieldwork
Anthropology
Consulting
General Reference Works
Language Learning
Linguistics
Literacy
Sociolinguistics
Translation

Partnering in Script Development
Things SIL International could potentially offer:
  – Support script development for languages currently unsupported
  – Partner in the development of keyboarding solutions
  – Continued development of the Graphite rendering technology
  – Support incorporating these technologies into current open source or commercial tools, e.g., OpenOffice
Things SIL International would benefit from:
  – Resources necessary to actually develop the scripts (people, money, hardware/software)
  – Community cooperation in the standardization of scripts
  – Help getting open source software groups to incorporate these technologies into their core software efforts

Web-sites of Interest
SIL International:
Multilingual Computing: Content: Links to some SIL contributions to research and development in the area of multilingual computing, links to additional resources, and a glossary of character encoding and rendering.
Fonts in Cyberspace: Content: A guide to finding language fonts on the Internet, containing more than 400 sources for 123 languages.
NRSI: Computers & Writing Systems: Content: Character Encoding, Understanding Unicode, Keyboard Design and Keyboarding Utilities, Type Design, and Script Rendering Technologies. http://scripts.sil.org
SIL Software Catalog: Content: SIL has developed more than 60 pieces of software to support the work of its fieldworkers; most are available to the public for free download.
Keyman 6.1: Content: Keyman is commercial Windows keyboard-mapping software that, with Keyman Developer, can be used to design your own keyboard layouts.
SIL FieldWorks 2.0: SIL FieldWorks is a suite of software tools that work together to help language teams worldwide. The current released suite of programs includes WorldPad and the SIL FieldWorks Data Notebook. Additional applications are under development.
Field Linguist’s Toolbox: Toolbox is a data management and analysis tool for field linguists. It is especially useful for maintaining lexical data, and for parsing and interlinearizing text, but it can be used to manage virtually any kind of data.
Lexique Pro: Lexique Pro is an interactive lexicon viewer, with hyperlinks between entries, category views, dictionary reversal, search, and export tools. It's designed to display your data in a user-friendly format so you can distribute it to others.