Hermes An Effective Converter from TeX into MathML Romeo Anghelache & Bernard Schutz Albert Einstein Institute Potsdam, Germany

Slides:



Advertisements
Similar presentations
Rclis in vision and reality Thomas Krichel
Advertisements

EPrints 2.0 / March 4 th 2002 / Glasgow / Chris Gutteridge Introduction to EPrints 2.0 March 4 th 2002 Glasgow Christopher Gutteridge from the Department.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
MoWGLI - Mathematics in the Semantic Web An Approach to Machine-Understandable Representation of the Information in Digital Documents Bernd Wegner Technical.
EndNote reference manager. Introduction to EndNote.
XSL-FO + MathML Render MathML to Display, PDF, SVG September 18
EndNote. What is EndNote:  EndNote is referencing software that enables you to create a database of references from your readings. Your database of references.
XHTML Basics.
This presentation is intended as a detailed WebEx, to bring potential customers to an understanding of Dream Report capabilities. This presentation focuses.
Features and Uses of a Multilingual Full-Text Electronic Theses and Dissertations (ETDs) System Yin Zhang Kent State University Kyiho Lee, Bumjong You.
© Nova / Grassioulet 5-6 mai 2002 TECFA staf2x S V G glossary Scalable Vector Graphics.
Extraction of text data and hyperlink structure from scanned images of mathematical journals Ann Arbor, March 19, 2002 Masakazu Suzuki (Kyushu University)
Technical Tips and Tricks for User Support Mike Gardner
Elsevier's program to support research data Presented by: Dr. Eleonora Presani, Publisher High Energy Physics.
Unit 2, cont. September 14 HTML,Validating your pages, Publishing your site.
1 CA201 Word Application Creating Document for the Web Week # 9 By Tariq Ibn Aziz Dammam Community college.
Web of Science: An Introduction Peggy Jobe
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Dreamweaver 8 Concepts and Techniques Introduction Web Site Development and Macromedia Dreamweaver 8.
HTML, XML, PDF Pros and Cons.
Reference Manager Making your life easier! Updated September 2007.
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
THE BASICS OF THE WEB Davison Web Design. Introduction to the Web Main Ideas The Internet is a worldwide network of hardware. The World Wide Web is part.
Carrie Ann Desnoyers Instructional Designer CREATING ACCESSIBLE MATH IN D2L MATHTYPE, LATEX, MATHPAGES, AND MATHPLAYER.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
Web Design Basic Concepts.
Use Watch folders to automatically add PDFs to Mendeley Desktop. When you place a document in a watched folder, it will be automatically added to Mendeley.
PubMed/History; Accessing Full-Text Articles (module 4.4)
Web Content Management Systems. Lecture Contents Web Content Management Systems Non-technical users manage content Workflow management system Different.
For AMS 200, 2009 Dept. of Applied Math & Statistics School of Engineering University of California Santa Cruz, CA A Very Short Introduction to LaTeX.
Welcome to the 3-day Workshop on > Presented by Saeid Abolfazli Faculty of Computer Science and IT University Malaya 23,24, and.
Exploring Word Grauer and Barber 1 Committed to Shaping the Next Generation of IT Experts. Chapter 6: Introduction to HTML: Creating a Home Page.
M. Taimoor Khan * Java Server Pages (JSP) is a server-side programming technology that enables the creation of dynamic,
Luc Audrain Hachette Livre Head of digitalization
Getting Started with Expression Web 3
INTRODUCTION TO FRONTPAGE. TOPICS TO BE DISCUSSED……….  Introduction Introduction  Features Features  Starting Front Page Starting Front Page  Components.
XP 1 HTML: The Language of the Web A Web page is a text file written in a language called Hypertext Markup Language. A markup language is a language that.
Managing your References Sue Bird Bodleian Bio- & Environmental Sciences October 2010.
Open Textbooks and Electronic Publishing Formats/Standards Arctic Virtual Learnng Tools
THOMSON SCIENTIFIC Web of Science 7.0 via the Web of Knowledge 3.0 Platform Access to the World’s Most Important Published Research.
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
Part 4 – Preview/Index, History, combining search sets, Accessing full text articles and restricting results to the HINARI subset of journals. Instructions.
WIRESCRIPT1 WIRESCRIPT Web Interactive REview of Scientific Culture, Research, Innovation Policy and Technology.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Fundamentals of Web Design Copyright ©2004  Department of Computer & Information Science Introducing XHTML: Module A: Web Design Basics.
New Printing Options from RTF Charles Engelke Info Tech, Inc. Trnsport Users’ Group October 15, 2004.
Digital Archiving in the Hungarian Széchényi Library The story and the plans of the Hungarian Electronic Library Rome, 21. Oct István Moldován OSZK,
OARE Module 5A: Scopus (Elsevier). Table of Contents About Scopus (Elsevier) Using Scopus Search Page Results/Refine Search Pages Download, PDF, Export,
TUMS Scientific Writing EndNote Web Payam Kabiri, MD. PhD Epidemiologist Department of Epidemiology & Biostatistics School of Public Health Tehran University.
EndNote. What is EndNote? EndNote is referencing software that enables you to create a database of references from your readings.
1 Tutorial 11 Creating an XML Document Developing a Document for a Cooking Web Site.
Document Computing Technologies for Managing Electronic Document Collections Ross Wilkinson... [et al.] Circulation Counter [RES3H] ZA4080.D
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
THOMSON SCIENTIFIC Web of Science 7.0 via the Web of Knowledge 3.0 Platform Access to the World’s Most Important Published Research.
WORLD CONSORTIUM Welcome to. An overview by Phil Elliott Satzconcept Skandinavia a.s.
FACES General Overview ViRR (Virtueller Raum Reichsrecht) Software Solutions Kristina Büchner and Bastien Saquet Contact:Kristina Buechner:
Cascading Style Sheet CSS Closing Switch Closing Tag Code View
PubMed/Preview, Index & History; Accessing Full-Text Articles (module 4.4)
Tutorial #1 Using HTML to Create Web Pages. HTML, XHTML, and CSS HTML – HyperText Markup Language The tags the browser uses to define the content of the.
COMP 143 Web Development with Adobe Dreamweaver CC.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
A Bibliographic Management Software NORSHUHADA SAIDIN REFERENCE & RESEARCH DIVISION PERPUSTAKAAN KEJURUTERAAN UNIVERSITI SAINS MALAYSIA.
XHTML Basics.
Workshop on XML-Based Library Applications 5
XHTML Basics.
XHTML Basics.
EndNote by: fatimah alotaibi.
XHTML Basics.
XHTML Basics.
Ann Arbor, March 19, 2002 Masakazu Suzuki (Kyushu University)
Presentation transcript:

Hermes An Effective Converter from TeX into MathML Romeo Anghelache & Bernard Schutz Albert Einstein Institute Potsdam, Germany

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 2 Authoring Mathematics for the Web Mathematical display on the Web is still rudimentary. Most journals with e-portals offer full text only as a download. MathML becoming a standard, but not easy to write. Physicists and mathematicians write comfortably in TeX. A tool to put this on the web would not only help journal publishing. Research departments and institutes could build better websites, scientific collaborations could exchange information internally more effectively, and mathematical searching could be supported. EU-sponsored MoWGLI project is developing mathematical databases, theorem tools, journals with common search interfaces. Living Reviews in Relativity joined MoWGLI to develop a MathML authoring tool. Hermes is one of the deliverables. Hermes is a tool that allows authors to create MathML by writing in LaTeX.

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 3 Living Reviews Family LR Relativity first published in 1998 by AEI. Open-access review journal, now very widely used. Authors keep articles up to date, revisions 1-3 yr. Authors write in LaTeX with LR style file, already provide plentiful keyword information for articles and cited references. All articles displayed on web in full-text HTML: –mathematics included as gif images (not scalable, browser displays inconsistent); –HTML enhanced to offer many reader-friendly navigation aids (example);example –downloads offered in PDF, PS, HTML. Conversion TeX  HTML by customizable open-domain article builder ( LR family growing: Solar Physics is now on web, European Political Science expected soon.

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 4 Living Reviews and MoWGLI MoWGLI: LR develops MathML-based XHTML publishing/authoring tools. MoWGLI partners plan a unified search engine that spans all partner sites. With MathML, the HTML presentation scales properly and is (almost) browser-independent. More interestingly, equations can include metadata to assist searching and other bibliographic purposes. LRR Editorial Board is committed to enhancing searches: –develop a dictionary for relativity mathematics; –index equations with metadata to help relatively un-intelligent searches; –implement user searches for mathematics; –within MoWGLI, implement automated search interface. MathML may also permit us to offer cut-and-paste of content- MathML from the journal to formula-based software, such as algebraic mathematical programs or to spreadsheets.

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 5 Hermes Author: Romeo Anghelache There have been various attacks on LaTeX  MathML conversion. Our approach is not to try to solve all problems with LaTeX. Simply map presentation(TeX)  presentation(MathML). Hermes starts from DVI file: –no need to re-invent TeX compiler; –DVI contains formatting, font information; –DVI nearly independent of TeX/LaTeX varieties, packages; –must interpret/ignore \special constructs for, say, dvips. Hermes uses \special commands itself to preserve mathematical meaning and insert metadata into DVI file. Author does this using LaTeX macros provided in a style file. Most are transparent to author: renaming of standard LaTeX. So Hermes permits authors to use LaTeX to author MathML. Hermes is open-source, protected by the GNU GPL license.

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 6 How Hermes works: 4 steps 1.Seed the LaTeX source with \special commands –Authors use LaTeX macros, never deal with “raw” \special command 2.Parse the DVI file, recover the \special commands –Does not interfere with TeX, leaves rendering untouched 3.Output XML –Text mapped to Unicode –Math expressions/symbols mapped to MathML 4.Create Web presentation using style sheet –XSLT  XHTML+MathML

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 7 1. Seed the LaTeX source with \specials In author’s article: \documentclass{article} \input ltxdef \begin{document} … In ltxdef : … \let\oldcdots=\cdots \gdef\cdots{\special{cdots}\oldcdots} …

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 8 2. Parse the DVI file $a+b$ gives, in the dvi file : ‘imath:’ ‘fntnum10’ ‘a’ ‘fntnum7’ ‘+’ ‘fntnum10’ ‘b’ ‘:imath’ Has sufficient presentation info: Hermes translates it to a + b We’re lucky here: there is content information too. Hermes automatically writes: a b

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 9 3. Output XML Text mapped to Unicode –Most TeX font character codes generate corresponding Unicode characters. –For accents, Hermes uses the Unicode combining characters mechanism. Math mapped to MathML –Everything defaults to MathML presentation –Where the meaning is clear, Hermes wraps it in MathML content Resulting XML: text …

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes XSLT to XHTML+MathML The XML document will be the reference version in Living Reviews (i.e. the source for searches, etc). Contains document metadata (subset of Dublin Core). For rendering in browsers, this is converted to XHTML using XSL. Fine rendering also needs: A unicode font installed in the systemunicode Mathematical fonts IE6 needs MathPlayer to render MathMLMathPlayerMathML Mozilla renders MathML on its ownMozillaMathML Example available at Living ReviewsExampleLiving Reviews

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 11 Content and Hermes: possibilities Manual metadata: modify LaTeX \equation macro to add extra argument where author can insert search metadata. –Carried through to the XML via a \special in modified macro. Automatic metadata: suppose we want to index every occurrence of Riemann tensor in all equations. –Create LaTeX macro \riemann that authors use instead of composing R abcd by hand. –\riemann has arguments for LaTeX and MathML presentation (indices, placement) and it also inserts a \special containing standard search metadata for this concept. –As in the manual case, the \special inserts the metadata into the XML. –Such macros can be easier to use and produce cleaner LaTeX!

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 12 The Future of Hermes Currently in version –Testing by processing back volumes of Zentralblatt für Mathematik When 1.0 is released: –Hermes will be open-source, public domain. –Living Reviews will use it to process new and existing articles, offer XHTML/MathML versions alongside standard HTML/gif versions for older browsers. –For 2005 (Einstein Year), AEI and Max Planck Institute for History of Science (Berlin) will process and make available about 20 original Einstein papers from Annalen der Physik, with content MathML and searchable metadata markup. If MoWGLI successor project Authomath is funded: –LRR will build relativity mathematics dictionary, create search engine, use Hermes to insert dictionary keywords into equations. Partners welcome: like all open-source projects, the key to the future is building a community of users and developers!

Bernard F Schutz Albert Einstein Institute 27 April 2004 Hermes 13 Links to further information Hermes information/downloads Hermes author Romeo Anghelache’s and home page Living Reviews portal Living Reviews in Relativity MoWGLI Sample LRR article processed by Hermes Info/AboutLR/mowgli/article.xml Advertisement: Berlin 2 Open Access Meeting, May, at CERN. Theme: implementation of Berlin Declaration (2003).