Project Tukaram Sagar Tamhane

Slides:



Advertisements
Similar presentations
The creation of "Yaolan.com" A Site for Pre-natal and Parenting Education in Chinese by James Caldwell DAE Interactive Marketing a Web Connection Company.
Advertisements

Technical and design issues in implementation Dr. Mohamed Ally Director and Professor Centre for Distance Education Athabasca University Canada New Zealand.
June 2004 Adil Allawi Technical Director
CAPTURE SOFTWARE Please take a few moments to review the following slides. Please take a few moments to review the following slides. The filing of documents.
CAPTURE SOFTWARE Please take a few moments to review the following slides. Please take a few moments to review the following slides. The filing of documents.
Features and Uses of a Multilingual Full-Text Electronic Theses and Dissertations (ETDs) System Yin Zhang Kent State University Kyiho Lee, Bumjong You.
The Marathi Portal with a Search Engine Center for Indian Language Technology Solutions, IIT Bombay.
Caching the MDSPlus Data via Hibernate By Ajith M Jose Comp6703 Project Client: Raju Karia Supervisor: Dr. Henry Gardner (Development of “WebScope”)
Information Retrieval in Practice
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Design of Web-based Systems IS Development: lecture 10.
©TheMcGraw-Hill Companies, Inc. Permission required for reproduction or display. COMPSCI 125 Introduction to Computer Science I.
Apache Tomcat Server – installation & use Server-side language-- use Java Server Pages Contrast Client-side languages HTML Forms Servers & Server-side.
Multiple Tiers in Action
Internationalization of Java Platform Presenter: Ataru Nakazawa Advisor: Xiaoping Jia Date: January 23, 2004.
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
Introduction to Web-Based Systems HTML, XML, and JavaScript.
CS 0008 Day 2 1. Today Hardware and Software How computers store data How a program works Operators, types, input Print function Running the debugger.
Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.
Modular InfoTech’s Modular Infotech is proud to offer Tools and Components enabled with Indian language so as to address each & every client located across.
CHAPTER 9 Using the World Wide Web. OBJECTIVES 1.Describe the Internet and the World Wide Web 2.Define related Internet terms 3.Explain the components.
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Reading Aid for Visually Impaired Veera Raghavendra, Anand Arokia Raj, Alan W Black, Kishore Prahallad, Rajeev Sangal Language Technologies Research Center,
Similar Document Retrieval and Analysis in Information Retrieval System based on correlation method for full text indexing.
Implementation Issues Mark Davis Properties.
Third Conference December, Basm 28 Years >450,000 Terminologies >250 Scientific field.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
Your Search for Indian languages ends at Modular InfoTech, Pune Web-Samhita from Modular InfoTech Pvt. Ltd. Modular InfoTech is proud to offer various.
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
JavaScript Dynamic Active Web Pages Client Side Scripting.
By R. O. Nanthini and R. Jayakumar.  tools used on the web to find the required information  Akeredolu officially described the Web as “a wide- area.
Invitation to Computer Science 6 th Edition Chapter 10 The Tower of Babel.
VIVO architecture March 1, Major Components Vitro is a general-purpose Web-based application leveraging semantic standards VIVO is a customized.
Basics Components of Web Design & Development Basics, Components, Design and Development.
A Presentation Presentation On JSP On JSP & Online Shopping Cart Online Shopping Cart.
Information Retrieval in Practice
Building Library Web Site Using Drupal
Web Page Introduction.
Search Engine Optimization
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
WWW and HTTP King Fahd University of Petroleum & Minerals
Search Engine Architecture
Publishing on JACoW I ask for a single file that I can download or a CD which contains a complete set of files for publication. The internet is good enough.
Chapter Five Web Search Engines
CIW Lesson 6 Web Search Engines.
Prepared by Rao Umar Anwar For Detail information Visit my blog:
Dynamic Web Pages (Flash, JavaScript)
Design and Maintenance of Web Applications in J2EE
Web Content FileSystem
Chapter 7 - JavaScript: Introduction to Scripting
Technology Development
Chapter 27 WWW and HTTP.
Automatic Language Identification – A Syntactic Approach
JavaScript: Introduction to Scripting
Declarative Creation of Enterprise Applications
How to Improve Releasing Efficiency via i18N/L10n Test Automation.
Database Connectivity and Web Development
Centre For Indian Language Technology
Back end Development CS Programming Languages for Web Applications
<Text> <Text> What is Web Content? <align left>
Chapter 7 - JavaScript: Introduction to Scripting
PHP Forms and Databases.
Chapter 7 - JavaScript: Introduction to Scripting
Client-Server Model: Requesting a Web Page
CS/SE ADVANCED SOFTWARE ARCHITECTURE AND DESIGN FALL 2015
Back end Development CS Programming Languages for Web Applications
Chapter 7 - JavaScript: Introduction to Scripting
SDMX IT Tools SDMX Registry
Конференција на МАКС, 17 јуни
Presentation transcript:

Project Tukaram Sagar Tamhane Centre for Indian Language Technology Solutions IIT Bombay 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions The Goal To make Saint Tukaram’s Abhangas available over web for browsing and searching Locate the right Abhangas that you need. Present the pages to the user in an order of importance. 12 June 2002 Center For Indian Language Technology Solutions

“EaI tukaramabaavaaMcyaa ABaMgaaMcaI gaaqaa” The Source The Abhangas are typed from a book called “EaI tukaramabaavaaMcyaa ABaMgaaMcaI gaaqaa” published on 6th November 1973 by the Govt. of Maharashtra Previous editions: 1950 and 1955. Number of Abhangas: 4644 12 June 2002 Center For Indian Language Technology Solutions

Creation of Web Content Software used for typing: MS Word with Akruti_Priya_Expanded font and Akruti keyboard driver Problems faced: Non displayable characters Eg: This was typed as mna Automated page splitting 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions Converters Used Akruti_Priya_Expanded ISCII converter: required for indexing the text ISCII Monolingual ISFOC converter: required for displaying the text through DV-TTYogesh XDVNG ISCII: for query strings to ISCII 12 June 2002 Center For Indian Language Technology Solutions

Technologies used for the Tukaram Search Engine Input Technology: Jtrans: XDVNG font Keyboard Mapping: Phonetic English Result Display at client: ISFOC Encoding for indexing (storage): ISCII 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions Architecture 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions Input Technology 12 June 2002 Center For Indian Language Technology Solutions

Components of the Search Engine Index Case sensitive ISCII Database structure Searcher In-memory search Algorithm: Hybrid of Hashing & Binary search 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions Database Structure 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions Snap shot of result 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions Relevancy Criteria Number of query words in the abhang Position Adjacency Total number of words in the abhang 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions General information Number of abhangas : 4,644 Total number of words : 2,09,702 Number of distinct words : 34,773 Languages used for converters: Lex & C Language used for search engine: Java 2 Scripting on client side : JavaScript 12 June 2002 Center For Indian Language Technology Solutions

Center For Indian Language Technology Solutions Thank You 12 June 2002 Center For Indian Language Technology Solutions