Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.

Slides:



Advertisements
Similar presentations
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Advertisements

T-FLEX DOCs PLM, Document and Workflow Management.
Chapter 3 Database Management
Chapter 9 Describing Process Specifications and Structured Decisions
11 3 / 12 CHAPTER Databases MIS105 Lec14 Irfan Ahmed Ilyas.
Software Engineering For Beginners. General Information Lecturer, Patricia O’Byrne, office K115A. –
By: Bihu Malhotra 10DD.   A global network which is able to connect to the millions of computers around the world.  Their connectivity makes it easier.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
Lecturer: Ghadah Aldehim
The Software Development Cycle Defining and understanding the problem.
Basic tasks of generic software Chapter 3. Contents This presentation covers the following: – The basic tasks of standard/generic software including:
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.
Systems Analysis And Design © Systems Analysis And Design © V. Rajaraman MODULE 14 CASE TOOLS Learning Units 14.1 CASE tools and their importance 14.2.
Chapter 5 Lecture 2. Principles of Information Systems2 Objectives Understand Data definition language (DDL) and data dictionary Learn about popular DBMSs.
Computers & Employment By Andrew Attard and Stephen Calleja.
Computers Are Your Future Tenth Edition Chapter 12: Databases & Information Systems Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall1.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
COMPUTER PROGRAMMING Source: Computing Concepts (the I-series) by Haag, Cummings, and Rhea, McGraw-Hill/Irwin, 2002.
Fundamentals of Information Systems, Fifth Edition
1 California State University, Fullerton Chapter 8 Personal Productivity and Problem Solving.
9 Chapter Nine Compiled Web Server Programs. 9 Chapter Objectives Learn about Common Gateway Interface (CGI) Create CGI programs that generate dynamic.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
WHAT IS A SEARCH ENGINE A search engine is not a physical engine, instead its an electronic code or a software programme that searches and indexes millions.
Search Engine Interfaces search engine modus operandi.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
Universiti Utara Malaysia Chapter 3 Introduction to ASP.NET 3.5.
© 2001 Business & Information Systems 2/e1 Chapter 8 Personal Productivity and Problem Solving.
Lead Black Slide Powered by DeSiaMore1. 2 Chapter 8 Personal Productivity and Problem Solving.
Internet tool to find answers to poorly defined questions SmartNet © ITC Software,
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
The Anatomy of a Large-Scale Hyper textual Web Search Engine S. Brin, L. Page Presenter :- Abhishek Taneja.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
Software Development Life Cycle by A.Surasit Samaisut Copyrights : All Rights Reserved.
SmartReport Backend Reporting Tool © 2003 ITC Software
Search Tools and Search Engines Searching for Information and common found internet file types.
Search Engines By: Faruq Hasan.
CPT 499 Internet Skills for Educators Session Three Class Notes.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Information Retrieval
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
Web 2.0: Making the Web Work for You, Illustrated Unit A: Research 2.0.
G042 - Lecture 09 Commencing Task A Mr C Johnston ICT Teacher
T EST T OOLS U NIT VI This unit contains the overview of the test tools. Also prerequisites for applying these tools, tools selection and implementation.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Types Pros & cons.  A program for the retrieval of data, files, or documents from a database or network, esp. the Internet.  Search engines usually.
General Architecture of Retrieval Systems 1Adrienn Skrop.
Introduction to Computer Programming Concepts M. Uyguroğlu R. Uyguroğlu.
CASE Tools and their Effect on Software Quality
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
Seminar on seminar on Presented By L.Nageswara Rao 09MA1A0546. Under the guidance of Ms.Y.Sushma(M.Tech) asst.prof.
Managing Data Resources File Organization and databases for business information systems.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
PLM, Document and Workflow Management
Tools of Software Development
Chapter 1 Introduction(1.1)
Chapter 11 user support.
Chapter 11 Describing Process Specifications and Structured Decisions
Spreadsheets, Modelling & Databases
T-FLEX DOCs PLM, Document and Workflow Management.
INTELLIGENT BROWSERS Cenk Ursavas.
Lesson 2: Gathering and Organizing Information Using ICT KEY QUESTION: HOW DO YOU GATHER AND ORGANIZE INFORMATION USING THE COMPUTER AND INTERNET?
Presentation transcript:

Avalanche Internet Data Management System

Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche features and advantages 5. Avalanche detailed description 6. Instruments and technologies used

Internet Surfers This task is: To gather and to store Web- information. These groups are: Regular Internet users collecting information on their hobby (basketball news, cooking recipes, pets info, etc.) Analysts with the task to gather and sort Internet data (e.g. for Gartner Group, Bloomberg or IDC). There are two different Internet users groups having to fulfill the same task day by day.

Step 1 to solve the task 1. User needs to run some search or meta-search engine (e.g. Google, Yahoo, Copernic) and define the search query. Let’s keep in mind that different search engines have different syntactic rules for building the request and they return very different results for the same request. So, to make the search more or less complete one needs to repeat it several times with different search and meta-search engines with different syntactic rules to build the requests.

Steps 2, 3 to solve the task 2. User needs to look through each screen of each output of each search engine thoroughly to filter only the sites with the information that seems to be what he is looking for. 3. User needs to validate each of the filtered connections to understand whether they are alive or not.

Steps 4, 5 to solve the task 4. User needs to enter each of the sites that have passed validation procedure and to load its content to his local computer. 5. User needs to check few more links at each of the sites to load the content of the linked sites that is interesting to him.

Steps 6, 7 to solve the task 6. After downloading all the data needed one has to make few steps offline. First of all he has to examine all the downloaded files thoroughly to place each of them to the corresponding subfolder of his file system folder designated to store files downloaded from Internet. 7. Now, to find any file by keywords among the files stored user could only use standard Windows search system of very limited abilities (no hyperlinks, no cookies, etc.).

Conclusion It was an absolutely fair description of the steps every user should take each day to get and to use the information he needs. Use of some helpful tools and hints (iHarvest software, Telnet software, MyYahoo module, schedulers, etc.) does not change the situation substantially.

Special tool needed Nowadays market lacks software that would be designated to do the following: Search for information through the Web on regular basis. Try links found and filter Internet content. Collect filtered data. Classify collected data. Store classified data providing the ways of flexible and comfortable access to stored data.

Why is there no software like this now? Each of the existing software packages solves the problem partially (covering little part of the problem). A software tool to solve the problem as a whole should be considerably complex. It should combine modules of substantially different functionality: Surfing Web and downloading Internet-content Classifying downloaded information Storing data with comfortable access to it Complexity of some of these modules is usual programming complexity, and the task of classifying is not an easy mathematical task.

We did it! We did it! We have developed a software system called Avalanche Avalanche is an Internet Data Management System. IDMS Avalanche contains a number of new generation tools for: knowledge mining; knowledge storing; knowledge representing.

Avalanche has a number of competitive advantages Avalanche beats main competitors in: Extended syntactic data search Automatic filtration of data found Semantic data classification

Avalanche is a single product with a number of logically connected functions Syntactic and semantic definition of necessary information. Means of scheduled data search in WWW. Semantic filtration and classification of incoming data. Means of creating user’s personal encyclopedia.

Syntactic and semantic definition of necessary information Avalanche includes Internet Classifier that provides tools for building the Semantic Catalogue. This Catalogue defines the structure of necessary information. The folder in the Semantic Catalogue to place new document is defined in terms of: presence or absence of certain words and phrases in the new document; computable proximity of new document to number of sample documents.

Example of syntactic and semantic definition

Means of scheduled data search in World Wide Web Avalanche includes Internet Spider that provides: scheduled automatic search of requested information in the Web; automatic links following; automatic validation of links found; copying of found information from Internet to the user’s local computer.

Example of scheduled data search

Semantic filtration and classification of incoming data Avalanche Internet Classifier provides: Automatic classification of copied information in accordance with the Semantic Catalogue structure. Storage of classified information. Information is stored on the local computer in an efficient way. Re-classification of stored information. You can change your mind and reclassify information already received from Internet.

Example of semantic filtration and classification

Means of creating user’s personal encyclopedia Avalanche includes Knowledge Database that provides creation and management of user’s personal encyclopedia built as a local Internet site for adequate description and convenient maintenance of information stored.

Example of creating user’s personal encyclopedia

Avalanche is a well-structured product Avalanche consists of: Internet Spider to find necessary information Internet Classifier for automatic semantic filtering of data found Knowledge Database representing convenient mini- encyclopedia to deal with found and filtered information

Avalanche is a flexible and scalable product Avalanche could be a good fit either for expert’s analytical work or for common user’s Internet surfing.

Instruments and technologies Avalanche algorithms for data classification and texts proximity evaluation are developed on the strong mathematical basis. Avalanche is developed with the proven technology that means following the standards for all stages of project maintenance, programming and testing.

Different parts of Avalanche have been designed and developed using most up-to-date and efficient tools and algorithms. User interfaces have been developed using Borland RAD tools. Core code is written using object-oriented approach which makes Avalanche highly configurable and flexible. Class design has been developed using Rational Rose tools, which are considered to be the best OOP-design tools nowadays. Database is designed and optimized to Normal Form III, that’s why data is stored efficiently, without any redundancy. Data integrity is declared and applied on database level. Dictionary and document searching is optimized by using latest hashing and caching algorithms combined with the direct dictionary access. Instruments and technologies