CyberMiner Software Architecture Group

Slides:



Advertisements
Similar presentations
Fatma Y. ELDRESI Fatma Y. ELDRESI ( MPhil ) Systems Analysis / Programming Specialist, AGOCO Part time lecturer in University of Garyounis,
Advertisements

LIS618 lecture 6 Thomas Krichel structure DIALOG –basic vs additional index –initial database file selection (files) Lexis/Nexis.
Chapter 13 Review Questions
SOFTWARE ARCHITECTURE FOR CYBERMINER GLOBAL 14. Agenda Problem, Process and Solution Phase-wise Discussion o Requirements Analysis o Architecture o Quality.
Stefania Bergamasco, Cecilia Colasanti An integrated approach to turn statistics into knowledge combining data warehouse, controlled vocabularies and advanced.
Programming Logic and Design Fourth Edition, Introductory
Hestia: Aarthi Giridharan Govindarajan Panneerselvam Nirmal Chander Shesha Chandrika Moka Sriram Rajendran Suryanarayanan Ganesh Melarkode Vignesh Swaminathan.
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
Web Search – Summer Term 2006 VI. Web Search - Indexing (c) Wolfgang Hürst, Albert-Ludwigs-University.
“ The Anatomy of a Large-Scale Hypertextual Web Search Engine ” Presented by Ahmed Khaled Al-Shantout ICS
Anatomy of a Large-Scale Hypertextual Web Search Engine (e.g. Google)
© nCode 2000 Title of Presentation goes here - go to Master Slide to edit - Slide 1 Anatomy of a Large-Scale Hypertextual Web Search Engine ECE 7995: Term.
The Anatomy of a Large-Scale Hypertextual Web Search Engine Sergey Brin and Lawrence Page.
Presentation by: Martin Hoffman
Online Magazine Bryan Ng. Goal of the Project Product Dynamic Content Easy Administration Development Layered Architecture Object Oriented Adaptive to.
Project Title: Cobra Implementation on Association Service.
Architectural styles and Case studies 1 | Website for Students | VTU NOTES | QUESTION PAPERS | NEWS | RESULTS.
1 On the Criteria To Be Used in Decomposing Systems into Modules by D.L.Parnas Dec presented by Yuanhua Qu for spring 2003 CS5391.
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
Using Visual Basic 6.0 to Create Web-Based Database Applications
Architecture styles Pipes and filters Object-oriented design Implicit invocation Layering Repositories.
The Anatomy of a Large-Scale Hypertextual Web Search Engine Presented By: Sibin G. Peter Instructor: Dr. R.M.Verma.
ECE450 - Software Engineering II1 ECE450 – Software Engineering II Today: Key Principles of Software Architecture and Design (I) adapted from Dave Penny’s.
TOPIC CENTRIC QUERY ROUTING Research Methods (CS689) 11/21/00 By Anupam Khanal.
NoteSearch - Find what you’re looking for. Prototype Team B.
The Anatomy of a Large-Scale Hypertextual Web Search Engine Sergey Brin & Lawrence Page Presented by: Siddharth Sriram & Joseph Xavier Department of Electrical.
INF 212 Modularity Instructors: Crista Lopes Copyright © Instructors.
The Anatomy of a Large-Scale Hyper textual Web Search Engine S. Brin, L. Page Presenter :- Abhishek Taneja.
Keyword Searching Weighted Federated Search with Key Word in Context Date: 10/2/2008 Dan McCreary President Dan McCreary & Associates
SOFTWARE ARCHITECTURE FOR A KWIC SYSTEM TEAM: GLOBAL 14.
“Architecture” The outcome of top-level design, reflecting principal design decisions Can (and should) be modified and updated Analogous to architecture.
Introduction to Files in VB Chapter 9.1, 9.3. Overview u Data Files  random access  sequential u Working with sequential files  open, read, write,
Comparing Designs By Chris McCall. Comparing Designs A decision-making method for deciding between many designs for a single specification Provides a.
Keyword In Context Presented by Team 2 Francisco Soria Zhiyu Zhang Xuheng Xu Irene Peysakhov.
1 Google: Case Study cs430 lecture 15 03/13/01 Kamen Yotov.
Chapter 1: Preliminaries Lecture # 2. Chapter 1: Preliminaries Reasons for Studying Concepts of Programming Languages Programming Domains Language Evaluation.
The Anatomy of a Large-Scale Hypertextual Web Search Engine (The creation of Google)
The Anatomy of a Large-Scale Hyper-textual Web Search Engine 전자전기컴퓨터공학과 G 김영제 Database Lab.
1 Software Requirements Descriptions and specifications of a system.
Presented By: Carlton Northern and Jeffrey Shipman The Anatomy of a Large-Scale Hyper-Textural Web Search Engine By Lawrence Page and Sergey Brin (1998)
KWIC example The KWIC [key word in context] system accepts an ordered set of lines; each line is an ordered set of words, and each word is an ordered set.
WHAT IS COMPUTER ? . A computer is a complex system consisting of both hardware and software components.
The purpose of a CPU is to process data Custom written software is created for a user to meet exact purpose Off the shelf software is developed by a software.
Architecture Review 10/11/2004
Running a Forms Developer Application
CS 325: Software Engineering
CS522 Advanced database Systems
On the Criteria to Be Used in Decomposing Systems into Modules
Methods and Apparatus for Ranking Web Page Search Results
Chapter 19 PHP Part III Credits: Parts of the slides are based on slides created by textbook authors, P.J. Deitel and H. M. Deitel by Prentice Hall ©
Software Engineering Architectural Design Chapter 6 Dr.Doaa Sami
Tutorial on BoxScript: A Component-Oriented Language
Week 12 Option 3: Database Design
Web Engineering.
CHAPTER 3 Architectures for Distributed Systems
Prepared by Charlie Meyer, May 2009
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
CYBERMINER WEB SEARCH ENGINE
Chapter 5 Designing the Architecture Shari L. Pfleeger Joanne M. Atlee
Ashutosh Rana Rahul Nori 7/17/2018
CS 425/625 Software Engineering Architectural Design
Introduction to Computer Concept
Software Design Lecture : 8
Advanced UNIX progamming
Spreadsheets, Modelling & Databases
Final Project Presentation
PHASE 1 - PRESENTATION Presented By - Karthik Kannambadi Sridhar
Information Retrieval and Web Design
CS/SE ADVANCED SOFTWARE ARCHITECTURE AND DESIGN FALL 2015
Module Structure David Parnas Discusses “modularization”
Presentation transcript:

CyberMiner Software Architecture Group Kimberly West, Nadia Noori, Stanislav Minkevych Basic Goal : Web Search Engine that : Accepts list of keywords Returns list of URLs whose description contains any of the given keywords Uses KWIC Key Word In Context to maintain database of URL & description

Requirements Specification Functional : After input, the descriptor part of the line is circularly shifted by repeatedly removing the first word and appending it to the end of the line Outputs a list of all circular shifts of the descriptor parts of all lines in alphabetically ascending order, together with their corresponding URLs No noise words such as “a”, “the”, or “of” at the start of output list lines Grow indices with possible later additions

Requirements Specification Non-Functional : Easily Understood & Used – clear use capabilities, features, simplicity to design Portability/ Reuse – not restricted to certain operating systems, machines, or certain developers, anyone can use the system & understand its architecture to adapt it to their environment, few system limitations Traceability – object oriented style using abstract data types, each process is linked to a specific individual module Good Performance & Responsive – readily & easily reacts to changes, output to input ratio, time factor

Components & Connections : Indexing Repository contains the full HTML of every web page documents are stored one after the other and are prefixed by ID, length, and URL requires no other data structures to be used in order to access it (helps with data consistency and makes development easier) Index keeps information about each document, is a fixed width index, ordered by docID contains current document status, pointer into the repository, a document checksum, various statistics If the document has been crawled, also contains a pointer into a variable width file called docinfo which contains its URL and title Otherwise the pointer points into the URL list which contains just the URL

Line Storage Create, access, and possibly delete character, words, and lines listens for InputEvent using the interface LSListener Store the lines LineStorage generates event called LSEvent

Line Storage Procedure setchar (l-line, w-word, c-char, a) Function char (l-line, w-word, c-char) returns an character representing the c-th character in the w-th word of l-th line return blank if out-of-range Function word ( l-line) returns the number of words in line l

Subprogram call System I/O Implicit invocation Master Control Line Storage Alphabetizing Control Input Input medium Output Output medium Circular Shift Searcher

CyberMiner Engine Searches indexed keywords Uses Boolean arguments Case-sensitivity selector