PDSC: P2P Document Sharing Community Team No. 4 R91922001 黃振修 PM R91922020 羅婉琪 RD B89902012 葉家齊 RD R91725032 李宜儒 RD R91922015 張燕君 QA R91922028 張靜雯 QA Team.

Slides:



Advertisements
Similar presentations
Operating System.
Advertisements

® Microsoft Office 2010 Browser and Basics.
COURSE: COMPUTER PLATFORMS
1 CSCI N305 C Language Programming Welcome to CSCI N305! Compiling Your First Program Using Microsoft Visual Studio 2008.
Online School Registration System Solomon Ng Pei-Yu Wang Evan Chiu Curtis Wong.
Web Server Administration
(NHA) The Laboratory of Computer Communication and Networking Network Host Analyzer.
Website Development with PHP and MySQL Introduction.
How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
SUSE Linux Enterprise Server Administration (Course 3037) Chapter 1 Introduction to Managing the SUSE Linux Enterprise Server.
Browser and Basics Tutorial 1. Learn about Web browser software and Web pages The Web is a collection of files that reside on computers, called.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Installing software on personal computer
FIRST COURSE Computer Concepts Internet and Microsoft Office Get to Know Your Computer.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Hands-On Microsoft Windows Server 2008 Chapter 8 Managing Windows Server 2008 Network Services.
Sharepoint Portal Server Basics. Introduction Sharepoint server belongs to Microsoft family of servers Integrated suite of server capabilities Hosted.
Presented By: Shashank Bhadauriya Varun Singh Shakti Suman.
FTP. SMS based FTP Introduction Existing System Proposed Solution Block Diagram Hardware and Software Features Benefits Future Scope Conclusion.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
Microsoft Windows 2003 Server. Client/Server Environment Many client computers connect to a server.
Section 2.1 Identify hardware Describe processing components Compare and contrast input and output devices Compare and contrast storage devices Section.
Version Control with Subversion. What is Version Control Good For? Maintaining project/file history - so you don’t have to worry about it Managing collaboration.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Operating Systems Operating System
Virtual Mechanics Fall Semester 2009
Introduction To Computer System
One-To-One Instruction How to install MS Exchange with a two user setup on Windows Server 2003.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
 Introduction to Operating System Introduction to Operating System  Types Of An Operating System Types Of An Operating System  Single User Single User.
Course Presentation EEL5881, Fall, 2003 Project: Network Reliability Tests Project: Network Reliability Tests Team: Gladiator Team: Gladiator Shuxin Li.
Module 7: Fundamentals of Administering Windows Server 2008.
LinkWare LinkWare is a web-enabled, open platform for generation and distribution of electronic technical documentation and e–catalogues. The LinkWare.
Bonrix SMPP Client. Index Introduction Software and Hardware Requirements Architecture Set Up Installation HTTP API Features Screen-shots.
1 Distributed Systems: an Introduction G53ACC Chris Greenhalgh.
Tunis International Centre for Environmental Technologies Small Seminar on Networking Technology Information Centers UNFCCC secretariat offices Bonn, Germany.
Statistics Monitor of SPMSII Warrior Team Pu Su Heng Tan Kening Zhang.
FTP Server and FTP Commands By Nanda Ganesan, Ph.D. © Nanda Ganesan, All Rights Reserved.
Section 2 Section 2.1 Identify hardware Describe processing components Compare and contrast input and output devices Compare and contrast storage devices.
Hands-On Microsoft Windows Server Implementing Microsoft Internet Information Services Microsoft Internet Information Services (IIS) –Software included.
ABCD Forge Presentation to abcd-softdev Norton Allen DEAS/CCB March 29, 2006.
GAAIN Virtual Appliances: Virtual Machine Technology for Scientific Data Analysis Arihant Patawari USC Stevens Neuroimaging and Informatics Institute July.
Subject Outline Grade 12 Computer Studies HG. Final Mark 4 Components External Theory (140 marks 3hrs) External Practical (100 marks 3hrs) Project (60.
XP Browser and Basics COM111 Introduction to Computer Applications.
Mercury. One single online platform: Mercury Highlights – USP’s Web-based platform: accessible from any computer in any location without installing any.
Internet addresses By Toni Grey & Rashida Swan HTTP Stands for HyperText Transfer Protocol Is the underlying stateless protocol used by the World Wide.
DDM Central Catalogs and Central Database Pedro Salgado.
CITA 171 Section 1 DOS/Windows Introduction. DOS Disk operating system (DOS) –Term most often associated with MS-DOS –Single-tasking operating system.
January 2010 – GEO-ISC KickOff meeting Christian Gräf, AEI 10 m Prototype Team State-of-the-art digital control: Introducing LIGO CDS.
CACI Proprietary Information | Date 1 PD² SR13 Client Upgrade Name: Semarria Rosemond Title: Systems Analyst, Lead Date: December 8, 2011.
IPEmotion License Management PM (V1.2).
T Iteration Demo LicenseChecker I2 Iteration
Functions of Operating Systems V1.0 (22/10/2005).
IST 201 Chapter 11 Lecture 2. Ports Used by TCP & UDP Keep track of different types of transmissions crossing the network simultaneously. Combination.
XNAT 1.7: Getting Started 6 June, Introduction In this presentation we’ll discuss:  Features and functions in XNAT 1.7  Requirements  Installing.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
Architecture Review 10/11/2004
Fundamental of Databases
Section 2.1 Section 2.2 Identify hardware
Module 8: Networking Services
Hands-On Microsoft Windows Server 2008
Lecture: Protocols in Detail
MICROSOFT OUTLOOK and Outlook service Provider
T Project Review Group: pdm I2 Iteration
Introduction to Operating System (OS)
Computer Basics Section 2.1 YOU WILL LEARN TO… Identify hardware
HC Hyper-V Module GUI Portal VPS Templates Web Console
Information Technology Ms. Abeer Helwa
Sending data to EUROSTAT using STATEL and STADIUM web client
Presentation transcript:

PDSC: P2P Document Sharing Community Team No. 4 R 黃振修 PM R 羅婉琪 RD B 葉家齊 RD R 李宜儒 RD R 張燕君 QA R 張靜雯 QA Team No. 4 R 黃振修 PM R 羅婉琪 RD B 葉家齊 RD R 李宜儒 RD R 張燕君 QA R 張靜雯 QA

Introduction The original idea comes from research groups such as CML Laboratory of NTU. People want to share their document over Internet and need the functionality of keyword search. Thus we need a peer-to-peer mechanism for document exchange to achieve the goal of knowledge management. And we also need full text search to find/filter the sharing resources before downloading.

Features Peer-to-peer document sharing over Internet. Full text keyword searching / search result ranking within community. Direct document exchange by sending to and downloading from others. We developed our own URL format Ex: dsc://download/hostname/path/to/file

Market Requirement A simple application can be installed to connect to the community. Entering/leaving the community at any time. Sharing documents with each other. The sharing resources must keep up to date. Easy to see what's on the community. User can enter keywords to search the community for documents. User can direct send files with each other.

Project Roadmap Version 1.0 Basic functionality: Version 2.0 Duplication multi-copies in community Provide central backup mechanism Version 3.0 User management/authentication User acknowledge of document exchange More document formats will be supported in the future

Stage Goals Stage 1: Community browsing Stage 2: Search functionality Stage 3: Download/send file functionality

Schedule Notes 5/3: 黃振修 should finish the document digest module 5/10: 葉家齊 should finish the architecture prototype and server side protocol communication 5/12: 羅婉琪 should finish the client browsing functionality 5/10: QA finishes doc conversion testing (binary/code) 5/10: 李宜儒 should finish Win32 file hook mechanism 5/13: Download/send file should be OK 5/24: Document search QA finishes testing 5/24: The search result should be OK 5/28 ~ 6/4: Code freeze and final testing

Project Meetings Two types of meetings are defined: [PRJ]: Project meeting [DEV]: Developing meeting Meeting dates: [PRJ] 4/15, Tue. R319 of CSIE building [DEV] 4/23, Wed. R505 of CSIE building [DEV] 4/28, Mon, R505 of CSIE building [DEV] 4/29, Tue. R107 of CSIE building [DEV] 4/29, Tue. R105 of CSIE building [DEV] 5/6, Tue. R519 of CSIE building [PRJ] 6/9, Mon. R503 of CSIE building After 5/6, no formal meeting is held until the final. Instead, several small meetings are held in QAs and RDs; sometimes PM also calls RDs and QAs to cooperate.

Documentation MRD: Market requirement Document [PM] PRD: Project Requirement Document [PM] PED: Project Execution Document [PM] PDD: Project Development Document [RD] QAD: Quality Assurance Document [QA] BTD: Bug Tracking Document [QA] WDD: Working Discussion Document [PM] User’s Manual [QA]

Development Tools Microsoft VC Borland C++ Builder CVS for source control Central FTP server for file exchange Install Shield for SETUP program

Architecture Graphics User Interface KernelProtocol API for GUI Host Lookup Thread Server Thread Client Server Document Keyword Processor Database Local Shared File Database Host Database Task Database

Technical Notes (1/2) Pure peer-to-peer mechanism is implemented. Each application embeds both the client and server. (for the efficiency reason) When search request issued, the application will search its own document collection and then forward the message to other computers Dynamically monitoring of the sharing folder. Once the documents in the sharing folder are modified, the digest module will re-digest it real- time; keeping the latest information toward the community. See PDD for more detail

Technical Notes (2/2) Support three main document formats: MS Word, MS PowerPoint, and PDF files. (No Chinese support) Digest is the technique used to extract document’s feature vector. Searching is based on those digest vectors. An algorithm is developed to rate the searching and the result is ranked according to the points. Digest for the sharing documents are saved once exiting the program; only first time initialization is needed. See PDD for more detail

Demonstration

Testing Plans What is to be tested? Platform Network status Command File Conversion Download/Upload Where is going to be tested? Win32 environment, Windows 2000 OS PIII 500 CPU, 256 MB RAM, 100 Mbps ethernet See QAD for more detail

Testing Cases Document format conversion (binary tools testing) Document format conversion (integrated as program module, test for robustness and accuracy) P2P sharing community (test for the feature functionalities for UI program) The sharing module (test for the digest/searching and sharing folder monitoring) Setup program (test for the installer’s functionality) Performance report (memory usage, CPU utilization, memory leak) See QAD for more detail

Bug Tracking (1/2) Empty document files may cause fatal error Solved by check file completeness first. Some PDF file may cause the conversion module to get the wrong page number, causing fatal error. Check the validity of page number first. Duplication list when browsing Stupid bug Get file list waits too long Stupid bug See BTD for more detail

Bug Tracking (2/2) Download/sending file too slow Stupid bug (sleeping in the sending loop) Can not get file list/browsing when clients using DHCP Not solved because of the time limit. Keyword search in sharing folder do not recursively applied Solved by writing the recursive code Keyword search is too slow Improve the algorithm See BTD for more detail

Bug Statistics 5/35/105/246/86/95/7

Change Control History Change from client-server architecture to peer-to-peer architecture [4/23] Change the document digest from full-text to digest vector based. [5/6] Decide to allow recursively sharing in sharing folder [6/1]

Future Plan Version 2.0 Duplication multi-copies in community Provide central backup mechanism Version 3.0 User management/authentication User acknowledge of document exchange Bug fix and support for more document formats

The END Project Shipping Checklist: Source Code Include all surveyed components, CVS repository. Development Document MRD, PRD, PED, PDD, QAD, BTD, and WDD User’s Manual Presentation file Install Program Project CD with all the stuff