3/1/2004MSE Presentation I1 ESTMD System -- A Web-based EST Model Database System Yinghua Dong.

Slides:



Advertisements
Similar presentations
4. Internet Programming ENG224 INFORMATION TECHNOLOGY – Part I
Advertisements

Airline Reservation System
1 XSLT – eXtensible Stylesheet Language Transformations Modified Slides from Dr. Sagiv.
ELP Helper MSE Project Presentation II Aghsan Ahmad Major professor: Dr. Hankley.
1 Welcome to the Protein Database Tutorial This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Cocomo II Constructive Cost Model [Boehm] Sybren Deelstra.
Web applications using JavaServer Faces (JSF) A brief introduction 1JavaServer Faces (JSF)
Project Risks and Feasibility Assessment Advanced Systems Analysis and Design.
Soft. Eng. I, Fall 2006Dr Driss Kettani, from I. Sommerville1 CSC-3324: Chapter 6 Software Design Section 10.3 (except )
BTW (“By The Way…”) Information Annotation By Rudd Stevens, Jason Endo University of San Francisco.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Outline IS400: Development of Business Applications on the Internet Fall 2004 Instructor: Dr. Boris Jukic Server Side Web Technologies: Part 2.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
CSC 395 – Software Engineering
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
A Choice between Interoperability and Migration By Chaitanya Kurada Masters Defense Major professor: Dr.Daniel Andresen.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 13 Slide 1 Application architectures.
BRUE Behavioral Reverse Engineering in UML as Eclipse Plugin MSE Presentation 1 Sri Raguraman.
Systems Analysis And Design © Systems Analysis And Design © V. Rajaraman MODULE 14 CASE TOOLS Learning Units 14.1 CASE tools and their importance 14.2.
AgentTool (III) Dynamic MSE Presentation 1 Binti Sepaha.
1 ECE 453 – CS 447 – SE 465 Software Testing & Quality Assurance Lecture 22 Instructor Paulo Alencar.
COCOMO Models Ognian Kabranov SEG3300 A&B W2004 R.L. Probert.
Multi-agent Research Tool (MART) A proposal for MSE project Madhukar Kumar.
Planning and Writing Your Documents Chapter 6. Start of the Project Start the project by knowing the software you will write about, but you should try.
MSE Presentation 3 By Padmaja Havaldar- Graduate Student
© Copyright by Deitel & Associates, Inc. and Pearson Education Inc. All Rights Reserved. 1 Tutorial 30 – Bookstore Application: Client Tier Introducing.
4/16/2004MSE Presentation II1 ESTMD System -- A Web-based EST Model Database System Yinghua Dong.
MET280: Computing for Bioinformatics Introduction to databases What is a database? Not a spreadsheet. Data types and uses DBMS (DataBase Management System)
Lecturer: Prof. Piero Fraternali, Teaching Assistant: Alessandro Bozzon, Advanced Web Technologies: Struts–
March 28, 2001XSP Session O’Reilly Enterprise Java Conference 1 XSP Session Sue Spielman President/Consulting Engineer President/Consulting Engineer
Project Overview Graduate Selection Process Project Goal Automate the Selection Process.
Mastergoal Machine Learning Environment Phase 1 Completion Assessment MSE Project Kansas State University Alejandro Alliana.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Student Curriculum Planning System MSE Project Presentation I Kevin Sung.
Project Overview Graduate Selection Process Project Goal Automate the Selection Process.
Grup.bio.unipd.it CRIBI Genomics group Erika Feltrin PhD student in Biotechnology 6 months at EBI.
REAL TIME GPS TRACKING SYSTEM MSE PROJECT PHASE I PRESENTATION Bakor Kamal CIS 895.
Software Project Planning Part II. Troutman's Postulates Profanity is the one language understood by all programmers. Not until a program has been in.
Slide 12.1 Chapter 12 Implementation. Slide 12.2 Learning outcomes Produce a plan to minimize the risks involved with the launch phase of an e-business.
MSE Presentation 1 By Padmaja Havaldar- Graduate Student Under the guidance of Dr. Daniel Andresen – Major Advisor Dr. Scott Deloach-Committee Member Dr.
PatentScope - Electronic Publication World Intellectual Property Organization.
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
Using SWARM service to run a Grid based EST Sequence Assembly Karthik Narayan Primary Advisor : Dr. Geoffrey Fox 1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Database Java Bean Feng Chen. Outline Tools and approaches Project overview Major contribution Project plan Cost estimate References Acknowledgment.
Graphical User Interface and Job Distribution Optimizer for a Virtual Pipeline Simulation Testbed Walamitien Oyenan October 8, 2003 MSE Presentation 1.
CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis
Function Points Synthetic measure of program size used to estimate size early in the project Easier (than lines of code) to calculate from requirements.
Estimating “Size” of Software There are many ways to estimate the volume or size of software. ( understanding requirements is key to this activity ) –We.
Performance Study of Message Passing in an Event Service: Java RMI vs. TCP Sockets Laxminarayan Muktinutalapati (Lux) Department of Computing and Information.
MSE Presentation 1 Lakshmikanth Ganti
Preface IIntroduction Objectives I-2 Course Overview I-3 1Oracle Application Development Framework Objectives 1-2 J2EE Platform 1-3 Benefits of the J2EE.
Department of Computing and Information Sciences MSE Project Presentation 1 A Three-tier On-line Model For Transaction- based Applications Using VB.NET.
Communication Model for Cooperative Robotics Simulator MSE Presentation 1 Acharaporn Pattaravanichanon.
Kansas State University Purchasing Contracts Management System (KSU – PCMS) Presentation 1 Date : 14 th October 2010 By Arthi Subramanian CIS 895 – MSE.
Integration of BioInformatics tools at NUS. GenBank Growth Chart Year Bases.
Project Planning. Overview Planning and the software process Estimating duration and cost Software project management plan components Software project.
Welcome to the Protein Database Tutorial. This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Bogor-Java Environment for Eclipse
CMPE 412 Software Engineering
Systems Biology Tools for working with BIND data
Interactions and Ontologies
Design and Maintenance of Web Applications in J2EE
EXTENDING GENE ANNOTATION WITH GENE EXPRESSION
JavaServer Faces: The Fundamentals
COCOMO Models.
Jincheng Gao CIS895 – MSE Project
Database Java Beans Feng Chen 2019/7/3 MSE presentation III.
COCOMO MODEL.
Presentation transcript:

3/1/2004MSE Presentation I1 ESTMD System -- A Web-based EST Model Database System Yinghua Dong

3/1/2004MSE Presentation I2 Outline Project Overview Requirements Cost Estimation Project Plan Potential Risks Demonstration References Acknowledgments

3/1/2004MSE Presentation I3 Project Overview -- Objective Build a web-based, user-friendly Expressed Sequence Tags model database (ESTMD) system to help biology scientists search expression sequences and related information to make further decision

3/1/2004MSE Presentation I4 Project Overview -- Background ESTs: Expressed Sequence Tags, are partial sequences of randomly chosen cDNA, obtained from the results of a single DNA sequencing reaction. Typically, EST processing includes raw sequence cleaning, cleaned sequence assembling, and unique sequence annotation and functional assignment. Trace Files Raw (clone) sequences Cleaned (EST) sequences Assembled (unique) sequences Phred Cross_match & PERL program Cap3 Unique sequences with hit Blast

3/1/2004MSE Presentation I5 Project Overview -- Background (cont’d) Gene Ontology A set of controlled vocabularies used to describe biological features within a specified domain of biological knowledge. Gene Ontology describes the molecular functions, biological processes and cellular components of gene. Pathway The sequence of enzyme catalyzed reactions by which an energy-yielding substance is utilized by protoplasm.

3/1/2004MSE Presentation I6 Project Overview -- System Architecture Client Tier Responsible for presenting data, and receiving user inputs Application-server Tier Responsible for recording and abstracting business processes Data-server Tier Responsible for data storage Three-tier Architecture

3/1/2004MSE Presentation I7 Project Overview -- Technologies and Tools HTML with JavaScript will be used to build client interfaces Java Servlets, JSP (Java ServerPage) and JDBC will be used on the server-side XML and XSLT will be used to describe and present Gene Ontology tree structure MySQL4.0 is chosen as database management system

3/1/2004MSE Presentation I8 Project Overview -- Technologies and Tools (cont’d) JBuilder Enterprise9 is used as development tool Rational Rose is used to create UML models MS-Project is used for project plan Some verification and validation software (such as Alloy, USE, or SPIN) will be used for formal requirement specification

3/1/2004MSE Presentation I9 Project Overview -- E-R Model

3/1/2004MSE Presentation I10 Requirements

3/1/2004MSE Presentation I11 Requirements (cont’d) Search in Detail Users search detail information by gene name or symbol, sequence ID, FlyBase ID, or GenBank ID Users can decide the fields shown in the result The output format is html/text (A sample output is shown on the right side) unisequenceID: Contig1 uniSeq: CGCGGCCGCGTCGACGAGATTCGGAGGTTAG AAACATGACTCGCAAACGCCGTAATGGAGGA CGGGCTAAGCACGGCCGTGGCCACGTTAAGG CGGTGAGATGCACCAACTGCGCGCGTTGCGT GCCTAAGGACAAAGCTATCAAAAAGTTCGTG ATCAGGAATATTGTCGAAGCGGCTGCCGTCA GGGATATCAACGAAGCTTCCGTATATGCATC ATTCCAGCTGCCGAAGCTGTATGCAAAGCTC CACTACTGCGTCTCCTGCGCCATCCACAGCA AAGTTGTGCGCAACAGGTCTAAGAAGGACAG GAGAATCCGCACACCACCCAAGAGCACCTTC CCCAGGGACATGCAGCGCCCACAGAATGTGC AAAGGAAGTGAAGTGATTTACAATAAATTTT AAGAAAACCC flybaseID: FBgn evalue: 2.00E-49 hitLength: 114 bitScore: 190 identity: 93/115

3/1/2004MSE Presentation I12 Requirements (cont’d) Search by Keyword Users search the sequences at each stage by keyword The output includes sequence ID, length (with a link to sequence), gene name, symbol and a link to contig view image A sample output cloneIDRaw Length Cleaned Length Unisequence ID Unisequence Length Gene Name symbolContig View pb42ad- 1_001_a07.pb42primer Contig1413Ribosomal protein S26 RpS26View link pb42ad- 1_001_f07.pb42primer Contig1413Ribosomal protein S26 RpS26View link pyes2-ct_012_c12.p1ca291286Contig1413Ribosomal protein S26 RpS26View link pyes2-ct_034_h06.p1ca803398Contig1413Ribosomal protein S26 RpS26View link

3/1/2004MSE Presentation I13 Requirements (cont’d) Gene Ontology Search Users search gene ontology information by gene names, symbols, IDs, or a text file. The output is a table including GO ID, term, type, sequence ID, hit ID, and gene symbol. The hyperlinks on terms can show gene ontology tree structure. A sample output GO IDTermTypeSequence IDHit IDGene Symbol GO: protein biosynthesisBiological_processContig1FBgn RpS26 GO: cytosolic small ribosomal subunit (sensu Eukarya) Cellular_componentContig1FBgn RpS26 GO: ribosomeCellular_componentContig1FBgn RpS26 GO: structural constituent of ribosome Molecular_functionContig1FBgn RpS26

3/1/2004MSE Presentation I14 Requirements (cont’d) Gene Ontology Classification Users input a batch of gene names/symbols, or a local text file containing sequence IDs. Users can choose the gene ontology types which they want to classify. The output is a table including gene ontology type, subtype, sequence count, and percentage of sequences. A sample output typesubtypesequence_count% Cellular_componentcell375% Biological_processCell growth and/or maintenance375% Molecular_functionenzyme125% Molecular_functionProtein tagging125% Molecular_functionStructural molecule375%

3/1/2004MSE Presentation I15 Cost Estimation The effort of the project is estimated by Function Point Analysis (FPA) COCOMO II Model

3/1/2004MSE Presentation I16 Cost Estimation -- Function Point Analysis Unadjusted Function Points Function Type SimpleAverageComplexTotal UFP AmountWeightAmountWeightAmountWeight Inputs7 33 0 44 0 66 21 Outputs2 44 5 55 0 77 33 Inquires11 33 0 44 2 66 43 Files0 77 3  10 0  Interfaces1 55 1 77 0  Total UFP138

3/1/2004MSE Presentation I17 Cost Estimation -- Function Point Analysis (cont’d) Function Point Analysis Total Unadjusted Function Points (UFP) = 138 Product Complexity Adjustment (PC) = (0.01× 40) = 1.05 Total Adjusted Function Points (FP) = UFP × PC = Language Factor (LF) for Java assumed as 35 Source Lines of Code (SLOC) = FP × LF =

3/1/2004MSE Presentation I18 Cost Estimation -- COCOMO II For application programs: Delivered Source Instructions (KDSI) = Programmer Effect (PM) = 2.4 × (KDSI) 1.05 = 13.2 person-month Development Time in month (TDEV) = 2.5 × (PM) 0.38 = 6.66 months

3/1/2004MSE Presentation I19 Project Plan Phase I: Requirement ( 1/12/04 ~3/1/04) Phase II: Design (2/23/04 ~ 4/23/04) Phase III: Implementation and Test (4/26/04 ~ 7/30/04)

3/1/2004MSE Presentation I20 Project Plan (cont’d)

3/1/2004MSE Presentation I21 Potential Risks The requirements may change continually Some biology knowledge is needed Some new technologies, such as XML, XSLT, need to be leaned

3/1/2004MSE Presentation I22 Demonstration

3/1/2004MSE Presentation I23 References IEEE STD , IEEE Recommended Practice for Software Requirements Specifications, 1998 Edition, IEEE, 1998 IEEE Standard for SW Quality Assurance Plans (IEEE Std ) Walker Royce, Software Project Management -- A United Framework, 1998 Marty Hall, Core Servlets and JavaServer Pages, 2000 Roger. S. Pressman, Software Engineering: A practitioner’s Approach, 5 th Edition. Dr. Gustafson, CIS 540 lecture

3/1/2004MSE Presentation I24 Acknowledgments Committee: Dr. Mitchell L. Neilsen Dr. Gurdip Singh Dr. Daniel Andresen

3/1/2004MSE Presentation I25 Suggestions and Comments Thank You!