IIC Information Flow Interesting ions? Priority list of interesting ions Empty priority list? QA/QC? Peptide identification Protein identification External.

Slides:



Advertisements
Similar presentations
Protein Quantitation II: Multiple Reaction Monitoring
Advertisements

1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
N-Glycopeptide Identification from CID Tandem Mass Spectra using Glycan Databases and False Discovery Rate Estimation Kevin B. Chandler, Petr Pompach,
Database Searches. Peptide mass fingerprinting digestMS Search HIT SCORE Protein X 1000 Protein Y 50 Protein Z 5 Protein X theoretical digestProtein Y.
Knowledge Enabled Information and Services Science What can SW do for HCLS today? Panel at HCSL Workshop, WWW2007 Amit Sheth Kno.e.sis Center Wright State.
Flying to the Top, One Tweet at a Time: Using Social Media to Rank Online Search Results Robyn B. Reed, MA, MLIS Co-authors: Carrie L. Iwema, PhD, MLS.
Proposal for a Standard Representation of the Results of GC-MS Analysis: A Module for ArMet Helen Fuell 1, Manfred Beckmann 2, John Draper 2, Oliver Fiehn.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Two main requirements: 1. Implementation Inspection policies (scheduling algorithms) that will extand the current AutoSched software : Taking to account.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
CSIT530 Projects -- 1 H.Lu/HKUST CSIT530: Suggested Projects  Three types of projects  System implementation  Literature survey  Research  General.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Introduction Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Introduction Facebook How does Facebook use your data? Where do you think.
Evaluated Reference MS/MS Spectra Libraries Current and Future NIST Programs.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
The CCE 5 th Annual Retreat Global Proteomics & Determination of Vitamin D Metabolites Update Jiri Adamec.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Business Process Performance Prediction on a Tracked Simulation Model Andrei Solomon, Marin Litoiu– York University.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
DOE Genomics: GTL Program IT Infrastructure Needs for Systems Biology David G. Thomassen Office of Biological and Environmental Research DOE Office of.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
An Ontology for Protein- Protein Interaction Data Karen Jantz CIS Honors Project December 7, 2006.
Introduction The GPM project (The Global Proteome Machine Organization) Salvador Martínez de Bartolomé Bioinformatics support –
Chapter 1 Introduction to Data Mining
Common parameters At the beginning one need to set up the parameters.
Methodological Foundations of Biomedical Informatics (BMSC-GA 4449) Kelly Ruggles & David Fenyo.
A Comprehensive Comparison of the de novo Sequencing Accuracies of PEAKS, BioAnalyst and PLGS Bin Ma 1 ; Amanda Doherty-Kirby 1 ; Aaron Booy 2 ; Bob Olafson.
Laxman Yetukuri T : Modeling of Proteomics Data
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
XML Standards for Proteomics Data Andrew Jones, Dr Jonathan Wastling and Dr Ela Hunt Department of Computing Science and the Institute of Biomedical and.
Proteomics databases for comparative studies: Transactional and Data Warehouse approaches Patricia Rodriguez-Tomé, Nicolas Pinaud, Thomas Kowall GeneProt,
Advanced Analytics on Hadoop Spring 2014 WPI, Mohamed Eltabakh 1.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Central dogma: the story of life RNA DNA Protein.
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
Reverse Interactomics
Bioinformatics Research Overview Outline Biomedical Ontologies oGlycO oEnzyO oProPreO Scientific Workflow for analysis of Proteomics Data Framework for.
Data Management Support for Life Sciences or What can we do for the Life Sciences? Mourad Ouzzani
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Advanced Database Concepts
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
High throughput biology data management and data intensive computing drivers George Michaels.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
ArrayExpress Ugis Sarkans EMBL - EBI
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Database Replication and Monitoring
Optimizing Biological Data Integration
Modern Data Management
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
A Unifying View on Instance Selection
Data Warehousing and Data Mining
Data Mining.
Analytics, BI & Data Integration
Presentation transcript:

IIC Information Flow Interesting ions? Priority list of interesting ions Empty priority list? QA/QC? Peptide identification Protein identification External Databases query Y N Y N N Step 1 Step 2 Step 3 sample N Y

IIC Architecture

Intelligent Instrument Control Algorithms design –Design of the MS simulator, Task 1: Hazem –Spectra Deconvolution (Data filters and noise removal) Task 2: Mohamed Eltabakh –Protein/peptide identification Task3: Mingwu –Other simple algorithms, e.g., priority list, –IIC design and architecture (API, … ) Dr. Ahmed –Integrated Access to external databases ( protDB to support identification and other BioDBs to support correlation with other information, for example interactive proteins, related metabolites, etc.) Experimental Simulation –In silico generation of spectrum, Task 1: Hazem –Algorithms implementation (simulation)

Intelligent Instrument Control Integration with the instrument –Data collection (API) –Control of the instrument (API) –Actual implementation (algorithms) –Database design: Data representation (streams, database) Optimized storage of massive data –Raw data –Analyzed data Experimental settings Dr. –Selection of a biology system, e.g., yeast –Two types of experiments Target analysis Global analysis

Intelligent Instrument Control Prediction of upcoming peaks ( need more data to build math model ) Online data mining

Integrated Access to External Bio-databases Context: Informatics tools –Glycosylated peptide identification –Non-glycosylated peptide identification Goal: Enabling uniform access to different protein databases

Integrated Access to External Bio-databases Tasks –Database description and organization, and Schema mediation –Query Processing –Data correlation E.g., Sequence alignment Non-overlapping schemas Contradictory information –Web service enabled access Settings: Target databases selection (focus)