GMT files and Network are Related Content of GMT files can be converted to networks based on co-occurrence.

Slides:



Advertisements
Similar presentations
Network inference from repeated observations of node sets Neil Clark, Avi Ma'ayan.
Advertisements

Numerical Integration
Significance Testing.  A statistical method that uses sample data to evaluate a hypothesis about a population  1. State a hypothesis  2. Use the hypothesis.
+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Multi-label Relational Neighbor Classification using Social Context Features Xi Wang and Gita Sukthankar Department of EECS University of Central Florida.
Fast Algorithms For Hierarchical Range Histogram Constructions
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
1 Statistical Modeling  To develop predictive Models by using sophisticated statistical techniques on large databases.
Computational characterization of biomolecular networks in physiology and disease Kakajan Komurov, Ph.D Department of Systems Biology University of Texas.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Geographic Information Systems
Structure discovery in PPI networks using pattern-based network decomposition Philip Bachman and Ying Liu BIOINFORMATICS System biology Vol.25 no
Database Management: Getting Data Together Chapter 14.
Data Mining.
Fitting models to data. Step 5) Express the relationships mathematically in equations Step 6)Get values of parameters Determine what type of model you.
Protein Homology Detection Using String Alignment Kernels Jean-Phillippe Vert, Tatsuya Akutsu.
Biostatistics Frank H. Osborne, Ph. D. Professor.
8/28/97Information Organization and Retrieval Files and Databases University of California, Berkeley School of Information Management and Systems SIMS.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Predicting Matchability - CVPR 2014 Paper -
Automatic assignment of NMR spectral data from protein sequences using NeuroBayes Slavomira Stefkova, Michal Kreps and Rudolf A Roemer Department of Physics,
Role of editing and imputation in integration of sources for structural business statistics Svein Gåsemyr, Statistics Norway Svein Nordbotten, University.
Knowledge Integration for Gene Target Selection Graciela Gonzalez, PhD Juan C. Uribe Contact:
Cookies, Spreadsheets, and Modeling: Dynamic, Interactive, Visual Science and Math Scott A. Sinex Prince George’s Community College Presented at Network.
2015 AprilUNIVERSITY OF HAIFA, DEPARTMENT OF STATISTICS, SEMINAR FOR M.A 1 Hastie, Tibshirani and Friedman.The Elements of Statistical Learning (2nd edition,
PROBABILITY REVIEW PART 2 PROBABILITY FOR TEXT ANALYTICS Thomas Tiahrt, MA, PhD CSC492 – Advanced Text Analytics.
Conclusions As empirical data about sets of related entities accrues, there are more constraints on possible network realizations that can fit the data;
A Neural Network MonteCarlo approach to nucleon Form Factors parametrization Paris, ° CLAS12 Europen Workshop In collaboration with: A. Bacchetta.
An Introduction to Artificial Intelligence and Knowledge Engineering N. Kasabov, Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering,
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 1 Statistics: The Art and Science of Learning from Data Section 1.3 Using Calculators.
Abstract Background: In this work, a candidate gene prioritization method is described, and based on protein-protein interaction network (PPIN) analysis.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
, Patrik Huber.  One of our goals: Evaluation of the posterior p(Z|X)  Exact inference  In practice: often infeasible to evaluate the posterior.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Overview of Bioinformatics 1 Module Denis Manley..
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Understanding Network Concepts in Modules Dong J, Horvath S (2007) BMC Systems Biology 2007, 1:24.
Programming Practicum Day 3: Problem Solving with Graphs Aaron Tan NUS School of Computing.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Data Mining Consultant GlaxoSmithKline: US Pharma IT
Manuel Gomez Rodriguez Bernhard Schölkopf I NFLUENCE M AXIMIZATION IN C ONTINUOUS T IME D IFFUSION N ETWORKS , ICML ‘12.
Feature Extraction Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and.
Inference of Non-Overlapping Camera Network Topology by Measuring Statistical Dependence Date :
Peter Moore 10/05/051 ANN survival prediction for cancer patients Peter Moore High Energy Physics University of Manchester.
Fitting normal distribution: ML 1Computer vision: models, learning and inference. ©2011 Simon J.D. Prince.
Design and Analysis of Algorithms Introduction to graphs, representations of a graph Haidong Xue Summer 2012, at GSU.
Approximation by Constraints Why do we need approximation/interpolation? –Record values at some finite set of points in space and time. –Does not capture.
Regulatory Signatures Inferred From Gene Expression Data Jayanth (Jay) Krishnan SBCNY Fellow Mahopac High School Mount Sinai School of Medicine Bio-Engineering/Bioinformatics.
Your friend has a hobby of generating random bit strings, and finding patterns in them. One day she come to you, excited and says: I found the strangest.
Show & Tell Limsoon Wong Kent Ridge Digital Labs Singapore Role of Bioinformatics in the Genomic Era.
Biostatistics Class 2 Probability 2/1/2000.
Statistical Applications in Biology and Genetics
ATOM Accelerating Therapeutics for Opportunities in Medicine
CSSE463: Image Recognition Day 11
وكالة الكلية للجودة والتطوير
Mixture of Mutually Exciting Processes for Viral Diffusion
CS223 Advanced Data Structures and Algorithms
Discovering Larger Network Motifs
The Arc-Node Data Model
Volume 5, Issue 6, Pages e3 (December 2017)
Systems Biology Strikes Gold
Using clinical trial data as real-world evidence
Presentation transcript:

GMT files and Network are Related Content of GMT files can be converted to networks based on co-occurrence

A synthetic example Neil Clark Neil R. Clark, Ruth Dannenfelser, Christopher M. Tan, Michael E. Komosinski, and Avi Ma'ayan. Sets2Networks: network inference from repeated observations of sets. BMC Systems Biology. In press

Inference live!

Analytic Approximation When applying this approach to real data typically there are many nodes Sample space of networks can be very large -> computationally demanding We developed a simple analytical approximation which mimics the action of the algorithm. n ij,k is the size of the lines to which both vertices v i and v j belong

Validation of PPI Predictions Network inferred from 3290 IP/MS experiments next to known PPI network made from 20 online PPI databases

ROC Curves to Evaluate PPI Predictions

Validation of Predictions with CORUM Database of Protein Complexes

Inference of Statistical Relationships in the AERS Database Adverse Event Reporting System (AERS) database contains records of.... Patient ID 1Drug 1, Drug 2,...Side-effect 1, Side-effect 2,... Patient ID 2Dug 3, Drug 4,...Side-effect 3, Side effect 4,...…

AERS Large-scale Adjacency Matrix

AERS subnetwork Drugs Adverse Events

Mount Sinai Collaboration Network