Gene-Disease Associations Based on network

Slides:



Advertisements
Similar presentations
Analysis of High-Throughput Screening Data C371 Fall 2004.
Advertisements

Co Training Presented by: Shankar B S DMML Lab
Perceptron Learning Rule
+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Olivier Duchenne , Armand Joulin , Jean Ponce Willow Lab , ICCV2011.
1 Statistical Modeling  To develop predictive Models by using sophisticated statistical techniques on large databases.
CGeMM – University of Louisville Mining gene-gene interactions from microarray data - Coefficient of Determination Marcel Brun – CGeMM - UofL.
The multi-layered organization of information in living systems
Computational characterization of biomolecular networks in physiology and disease Kakajan Komurov, Ph.D Department of Systems Biology University of Texas.
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E Molecular Portraits of Cancer Microarrays and Cell Biology.
T Monday, June 15, 2015Monday, June 15, 2015Monday, June 15, 2015Monday, June 15, 2015.
Lesson 8: Machine Learning (and the Legionella as a case study) Biological Sequences Analysis, MTA.
Discrimination Methods As Used In Gene Array Analysis.
Semi-Supervised Clustering Jieping Ye Department of Computer Science and Engineering Arizona State University
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Radial-Basis Function Networks
1 Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data Presented by: Tun-Hsiang Yang.
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Learning from Imbalanced, Only Positive and Unlabeled Data Yetian Chen
Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Networks and Interactions Boo Virk v1.0.
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
NEURAL NETWORKS FOR DATA MINING
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Mining Social Networks for Personalized Prioritization Shinjae Yoo, Yiming Yang, Frank Lin, II-Chul Moon [KDD ’09] 1 Advisor: Dr. Koh Jia-Ling Reporter:
Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.
Meng-Han Yang September 9, 2009 A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins.
Data Structures & Algorithms Graphs
N. Saoulidou & G. Tzanakos1 ANN Basics : Brief Review N. Saoulidou, Fermilab & G. Tzanakos, Univ. of Athens.
Advisor : Prof. Sing Ling Lee Student : Chao Chih Wang Date :
Understanding Network Concepts in Modules Dong J, Horvath S (2007) BMC Systems Biology 2007, 1:24.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
Date: 2014/05/27 Author: Xiangnan Kong, Bokai Cao, Philip S. Yu Source: KDD’13 Advisor: Jia-ling Koh Speaker: Sheng-Chih Chu Multi-Label Classification.
1 A text-mining analysis of the human phenome Marc A van Driel 1, Jorn Bruggeman 2, Gert Vriend 1, Han G Brunner *,3 and Jack AM Leunissen 2 European Journal.
A Pathway-Based View of Human Diseases and Disease Relationships Li Y, Agarwal P (2009) A Pathway-Based View of Human Diseases and Disease Relationships.
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
1 Classification and Feature Selection Algorithms for Multi-class CGH data Jun Liu, Sanjay Ranka, Tamer Kahveci
A comparative study of survival models for breast cancer prognostication based on microarray data: a single gene beat them all? B. Haibe-Kains, C. Desmedt,
GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function Sara Mostafavi, Debajyoti Ray, David Warde-Farley,
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features 王荣 14S
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Bioinformatics: Cool stuff you can do with Computers and Biology Oded Magger Tel Aviv University / Autodesk inc. GIP course 2010.
Hybrid Classifiers for Object Classification with a Rich Background M. Osadchy, D. Keren, and B. Fadida-Specktor, ECCV 2012 Computer Vision and Video Analysis.
Combining multiple learners Usman Roshan. Decision tree From Alpaydin, 2010.
Privacy Preserving in Social Network Based System PRENTER: YI LIANG.
Ubiquitination Sites Prediction Dah Mee Ko Advisor: Dr.Predrag Radivojac School of Informatics Indiana University May 22, 2009.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Semi-Supervised Clustering
Trees, bagging, boosting, and stacking
Analysis of bio-molecular networks through RANKS (RAnking of Nodes
Final Year Project Presentation --- Magic Paint Face
Learning with information of features
Chapter 3. Artificial Neural Networks - Introduction -
Multitask Learning Using Dirichlet Process
Figure S1: Gene importance plot derived from Variable/ Feature selection using machine learning on the training dataset. MeanDecreaseGini is the measure.
The Impact of Network Medicine in Gastroenterology and Hepatology
ADAGE model example. ADAGE model example. For one sample in the expression compendium (one column in the figure with red or green colors, representing.
Volume 10, Issue 4, Pages (October 2011)
HeteroMed: Heterogeneous Information Network for Medical Diagnosis
Phylogenetic comparison among selected Pasteurella multocida and Haemophilus influenzae species with completed genome sequences. Phylogenetic comparison.
Heterogeneous Graph Convolutional Network
Learning to Detect Human-Object Interactions with Knowledge
Presentation transcript:

Gene-Disease Associations Based on network 报告人:李金金

Heterogeneous network Contents Background 1 Heterogeneous network 2 Methods 3

Background Correctly identifying association of genes with diseases has long been a goal in biology. Identifying association of genes with diseases has contributed to improving medical care and understanding of gene functions and interactions. Clinical diseases are characterized by distinct phenotypes. To identity disease genes, the relationship between genes and phenotypes is involved.

Background Problems Pheno -type Gene Disea -se Association

Construction heterogeous network Gene network based on HPRD g3 g1 g2 g5 g6 g4 g7

Construction heterogeous network Phenotype network using MinMiner p1 p2 p4 p5

Construction heterogeous network Gene-Phenotype network based on OMIM p1 p2 p4 p5 g3 g1 g2 g5 g6 g4 g7

Construction heterogeous network

Methods Katz CATAPULT Methods CIPHER GeneWalker Prince RWRH

Methods Methods Katz CATAPULT is successfully applied for link prediction in social networks. CATAPULT is a supervised learning method. Features are derived from hybrid walks through the heterogeneous network.

Katz g1 g2 g3 g4 g5 g6

Katz g1 g1 g2 g2 g3 g3 g3 g4 g5 g5 g6 g6 ……

Katz How to get the similarity matrix? Katz measure: , Small values of k (k=3 or k=4) are known to yield competitive performance in the task of recommending similar nodes.

Katz on the heterogeneous network Adjacency matrix of heterogeneous network: AG gene-gene network B the bipartite network genes and phenotypes APHS the similarity matrix of human diseases APS the similarity matrix of phenotypes of other species

Katz on the heterogeneous network Katz similarity measure specialized to A: K=3,the similarities between gene nodes and human disease nodes could be denoted by

the number of bootstraps CATAPULT How to train a biased SVM? T the number of bootstraps the sets of positive the set of unlabeled gene-phenotype pairs n+ the number of examples in A Step 1: Draw a bootstrap sample of size n+ . Step 2: Train a linear classifier θ using the positive training examples A and U as negative examples.

CATAPULT How to train a biased SVM? Step 2: Training classifier

CATAPULT How to train a biased SVM? Step 3: For any update:

Thank You !