Project of CZ5225 Zhang Jingxian:

Slides:



Advertisements
Similar presentations
BiGCaT Bioinformatics Hunting strategy of the bigcat.
Advertisements

Affymetrix Gene Expression Microarrays Application to Pulmonary Arterial Hypertension Bob Stearman 02/24/2014.
Introduction to BioConductor Friday 23th nov 2007 Ståle Nygård Statistical methods and bioinformatics for the analysis of microarray.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Timothy H. W. Chan, Calum MacAulay, Wan Lam, Stephen Lam, Kim Lonergan, Steven Jones, Marco Marra, Raymond T. Ng Department of Computer Science, University.
OHRI Bioinformatics Introduction to the Significance Analysis of Microarrays application Stem.
Capture Packets using Wireshark. Introduction Wireshark – – Packet analysis software – Open source.
Microarray GEO – Microarray sets database
Public data - available for projects 6 data sets: –Human Tissues –Leukemia –Spike-in –FARO compendium – Yeast Cell Cycle –Yeast Rosetta Find one yourself.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Integrating domain knowledge with statistical and data mining methods for high-density genomic SNP disease association analysis Dinu et al, J. Biomedical.
Demonstration Trupti Joshi Computer Science Department 317 Engineering Building North (O)
Artificial Intelligence Term Project #3 Kyu-Baek Hwang Biointelligence Lab School of Computer Science and Engineering Seoul National University
Cluster Analysis Hierarchical and k-means. Expression data Expression data are typically analyzed in matrix form with each row representing a gene and.
Geometric Approaches to Reconstructing Time Series Data Project Update 29 March 2007 CSC/Math 870 Computational Discrete Geometry Connie Phong.
A Bioinformatics Meta-analysis of Differentially Expressed Genes in Colorectal Cancer Simon Chan, Thursday Trainee Seminar – October 11.
Lecture 19 Classification analysis – R code
Multiple Examples of tumor tissue (public data from Whitehead/MIT) SVM Classification of Multiple Tumor Types DNA Microarray Data Oracle Data Mining 78.25%
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
ONLINE BIOMARKER VALIDATION OF SURVIVAL- ASSOCIATED BIOMARKERS IN BREAST AND OVARIAN CANCER USING MICROARRAY DATA OF 3,862 4,323 PATIENTS Balázs Győrffy.
Enabling biomarker validation in breast cancer molecular subtypes: sensitivity and specificity of array-based subtype classification in 983 patients Balázs.
Experiment Title Include the following somewhere on this slide! Lab Number (Example, Lab 2 for Free Fall etc.) Instructor: Date of the Lab: Team: (Names.
Graph and Topological Structure Mining on Scientific Articles Fan Wang, Ruoming Jin, Gagan Agrawal and Helen Piontkivska The Ohio State University The.
1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.
RNAseq analyses -- methods
Introduction to BioConductor 許家維 許文馨 游崇善 陳彥如. Bioconductor BioConductor 起初是由 Fred Hutchinson 癌症研究 中心發起的計畫,之後有許多來自不同國家的研 究人員參與,這個計畫是一個為了分析理解基因 體資料的開放源碼計劃。
Agenda Introduction to microarrays
1.DATABASE construction  n=1,715  Median OS=40.0 months, age: 64+/-10 yrs  Histology (adeno/squamous/large): 50% / 45% / 5%  Stage 1/2/3/4: 63% / 27%
BioQUEST / SCALE-IT Module From Omics Data to Knowledge Case 1: Microarrays Namyong Lee Minnesota State University, Mankato Matthew Macauley Clemson University.
EDRN Biomarker Review Process Heather Kincaid, Andrew Hart, Kristen Anton, Mark Thornquist, Dan Crichton and Christos Patriotis.
Logics for Data and Knowledge Representation Semantic Matching Lab.
Project of CZ5225 Zhang Jingxian:
Artificial Intelligence Project #3 : Analysis of Decision Tree Learning Using WEKA May 23, 2006.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
1 Introduction Description of the video series Meta-analysis in R with Metafor.
Introduction to Microarrays Dr. Özlem İLK & İbrahim ERKAN 2011, Ankara.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Developed at the Broad Institute of MIT and Harvard Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, and Mesirov JP. GenePattern 2.0. Nature Genetics 38.
Gene Expression Omnibus (GEO)
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Artificial Intelligence Project #3 : Diagnosis Using Bayesian Networks May 19, 2005.
1 Econometrics (NA1031) Lecture 1 Introduction. 2 ”How much” type questions oBy how much a unit change in income affects consumption? oBy how much should.
Getting the story – biological model based on microarray data Once the differentially expressed genes are identified (sometimes hundreds of them), we need.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
PROGRAMMING IN R Introduction to R. In this session I will: Introduce you to the R program and windows Show how to install R Write basic programs in R.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Using official data-sources Tom Spencer, Social Justice Analysis.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
Introduction to Oncomine Xiayu Stacy Huang. Oncomine is a cancer-specific microarray database and has a web-based data-mining platform aimed at facilitating.
Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
GEO (Gene Expression Omnibus) Deepak Sambhara Georgia Institute of Technology 21 June, 2006.
David Amar, Tom Hait, and Ron Shamir
CellExpress Tutorial A Comprehensive Microarray-Based Cancer Cell Line and Clinical Sample Gene Expression Analysis Online System :8080 NTU.
Cardiac Agatston Scoring
CellExpress Examples A Comprehensive Microarray-Based Cancer Cell Line and Clinical Sample Gene Expression Analysis Online System :8080 NTU.
Impact of Formal Methods in Biology and Medicine Final Review
Impact of Formal Methods in Biology and Medicine
Impact of Formal Methods in Biology and Medicine
Project of CZ5225 case study by using SOFT format from GEO
The mRNA stem cell signature.
Pan Du, Simon Lin Robert H. Lurie Comprehensive Cancer Center
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
Assignment 1: Classification by K Nearest Neighbors (KNN) technique
Course: Statistics in Bioinformatics Date: 指導教授: 陳光琦 學生: 吳昱賢
Cancer Cell Line Encyclopedia
Getting Data into R & Bioconductor
PD-L1 expression correlates with T-cell markers and an IFN response signature in human melanomas. PD-L1 expression correlates with T-cell markers and an.
PSMA expression in pre- and posttreatment prostate carcinoma serum.
Presentation transcript:

Project of CZ5225 Zhang Jingxian:

Identifying biomarkers of drug response for cancer patients Aims: Aims: To develop of predictors of response to drugs To develop of predictors of response to drugs To learn how to get public microarray data To learn how to get public microarray data To learn how to preprocess microarray raw data To learn how to preprocess microarray raw data To annotate the genes of interest To annotate the genes of interest

Requirements Each group investigates: Each group investigates: ONE kind of cancer patient drug response ONE kind of cancer patient drug response Need Two datasets from different studies Need Two datasets from different studies Download the raw data Download the raw data Use Bioconductor in R to prepossess raw data Use Bioconductor in R to prepossess raw data Identify certain number of genes Identify certain number of genes Annotate those identified genes in your report Annotate those identified genes in your report Each group needs only ONE report Each group needs only ONE report

Requirements All kinds of affymatrix expression datasets related to drug response of cancer patients are available All kinds of affymatrix expression datasets related to drug response of cancer patients are available Dataset needs to contain at least 20 samples Dataset needs to contain at least 20 samples Dataset needs two comparable outcome groups: response vs. non-response; resistance vs. non-resistance, et al. Dataset needs two comparable outcome groups: response vs. non-response; resistance vs. non-resistance, et al.

Bioconductor & R

Advantages Advantages Cross platform Cross platform Linux, windows and MacOS Linux, windows and MacOS Comprehensive and centralized Comprehensive and centralized Analyzes both Affymetrix and two color spotted microarrays, and covers various stages of data analysis in a single environment Analyzes both Affymetrix and two color spotted microarrays, and covers various stages of data analysis in a single environment Cutting edge analysis methods Cutting edge analysis methods New methods/functions can easily be incorporated and implemented New methods/functions can easily be incorporated and implemented Quality check of data analysis methods Quality check of data analysis methods Algorithms and methods have undergone evaluation by statisticians and computer scientists before launch. And in many cases there are also literature references Algorithms and methods have undergone evaluation by statisticians and computer scientists before launch. And in many cases there are also literature references Good documentations Good documentations Comprehensive manuals, documentations, course materials, course notes and discussion group are available Comprehensive manuals, documentations, course materials, course notes and discussion group are available A good chance to learn statistics and programming A good chance to learn statistics and programming

Installation R & Bionconductor Install R from: Install R from: Open R platform then execute: Open R platform then execute:>source(" Check library by execute: >library() Check library by execute: >library()

Case study Dataset source (GSE19697): Dataset source (GSE19697):

Decompress raw data into: D://gse19697 Decompress raw data into: D://gse19697 Create title.txt : Create title.txt :

Open R Open R Set workdir by execute: Set workdir by execute: >setwd( ‘ d://gse19697 ’ ) >setwd( ‘ d://gse19697 ’ ) Load simpleaffy module by execute: Load simpleaffy module by execute: >library(simpleaffy) >library(simpleaffy) Load data by: Load data by: >eset eset <- read.affy('title.txt')

Calculate expression by: Calculate expression by: >eset.rma eset.rma <- call.exprs(eset,'rma') Compare two groups by: Compare two groups by: >pc.result pc.result <- pairwise.comparison(eset.rma, "title", c("pCR", "RD"), eset)

Filter significant change markers between two goups by: Filter significant change markers between two goups by: >significant significant <- pairwise.filter(pc.result,fc=log2(1.5), tt=0.001)

Plot significant changed markers: Plot significant changed markers: >plot(significant) >plot(significant) Annotate selected markers: Annotate selected markers: >significant >significant

Annotate selected markers: Annotate selected markers:

Materials submitted ONE report: ONE report: Introduction of your project Introduction of your project Datasets used in the project Datasets used in the project Procedures of processing data Procedures of processing data Annotation of selected markers Annotation of selected markers Conclusion Conclusion File: title.txt File: title.txt

Useful resources me/R_BioCondManual#biocon_simpleaffy me/R_BioCondManual#biocon_simpleaffy me/R_BioCondManual#biocon_simpleaffy me/R_BioCondManual#biocon_simpleaffy ease/bioc/html/simpleaffy.html ease/bioc/html/simpleaffy.html ease/bioc/html/simpleaffy.html ease/bioc/html/simpleaffy.html