CSLS Retreat 2007 Matan Hofree & Assaf Weiner 1. Outline  A brief introduction to microRNA  Project motivation and goal  Selecting the data sets 

Slides:



Advertisements
Similar presentations
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
Advertisements

Stochastic Analysis of Bi-stability in Mixed Feedback Loops Yishai Shimoni, Hebrew University CCS Open Day Sep 18 th 2008.
Two short pieces MicroRNA Alternative splicing.
Improving miRNA Target Genes Prediction Rikky Wenang Purbojati.
Functional Non-Coding DNA Part I Non-coding genes and non-coding elements of coding genes BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG.
Chromosome structure and chemical modifications can affect gene expression
Naveen K. Bansal and Prachi Pradeep Dept. of Math., Stat., and Comp. Sci. Marquette University Milwaukee, WI (USA)
MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
A turbo intro to (the bioinformatics of) microRNAs 11/ Peter Hagedorn.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Predicting RNA Structure and Function
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs LEE P. LIM, NELSON C. LAU, PHILIP GARRETT-ENGELE ANDREW GRIMSON,
Computational biology seminar
Investigating the Importance of non-coding transcripts.
The Model To model the complex distribution of the data we used the Gaussian Mixture Model (GMM) with a countable infinite number of Gaussian components.
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
Predicting RNA Structure and Function. Following the human genome sequencing there is a high interest in RNA “Just when scientists thought they had deciphered.
Predicting RNA Structure and Function
Chapter 15 Noncoding RNAs. You Must Know The role of noncoding RNAs in control of cellular functions.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
Where Will They Strike Next? microRNA targeting tactics in the war on gene expression Jeff Reid Miller “Lab” Baylor College of Medicine.
Xuhua Xia Signposts for translation initiation: An illustration of formulating a research project Xuhua Xia.
Regulating Gene Expression from RNA to Protein. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing.
Finish up array applications Move on to proteomics Protein microarrays.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
MicroRNA identification based on sequence and structure alignment Presented by - Neeta Jain Xiaowo Wang†, Jing Zhang†, Fei Li, Jin Gu, Tao He, Xuegong.
RNA Folding. RNA Folding Algorithms Intuitively: given a sequence, find the structure with the maximal number of base pairs For nested structures, four.
Small RNAs and their regulatory roles. Presented by: Chirag Nepal.
More regulating gene expression. Combinations of 3 nucleotides code for each 1 amino acid in a protein. We looked at the mechanisms of gene expression,
MBP1007/ Nucleic Acids A functional mRNA: The cytoplasmic story Objectives (1) To discuss the iNUTS and iBOLTS of how mRNAs function in the cytoplasm.
Marco Magistri , Journal Club. A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein “Structural genes encode proteins.
The generalized transcription of the genome Víctor Gámez Visairas Genomics Course 2014/15.
RNA Structure Prediction
Protein and RNA Families
MicroRNAs and Other Tiny Endogenous RNAs in C. elegans Annie Chiang JClub Ambros et al. Curr Biol 13:
Proposed redefinition of “gene” requires it to have a biological role Gerstein MB, …, Snyder M Genome Res 17: example of complexities observed.
Control of Gene Expression Chapter Proteins interacting w/ DNA turn Prokaryotic genes on or off in response to environmental changes  Gene Regulation:
Nature, 2008, Doi: /nature07103 Semrah Kati
Improving Intergenic miRNA Target Genes Prediction Rikky Wenang Purbojati.
RNA-ligand interactions and control of gene expression
Motif Search and RNA Structure Prediction Lesson 9.
Chapter 19 The Organization & Control of Eukaryotic Genomes.
Finding genes in the genome
RNA Structure Prediction
Expression profile of oncomiRs and tumor-suppressor miRs in urothelial carcinoma of the bladder Apostolos Zaravinos 1, George I. Lambrou 2, Jelena Radojicic.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Office hours: M 10/ :30am 2-5pm in Bio 6.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Chapter 13 Regulatory RNA Introduction  RNA functions as a regulator by forming a region of secondary structure (either inter- or intramolecular)
Chapter 18 – Gene Regulation Part 2
The Transcriptional Landscape of the Mammalian Genome
Fig Prokaryotes and Eukaryotes
Characterization of Transition Metal-Sensing Riboswitches
ANIMAL TARGET PREDICTION - TIPS
LncRNAs exert their effects by diverse mechanisms. LncRNAs exert their effects by diverse mechanisms. (A) lncRNAs can.
Chapter 13 Regulatory RNA.
Volume 19, Issue 5, Pages (May 2017)
Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets  Benjamin P. Lewis, Christopher B. Burge,
Volume 19, Issue 5, Pages (May 2017)
mRNA Degradation and Translation Control
Identification of miR‐499 targets
Volume 48, Issue 5, Pages (December 2012)
Pairing beyond the Seed Supports MicroRNA Targeting Specificity
Prediction of Plant MicroRNA Targets
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

CSLS Retreat 2007 Matan Hofree & Assaf Weiner 1

Outline  A brief introduction to microRNA  Project motivation and goal  Selecting the data sets  Features and data analysis  Results and conclusions 2

What are microRNAs?  Non-coding RNAs that regulate gene expression post- transcriptionaly  22~ nucleotide long  Evolutionary conserved  Located in intergenic regions or introns  Often specific to a particular tissue or a developmental stage  Human genomes are predicted to encode as many as 1000 unique miRs, that together possibly regulate one third of all genes 1 1 Bartel et. al. Cell (2005) 3

How do miRs work?  Ambion, Inc. Explorations 4

Possible mechanisms of miR-mediated inhibition of gene expression Two Types of mechanisms have been observed: 1. Direct effect on translation - Inhibition of initiation, elongation or termination. 2. Indirect effects – Affects the number of available transcripts by degradation or sequestration. 5 Timothy W. Nilsen (2007) Mechanisms of microRNA-mediated gene regulation in animal cells, Trends in Gen. 23, 5,

miR-target interaction principles 6  Target site in mRNA 3’UTR  Incomplete base pairing  G:U base-pairing  7 fully complementary bases in 5’ of miRNA – ‘Seed’ He-L& Hannon-GJ (2004)

miR-target interaction principles 7  Evolutionary conservation of target site  Repeating binding site in many of the known target genes Reinhart-BJ et. al. (2000) Nature 403: 901

Motivation for our work “Perfect seed pairing is not generally a reliable predictor for miRNA-target interactions” Didiano D. & Hobert O. (2006) Nat. Struct. Mol. 8 Experiment (modification in target) FunctionalNon- functional wt target site(control) 13 putative targets 3’ UTR with seed match and phylogenetic conservation Binding site in random 3’UTR Original target site in other miR regulated 3’UTR

Motivation for our work Conclusions from Hobert’s work:  The presence of a well matched miR target site in 3’ UTR alone, is not a reliable predictor for a miR- target functional interaction.  Proposed miR-target interactions possibly occur in very specific contexts. 9

The seed is not enough… 10

Discerning the true targets Possible factors contributing to miR-target functionality: 1. Unique miR-target duplex secondary structure 2. 3’ UTR accessibility and target site accessibility 3. RNA-binding protein cofactors 4. Sequence and structural motifs 5. Presence or absence of neighboring binding sites of the same or different miRs 11

The long and tedious tale of preparing the data sets... AATTCACATTCCGACTTT GGTAGACATTCCCTTAGA GGGCTACATTCCTTACGT CCGATACATTCCTACGGA AGATTACATTCCTGACCT Functional miR-target interactionNon-functional miR-target interaction... GCCTCACATTCCCCTTGT CAGTGACATTCCCAGAGA CATTCACATTCCTCAGTT TGTCTACATTCCTTATCA GACCTACATTCCTCATTG......

The long and tedious tale of preparing the data sets 13 Over expression of miR miR silencing (antisense) treated wt wt treated Search for down- regulated genes Search for up- regulated genes

The long and tedious tale of preparing the data sets 14 Over expression of miR miR silencing (antisense) treated wt wt treated Search for down- regulated genes Search for up- regulated genes Lim et al. (2005) over expression of miR-1, miR- 124 and miR-373 in human Hela cells Esau et al. (2006) & Krutzfeldt et al. (2005) silenced miR-122 in mouse liver cells

Lim et al. (2005)  Fold change down regulation >= 1.4  Expression change P-Val <  Get equally expressed genes ( P-Val > 0.85)  Expression above median  Checked fold change ~1 miR-regulated groupmiR-non-regulated group 15 Screen out detection anomalies as defined by array Consistency between 12H and 24H sets Full 2-8(7 nt) seed match Screen out detection anomalies as defined by array Consistency between 12H and 24H sets Full 2-8(7 nt) seed match

Lim et al. (2005) 16 * Taking into account overlaps between groups

Results so far… 17

CLDN12 AAGAAAACTTCTTGTAGCCTCACATTCCCCTTGTGCAAAGAGCTC TAGLN2 TAGATATATATTTTAGCAGTGACATTCCCAGAGAGCCCCAGAGCT ARCN1 GCCACAGTCTGTAATCCATTCACATTCCTCAGTTTCACCACCTCC TIP120A TTTCATTCCGTTTGGATGTCTACATTCCTTATCAAAGGATATAAA PGM2 TTATGTGTTTTACAAAGACCTACATTCCTCATTGTTTCATGTTTG... Target conservation 18 (a)Targets alignment around seed

Target conservation 19 CLDN12 C A C A T T C C C (a)Targets alignment around seed (b)Extracting conservation values for each target (UCSC hg17)

CLDN12 C A C A T T C C C TAGLN2 G A C A T T C C C 0.5 ARCN1 C A C A T T C C T 0.5 TIP120A T A C A T T C C T 0.6 PGM2 T A C A T T C C T 0.7 ======================================================== Conservation: 0.5 Target conservation 20 (a)Targets alignment around seed (b)Extracting conservation values for each target (UCSC hg17) (c)Selecting the median value for each position

Target conservation 21 Corrected α =

Target conservation 22 Corrected α =

Target conservation 23 Corrected α =

Seed repeats in 3’ UTR 24 The positive group has statistically significantly more sequences with repeating seeds (p= )

Neighboring seeds density CLDN12 AAGAAAACTTCTTGTAGCCTCACATTCCCCTTGTGCAAAGAGCTC TAGLN2 TAGATATATATTTTAGCAGTGACATTCCCAGAGAGCCCCAGAGCT ARCN1 GCCACAGTCTGTAATCCATTCACATTCCTCAGTTTCACCACCTCC TIP120A TTTCATTCCGTTTGGATGTCTACATTCCTTATCAAAGGATATAAA PGM2 TTATGTGTTTTACAAAGACCTACATTCCTCATTGTTTCATGTTTG hsa-miR-302b hsa-miR Seed density of window Seeds density = mean of targets density 0.12

Neighboring seeds density 26 Corrected α=  Overlapping seeds (miR binding sites) may interfere with miR function  Neighboring seeds contribute to functionality  Possibly favored periodical distances between seeds ?

miR-mRNA binding pattern 27  We need a way to determine what is the preferable secondary structure of the miR-target complex  RNAcofold from the Vienna package target 5' UUGGCUCACUUGCCUUAGTCATCGCTA 3' miRNA 3' ACCGUAAGUGGCGCACGGAAUU 5' target 5' UUGGUCAUUUUUUUUAUAUUGCCUUAU 3' miRNA 3' ACCGUAAGUGGCGCACGGAAUU 5' target 5' AUCAUAUUAAUUGUGCCUUAUAGTAAT 3' miRNA 3' ACCGUAAGUGGCGCACGGAAUU 5'

(a) Using Vienna package RNAcofold we estimated the interaction between the miR and putative targets target 5' U G 3' UGGC UCACU UGCCUUA ACCG AGUGG ACGGAAU miRNA 3‘ UA CGC U 5‘ target 5' U U UUUUUAUAU U 3‘ UGG CAUUU UGCCUUA ACC GUAAG ACGGAAU miRNA 3‘ UGGCGC U 5‘ target 5' A AUAUUAAU U 3‘ UCA UGUGCCUUA AGU GCACGGAAU miRNA 3' ACCGUA GGC U 5' miR-mRNA binding pattern 28 Bound Not bound

target 5' U G 3' UGGC UCACU UGCCUUA ACCG AGUGG ACGGAAU miRNA 3‘ UA CGC U 5‘ target 5' U U UUUUUAUAU U 3‘ UGG CAUUU UGCCUUA ACC GUAAG ACGGAAU miRNA 3‘ UGGCGC U 5‘ target 5' A AUAUUAAU U 3‘ UCA UGUGCCUUA AGU GCACGGAAU miRNA 3' ACCGUA GGC U 5' (b) For each miR position we calculated the fraction of times the position was found bound in each set. miR-mRNA binding pattern 29 2/ 3 3/3

miR-mRNA binding pattern miR-1miR Corrected p-value α =

miR-mRNA binding pattern 31

miR-target bound positions 32

miR-target bound positions 33 Seed extension contributes to functionality miR another example of difference in its operation mechanism Seed extension contributes to functionality miR another example of difference in its operation mechanism

Target accessibility 34 Corrected α =

Target accessibility 35 Corrected α =

Conclusions 36 miR specific conclusions  Different binding patterns characterize each miR  miR-1 position 13  miR-373 low seed affinity  The function of certain miRs is perhaps more dependent on multiple binding sites (miR-373).

Conclusions 37 Generalizing properties  Positive group targets show evolutionary conservation  Positive group targets are more accessible  Positive group is enriched in repeating binding sites of the same miR  Indication for miR cooperation or interference (periodic binding sites?)

What’s next?  Apply all the tests on additional data sets  Redefine the data sets with imperfect seeds  Further search for structural motifs enriched in our positive set  Explore 5’ UTR of the sets we defined for sequence or structural motifs  Suggest a biological model to mechanism for miR-target specificity  Possibly try to generalize the features we’ve found in an SVM based classification program 38

Acknowledgements  Dr. Yael Altuvia  Prof. Hanah Margalit  All the wonderful people in Hanah’s lab. 39