Work presentation Gaurav Moghe Feb 4 th, 2008 – March 17 th, 2008.

Slides:



Advertisements
Similar presentations
COMPARATIVE FUNGAL GENOMICS : KNOWN KNOWNS & KNOWN UNKNOWNS. Josh Herr, Plant Biology Interdepartmental Program.
Advertisements

Homology Based Analysis of the Human/Mouse lncRNome
What are the Methods and Approaches Used to Identify and Study Arabidopsis Seed Knock- Out Mutations? Eric Newton Garen Polatoglu Rena Schweizer.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Gene Prediction Methods G P S Raghava. Prokaryotic gene structure ORF (open reading frame) Start codon Stop codon TATA box ATGACAGATTACAGATTACAGATTACAGGATAG.
Hidden Markov Models Sasha Tkachev and Ed Anderson Presenter: Sasha Tkachev.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Project Proposals Due Monday Feb. 12 Two Parts: Background—describe the question Why is it important and interesting? What is already known about it? Proposed.
Comparative ab initio prediction of gene structures using pair HMMs
Whole Genome Polymorphism Analysis of Regulatory Elements in Breast Cancer AAGTCGGTGATGATTGGGACTGCTCT[C/T]AACACAAGCGAGATGAAGAAACTGA Jacob Biesinger Dr.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
Gene Regulation: What it is, and how to detect it By Jordan, Jennifer, and Brian.
High Throughput Sequencing
Gene Finding Genome Annotation. Gene finding is a cornerstone of genomic analysis Genome content and organization Differential expression analysis Epigenomics.
Lecture 19, Chapter 11 Analysis of transgenic plants part II Neal Stewart.
Analysis of Transgenic Plants. 1.Regeneration on Selective Medium Selectable Marker Gene.
Fine Structure and Analysis of Eukaryotic Genes
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
Tomato genome annotation pipeline in Cyrille2
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
First Rotation Experiences in Steve Whitham’s Lab By Jesse Walsh.
NCBI Review Concepts Chuong Huynh. NCBI Pairwise Sequence Alignments Purpose: identification of sequences with significant similarity to (a)
Figure S1. Genomic PCR of in vitro potato plants transformed with StPTB1 prom (top) and StPTB6 prom (bottom) constructs using nptII-specific primers. Thirty.
BME 110L / BIOL 181L Computational Biology Tools October 29: Quickly that demo: how to align a protein family (10/27)
BME 110L / BIOL 181L Computational Biology Tools February 19: In-class exercise: a phylogenetic tree for that.
Expression of the Genome The transcriptome. Decoding the Genetic Information  Information encoded in nucleotide sequences contained in discrete units.
Module 3 Sequence and Protein Analysis (Using web-based tools) Working with Pathogen Genomes - Uruguay 2008.
Welcome to DNA Subway Classroom-friendly Bioinformatics.
CISC667, F05, Lec9, Liao CISC 667 Intro to Bioinformatics (Fall 2005) Sequence Database search Heuristic algorithms –FASTA –BLAST –PSI-BLAST.
BLAST Basic Local Alignment Search Tool (Altschul et al. 1990)
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
Construction of Substitution Matrices
DDF1 and DDF2: 2 transcription factor genes involved in salt stress responses Juyeon Park PI: Dr. Shinhan Shiu Mentor: Dr. Cheng Zou Shiu Lab at Plant.
Mark D. Adams Dept. of Genetics 9/10/04
Comp. Genomics Recitation 9 11/3/06 Gene finding using HMMs & Conservation.
Chapter 11: Functional genomics
Plant Biology Division Post-process of IMGAG M.t. 2.0 Release Affymetrix Medicago Probe set – IMGAG 2.0 / MTGI 8.0 Mapping Zhao Bioinformatics Lab.
Figure S1 Enod11 Mtc27 MtGshs cDNA gDNA S. meliloti DPI Supporting Information Fig. S1. Validation of the selected biological conditions for.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Fig. S1. Amino acid sequence alignment of MYBS3 proteins. MYBS3 protein sequences of Arabidopsis thaliana (MYBH; NP_199550); (At3g16350; NP_188256), Glycine.
AtPAT10 TIP1 Akr1p1 Akr2p Erf2p Swf1p Pfa5 Pfa3 GODZ HIP14 Pfa4 AtPAT10 TIP1 Akr1p1 Akr2p Erf2p Swf1p Pfa5 Pfa3 GODZ HIP14 Pfa4 AtPAT10 TIP1 Akr1p1 Akr2p.
Gene discovery using combined signals from genome sequence and natural selection Michael Brent Washington University The mouse genome analysis group.
Construction of Substitution matrices
SRB Genome Assembly and Analysis From 454 Sequences HC70AL S Brandon Le & Min Chen.
Annotation of eukaryotic genomes
What is BLAST? Basic BLAST search What is BLAST?
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Work Presentation Novel RNA genes in A. thaliana Gaurav Moghe Oct, 2008-Nov, 2008.
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
Figure S1. Co-expression network for MNM1: Shown is the co-expression network obtained from the ATTED-II database using MNM1 as a bait gene. The red dots.
What is BLAST? Basic BLAST search What is BLAST?
bacteria and eukaryotes
2470 bp 1891 bp WT bp 2314 bp A B Fig. S1. Verification with PCR amplification of the.
Basics of BLAST Basic BLAST Search - What is BLAST?
Sequence based searches:
Experimental Verification Department of Genetic Medicine
Sequencing Data Analysis
Eukaryotic Gene Finding
Bioinformatics and BLAST
Nucleic Acid Interactions Practicalities
Gene Annotation with DNA Subway
Cis-regulatory evolution of duplicate genes in yeasts
Identify D. melanogaster ortholog
Rotation review Gaurav Moghe Genetics Program
Volume 3, Issue 2, Pages (March 2010)
Volume 13, Issue 16, Pages (August 2003)
Basic Local Alignment Search Tool
Sequencing Data Analysis
Polymerase Chain Reaction (PCR)
Presentation transcript:

Work presentation Gaurav Moghe Feb 4 th, 2008 – March 17 th, 2008

Projects Divergence between DDF1 and DDF2 Expression differences between A.thaliana and A.lyrata under JA-induced stress Novel coding regions in A. thaliana Amanitus-related toxins in other fungal genomes

1. Divergence between DDF1 and DDF2 Objective: To find protein-level differences between DDF1 and DDF2 using translational fusions Subobjective: Verify transformants for GUS marker -- GUS Staining -- Homo/Heterozygosity using GUS-specific primers

Strategy: Translational fusions DDF1 DDF2 GUS Endogenous promoter Spatial and temporal differentiation of expression in Arabidopsis But, Existing lines from older A. thaliana strains. Moving everything to the newer Col0 strains. 10 lines 1 line

GUS Staining results Tried GUS staining from leaves, flowers, roots and first inter-node No GUS activity detected Progeny of transformants not differentiable by Hyg resistance phenotype

Future plans 1) New Col0 strains (DDF1-GUS) put on soil 2) Perform GUS staining with new Col0 strains under salt-stress from roots and inter-node 3) Verify homo / heterozygosity using GUS- specific primers

ChIP-Sequencing part 35S-cMyc-DDF1/2 construct (1 probable plant) 35S-DDF1/2-GFP construct Transformant should show over-expression phenotype, but that was not observed for GFP construct. So, will have to be re-done.

2. JA-induced stress expression A. thalianaA. lyrata JA treated22 Control22 Extract RNA using RNAeasy Solexa sequencingqRT-PCR Analysis

RNAEasy (Qiagen) SampleYield260/280260/230 AT JAFrom ng/micL Above 2.0 (low/none protein contamination) AT JA AT Control AL JA AL Control

DNA contamination The extracted RNA samples (200ng) were PCR-amplified using GoTaq with: 1) actin_6 (flanks an intron) 2) DDF2 primers for 40X and 32X cycles. Both the reactions showed significant presence of DNA (~20ng after amplification)

DNA contamination: options The SuperArray RT2 First strand kit (Problem: Two-step RTPCR. No kit mentioned for Eppendorf Realplex machine) Design primers flanking the longest introns If possible, keep a smaller extension time(?) (Problem: Can affect RNA amplification) Keep RT(-) control during qRT-PCR (Problem: What is the best way to subtract effect of DNA?) Re-do the extraction

3. Novel coding regions in A.thaliana Two directions of study Gene predictorsRNA genes 1.Augustus 2.GlimmerM 3.GeneID 4.TwinScan 5.EuGene 6.CONTRAST 7.SNAP Whole genome alignments BioPerl installation issue For some reason, not accepting Chr4 and Chr 5

Predicting existence of yet un-annotated RNA genes Question: Are there any regions in the genome that: 1) Are not known proteins/RNA? 2) Conserved across phylogeny? Are such regions: 1) RNA? 2) Protein?

RNA genes: Strategy Download A.thaliana PUTs from PlantGDB (best alternative to FLcDNA) BLASTx against UniProt (UniRef90) (1) RFAM database of known/predicted RNA molecules BLASTx against UniProt (UniRef90) (2) PUTs not known to be proteins Filter (1) using (2) distribution Discard significant matches Retain non-matching and not significantly matching PUTs

RNA genes: Strategy PUTs not known to be proteins BLASTn against RFAM (1) randomly generated PUTs for pre-selected size classes of the original PUTs BLASTn against RFAM (2) PUTs not known to be proteins or RNA Filter (1) using (2) distribution Discard significant matches Retain non-matching and not significantly matching PUTs

RNA genes: Strategy PUTs not known to be proteins or RNA BLAST against PUTs from different plants strategically placed in phylogeny Find conserved PUTs RNA prediction softwares 1)RNAZ 2)QRNA 3)ddbRNA 4)EvoFold 5)MSARI Conserved regions not yet annotated Are these PUTs mapped to A. thaliana genome? Yes

4. Amanita toxin project (Walton lab) Multiple sequence alignment of several toxin genes from two fungal genera – Amanita and Galerina Sequenced fungal genomes tBLASTn with relaxed parameters does not give any significant hits

Use of Hidden Markov Model (HMM) to generate a PWM for MSA of toxins MARLP PCE HMM Scan 6-frame translated fungal genomes Hits of very low e-value ( ), but some degree of relevant sequence similarity Ama-like toxins MSA

Thank you