The iPlant Collaborative Vision www.iPlantCollaborative.org Enable life science researchers and educators to use and extend cyberinfrastructure.


Important Dates in Genomics
1986: DOE announces Human Genome Initiative; $5.3 million to develop technology
DOE & NIH present their HGP plan to Congress
Escherichia coli genome published
1997: Yeast genome published
2000: Fruit fly (Drosophila) genome published
Working draft of the human genome announced
Thale cress (Arabidopsis) genome published
(2x) Rice genome published
(2x) Human genome published
First tree genome published in Science
First metagenomics study published

Economics of Scale [Figure: sequence production (billions of bases/month) and cost (cents per base) over time, with the launch and completion of the Human Genome Project marked.] Slide: JGI, 2009

Another angle Slide: Stein, 2010

What is sequencing? Just as computer software is rendered in long strings of 0s and 1s, the GENOME, or “software” of life, is represented by a string of the four nucleotides A, G, C, and T. To understand the software of either a computer or a living organism, we must know the order, or sequence, of these informative bits. Slide: JGI, 2009

What is a genome? A GENOME is all of a living thing’s genetic material. The genetic material is DNA (deoxyribonucleic acid). DNA, a double-helical molecule, is made up of four nucleotide “letters”: A, G, T, and C. Slide: JGI, 2009

Exciting? >mouse_ear_cress_1080 GAAATAATCAATGGAATATGTAGAGGTCTCCTGTACCTTCACAGAGATTCTAGGCTGAGAGCAGTGCATATAGATATCTTT CGTACTCATCTGCTTTTTCTGGTCTCCATCACAAAAGCCAACTAGGTAATCATATCAATCTCTCTTTACCGTTTACTCGAC CTTTTCCAATCAGGTGCT TCTGGTGTGTCTACTACTATCAGTTTTAGGTCTTTGTATACCTGATCTTATCTGCTACTG AGGCTTGTAAAAGTGATTAAAACTGTGACATTTACTCTAAGAGAAGTAACCTGTTTGATGCATTTCCCTAATATACCGGTG TGGAAAAGTGTAGGTATCTGTACTCAGCTGAAATGGTGGACGATTTTGAAGAAGATGAACTCTCATTGACTGAAAGCGGGT TGAAGAGTGAAGATGGCGTTATTATCGAGATGAATGTCTCCTGGATGCTTTTATTATCATGTTTGGGAATTTACCAAGGGA GAGGTATCAGAATCTATCTTAGAAGGTTACATTTAGCTCAAGCTTGCATCAACATCTTTACTTAGAGCTCTACGGGTTTTA GTGTGTTTGAAGTTTCTTAACTCCTAGTATAATTAGAATCTTCTGCAGCAGACTTTAGAGTTTTGGGATGTAGAGCTAACC AGAGTCGGTTTGTTTAAACTAGAATCTTTTTATGTAGCAGACTTGTTCAGTACCTGAATACCAGTTTTAAATTACCGTCAG ATGTTGATCTTGTTGGTAATAATGGAGAAACGGAAGAATAATTAGACGAAACAAACTCTTTAAGAACGTATCTTTCAGTTT TCCATCACAAATTTTCTTACAAGCTACAAAAATCGAACTATATATAACTGAACCGAATTTAAACCGGAGGGAGGGTTTGAC TTTGGTCAATCACATTTCCAATGATACCGTCGTTTGGTTTGGGGAAGCCTCGTCGTACAAATACGACGTCGTTTAAGGAAA GCCCTCCTTAACCCCAGTTATAAGCTCAAAGTTGTACTTGACCTTTTTAAAGAAGCACGAAACGAAAAACCCTAAAATTCC CAAGCAGAGAAAGAGAGACAGAGCAAGTACAGATTTCAACTAGCTCAAGATGATCATCCCTGTTCGTTGCTTTACTTGTGG AAAGGTTGATATTTTCCCCTTCGCTTTGGTCTTATTTAGGGTTTTACTCCGTCTTTATAGGGTTTTAGTTACTCCAAATTT GGCTAAGAAGAGATCTTTACTCTCTGTATTTGACACGAATGTTTTTAATCGGTTGGATACATGTTGGGTCGATTAGAGAAA TAAAGTATTGAGCTTTACTAAGCTTTCACCTTGTGATTGGTTTAGGTGATTGGAAACAAATGGGATCAGTATCTTGATCTT CTCCAGCTCGACTACACTGAAGGGTAAGCTTACAATGATTCTCACTTCTTGCTGCTCTAATCATCATACTTTGTGTCAAAA AGAGAGTAATTGCTTTGCGTTTTAGAGAAATTAGCCCAGATTTCGTATTGGGTCTGTGAAGTTTCATATTAGCTAACACAC TTCTCTAATTGATAACAGAAGCTATAAAATAGATTTGCTGATGAAGGAGTTAGCTTTTTATAATCTTCTGTGTTTGTGTTT TACTGTCTGTGTCATTGGAAGAGACTATGTCCTGCCTATATAATCTCTATGTGCCTATCTAGATTTTCTATACAATTGATA TTTGATAGAAGTAGAAAGTAAGACTTAAGGTCTTTTGATTAGACTTGTGCCCATCTACATGATTCTTATTGGACTAATCAT TCTTTGTGTGAAAATAGAATACTTTGTCTGAACATGAGAGAATGGTTCATAATACGTGTGAAGTATGGGATTAGTTCAACA ATTTCGCTATTGGAGAAGCAAACCAAGGGTTAATCGTTTATAGGGTTAAGCTAATGCTCTGCTCTTTATATGTTATTGGAA 
CAGACTATTGTTGTGCCTATCTTGTTTAGTTGTAGATTCTATCTCGACTGTTATAAGTATGACTGAAGGCTTGATGACTTA TGATTCTCTTTACACCTGTAGAAGGATTTAAGCTTGGTGTCTAGATATTCAATCTGTGTTGGTTTTGTCTTTCTTTTGGCT CTTAGTGTTGTTCAATCTCCTCAATAGGTATGAAGTTACAATATCCTTATTATTTTGCAGGGACGCACTTGATGCACTCCA GCTAGTCAGATACTGCTGCAGGCGTATGCTAATGACCTTGCATCAACATCTTTACTTAGAGCTCTACGGGTTTTAGTGTGT
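A raw sequence like the one above becomes more approachable once a program summarizes it. The following is a minimal sketch, assuming the sequence is stored as FASTA text (a `>` header line followed by sequence lines). The function names `read_fasta`, `base_composition`, and `gc_content` are illustrative and not part of any iPlant tool, and the example record uses only the first bases of the sequence shown on the slide.

```python
from collections import Counter


def read_fasta(text):
    """Parse FASTA-formatted text into a {header: sequence} dictionary."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; the header is everything after ">"
            header = line[1:]
            records[header] = []
        elif header is not None:
            # Drop any whitespace left inside a sequence line
            records[header].append("".join(line.split()))
    return {name: "".join(parts) for name, parts in records.items()}


def base_composition(seq):
    """Count how often each of the four nucleotide letters occurs."""
    counts = Counter(seq.upper())
    return {base: counts.get(base, 0) for base in "ACGT"}


def gc_content(seq):
    """Fraction of A/C/G/T bases that are G or C (0.0 for an empty sequence)."""
    comp = base_composition(seq)
    total = sum(comp.values())
    return (comp["G"] + comp["C"]) / total if total else 0.0


# Hypothetical usage with the start of the slide's sequence
example = """>mouse_ear_cress_1080
GAAATAATCAATGGAATATGTAGAGGTCTCCTGTACC
TTCACAGAGATTCTAGGCTGAGAGC
"""

for name, seq in read_fasta(example).items():
    print(name, len(seq), base_composition(seq), round(gc_content(seq), 3))
```

Counting bases is only a first look; it says nothing yet about where genes are, which is what the annotation steps on the following slides address.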

This better?

Using Plants to Explore Genomics A large number of genomes are publicly and freely available for analysis.

Annotation workflow: Find Gene Families; Generate mathematical evidence; Analyze large data amounts; Browse in context; Build gene models; Gather biological evidence; Get DNA sequence

Walk or…

Early concept (2009)

DNA Subway 2014

Coming into the Genome Age For the first time in the history of science, students can work with the same data and tools that are used by researchers. Learning by asking and answering questions. Students generate new knowledge.

Workshop Objectives Illustrate the evolving concept of “gene.” Conceptualize a “big picture” of complex, dynamic genomes. Guide students to address real problems through modern genome science. Use educational and research interfaces for bioinformatics. Work with “real” genome sequences gathered by students – in the lab or online.

Molecular biology and bioinformatics concepts

RepeatMasker
Eukaryotic genomes contain large amounts of repetitive DNA.
Transposons can be located anywhere; they can mutate like any other DNA.

FGenesH Gene Predictor
Protein-coding information begins with start, followed by codons, ends in stop.
Codons in mRNA (AUG, UAA,…) have sequence equivalents in DNA (ATG, TAA,…).
Most eukaryotic introns have “canonical splice sites,” GT---AG (mRNA: GU---AG).
Gene prediction programs search for patterns to predict genes and their structure.
Different gene prediction programs may predict different genes and/or structures.

Multiple Gene Predictors
The protein-coding sequence of an mRNA is flanked by untranslated regions (UTRs).
UTRs hold regulatory information.

BLAST Searches
Gene or protein homologs share similarities due to common ancestry.
Biological evidence is needed to curate gene models predicted by computers.
mRNA transcripts and protein sequence data provide “hard” evidence for genes.
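The pattern-search idea behind gene prediction can be illustrated with a toy example. The sketch below is a deliberate simplification and not how FGenesH or any other real gene predictor works: it scans only the forward strand, ignores introns, and reports any stretch that begins with ATG and ends at the next in-frame stop codon. The names `transcribe`, `find_orfs`, and `has_canonical_splice_sites` are hypothetical helpers written for this illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def transcribe(dna):
    """Return the mRNA equivalent of a DNA coding sequence (T becomes U)."""
    return dna.upper().replace("T", "U")


def find_orfs(dna, min_codons=2):
    """Find open reading frames on the forward strand.

    Returns (start, end, sequence) tuples, where start is the 0-based
    position of ATG and end is one past the last base of the stop codon.
    min_codons counts the start and stop codons as well.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the reading frame
            elif start is not None and codon in STOP_CODONS:
                end = i + 3
                if (end - start) // 3 >= min_codons:
                    orfs.append((start, end, dna[start:end]))
                start = None  # look for the next start codon
    return orfs


def has_canonical_splice_sites(intron):
    """Check whether an intron begins with GT and ends with AG."""
    intron = intron.upper()
    return len(intron) >= 4 and intron.startswith("GT") and intron.endswith("AG")


# Hypothetical usage on a short made-up sequence
for start, end, orf in find_orfs("CCATGAAATAAGG"):
    print(start, end, orf, transcribe(orf))
print(has_canonical_splice_sites("GTAAGTTTCAG"))
```

Even this toy scanner shows why different programs can disagree: changing `min_codons`, or choosing a different start codon within the same frame, changes which candidates are reported, so biological evidence is still needed to decide among them.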

What is a gene? Can we define a gene? Has the definition of a gene changed? How can we find genes?

An Evolution of Sorts…
Genes as “independent hereditary units” (1866), Mendel
Genes as “beads on strings” (1926), Morgan
One gene, one enzyme (1941), Beadle & Tatum
DNA is the molecule of heredity (), Avery
DNA > RNA > Protein (1953), Crick, Watson, Wilkins
Transposons (1940s-50s), McClintock
Reverse transcription (1970), Temin & Baltimore
Split genes (1977), Roberts & Sharp
RNA interference (1998), Fire & Mello

Sequence & course material repository
Don’t open items; save them to your computer!
Annotation (sequences & evidence)
Manuals (DNA Subway, Apollo, JalView)
Presentations (.ppt files)
Prospecting (sequences)
Readings (bioinformatics tools, splicing, etc.)
Worksheets (Word docs, handouts, etc.)
BCR-ABL (temporary; not course-related)