Access to Sequence Data and Related Information

Slides:



Advertisements
Similar presentations
Online Counseling Resource YCMOU ELearning Drive… School of Architecture, Science and Technology Yashwantrao Chavan Maharashtra Open University, Nashik.
Advertisements

Molecular Markers.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
GENBANK, SWISSPROT AND OTHERS As Problem Sources for CSE 549 Andriy Tovkach Genetics.
Genome Browsers Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Alignment of mRNAs to genomic DNA Sequence Martin Berglund Khanh Huy Bui Md. Asaduzzaman Jean-Luc Leblond.
Genome Related Biological Databases. Content DNA Sequence databases Protein databases Gene prediction Accession numbers NCBI website Ensembl website.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
Genomic Database - Ensembl Ka-Lok Ng Department of Bioinformatics Asia University.
How to access genomic information using Ensembl August 2005.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Goals of the Human Genome Project determine the entire sequence of human DNA identify all the genes in human DNA store this information in databases improve.
Molecular Genetics Introduction to The Structures of DNA and RNA
Last lecture summary. recombinant DNA technology DNA polymerase (copy DNA), restriction endonucleases (cut DNA), ligases (join DNA) DNA cloning – vector.
Doug Brutlag 2011 Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University School of Medicine Genomics, Bioinformatics.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Doug Brutlag 2011 Next Generation Sequencing and Human Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University.
Reading the Blueprint of Life
Chapter 20: Biotechnology. Essential Knowledge u 3.a.1 – DNA, and in some cases RNA, is the primary source of heritable information (20.1 & 20.2)
From Haystacks to Needles AP Biology Fall Isolating Genes  Gene library: a collection of bacteria that house different cloned DNA fragments, one.
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Sequence Databases What are they and why do we need them.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Bioinformatics Overview, NCBI & GenBank JanPlan 2012.
Manipulation of DNA. Restriction enzymes are used to cut DNA into smaller fragments. Different restriction enzymes recognize and cut different DNA sequences.
Copyright © 2010 Pearson Education Inc. Lecture 01 – Genetics & Genomics: An Introduction Based on Chapter 1 – Genetics: An introduction.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
Accessing information on molecular sequences Bio 224 Dr. Tom Peavy Sept 1, 2010.
Biological databases Exercises. Discovery of distinct sequence databases using ensembl.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Highlights of DNA Technology. Cloning technology has many applications: Many copies of the gene are made Protein products can be produced.
EB3233 Bioinformatics Introduction to Bioinformatics.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
ESTs Ian Keller Laboratory Techniques in Molecular Bio.
Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
Chapter 20 DNA Technology and Genomics. Biotechnology is the manipulation of organisms or their components to make useful products. Recombinant DNA is.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Vectors Bacteria, viruses or liposomes into which DNA can be inserted. These can be used to grow genes, harvest the proteins they code for or deliver them.
Simple-Sequence Length Polymorphisms
Chapter 2: Access to Information Jonathan Pevsner, Ph.D.
Introduction to Genes and Genomes with Ensembl
Part 3 Gene Technology & Medicine
Introduction to Bioinformatics
Molecular Marker Characterization of plant genotypes
Introduction to Bioinformatics and Functional Genomics
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
One method of rapidly analyzing and comparing DNA is gel electrophoresis. Gel electrophoresis separates macromolecules - nucleic acids or proteins - on.
Introduction to bioinformatics
Section 3: Gene Technologies in Detail
생물정보학 Bioinformatics.
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Next Generation Sequencing and Human Genome Databases
Chapter 3. THE GENBANK SEQUENCE DATABASE
Gene Safari (Biological Databases)
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Restriction Fragment Length Polymorphism (RFLP)
Problems from last section
9-2 Replication of DNA.
Practical Contents DNA Extraction Gel Electrophoresis
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Access to Sequence Data and Related Information From Bioinformatics and Functional Genomics, Third Edition, Jonathan Pevsner.2015

Learning objectives define the types of molecular databases; define accession numbers and the significance of RefSeq identifiers; describe the main genome browsers and use them to study features of a genomic region; and use resources to study information about both individual genes (or proteins) and large sets of genes/proteins.

Introduction to Biological Databases In 1995 the complete genome of a free-living organism was sequenced for the first time, the bacterium Haemophilus influenzae DNA sequence data collected from over 300,000 different species of organisms 1970s dideoxynucleotide sequencing (“Sanger sequencing”) Since 2005 next-generation sequencing (NGS) technology

Centralized Databases Store DNA Sequences

NCBI

EBI

DDBJ

Growth of DNA sequence in repositories

Scales of DNA base pairs

Contents of DNA, RNA, and Protein Databases

Genbank data file division

Types of Data in GenBank/EMBL-Bank/DDBJ

Genomic DNA Databases DNA-Level Data: Sequence-Tagged Sites (STSs) DNA-Level Data: Genome Survey Sequences (GSSs) DNA-Level Data: High-Throughput Genomic Sequence (HTGS)

Sequence-tagged site A sequence-tagged site (or STS) is a short (200 to 500 base pair) DNA sequence that has a single occurrence in the genome and whose location and base sequence are known. STSs can be easily detected by the polymerase chain reaction (PCR) using specific primers. For this reason they are useful for constructing genetic and physical maps from sequence data reported from many different laboratories. They serve as landmarks on the developing physical map of a genome. When STS loci contain genetic polymorphisms (e.g. simple sequence length polymorphisms, SSLPs, single nucleotide polymorphisms), they become valuable genetic markers, i.e. loci which can be used to distinguish individuals. They are used in shotgun sequencing, specifically to aid sequence assembly. STSs are very helpful for detecting microdeletions in some genes. For example, some STSs can be used in screening by PCR to detect microdeletions in Azoospermia (AZF) genes in infertile men. fromf

RNA data RNA-Level Data: cDNA Databases Corresponding to Expressed Genes RNA-Level Data: Expressed Sequence Tags (ESTs) RNA-Level Data: UniGene

Protein Databases UniProt

Central Bioinformatics Resources: NCBI and EBI

Accession Numbers to Label and Identify Sequences

The Reference Sequence (RefSeq) Project

Access to Information via Gene Resource at NCBI

Flatfile type&Fasta

Command-Line Access to Data at NCBI

Access to Information: Genome Browsers The University of California, Santa Cruz (UCSC) Genome Browser The Ensembl Genome Browser The Map Viewer at NCBI