Finding the needle in your DNAstack Ana Teresa Freitas Ciência 2010 – Encontro com a Ciência e Tecnologia em Portugal FIL, July 7, 2010 http://kdbio.inesc-id.pt.

Presentation transcript:

Finding the needle in your DNAstack Ana Teresa Freitas Ciência 2010 – Encontro com a Ciência e Tecnologia em Portugal FIL, July 7, 2010 http://kdbio.inesc-id.pt KDBIO Group 26-12-2018

Finding the Needle

Biology 2.0 (The Economist, June 2010)

The new law
Computing has increased in potency according to Moore's law: it doubles in power roughly every two years.
Sequencing the human genome took 13 years and $3 billion.
Now, Illumina can read the genome in 8 days for $10,000.
Pacific Biosciences has a technology that, in three years' time, will be able to map a human genome in 15 minutes for less than $1,000.
Source: The Economist, June 2010
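To put the quoted figures side by side, here is a rough back-of-the-envelope sketch in Python. It uses only the numbers on the slide and asks how long the same cost drop would take if cost merely halved every two years, which is an assumed reading of Moore's law applied to price. The variable and function names are illustrative.

```python
import math

# Figures quoted on the slide (The Economist, June 2010).
first_genome_cost = 3_000_000_000  # dollars, first human genome
first_genome_days = 13 * 365       # 13 years, ignoring leap days
illumina_cost = 10_000             # dollars, Illumina
illumina_days = 8

def halvings_needed(start, end):
    """Number of times `start` must be halved to reach `end`."""
    return math.log2(start / end)

cost_factor = first_genome_cost / illumina_cost   # how many times cheaper
time_factor = first_genome_days / illumina_days   # how many times faster

# Assumption: a Moore's-law pace means one halving of cost every two years.
halvings = halvings_needed(first_genome_cost, illumina_cost)
years_at_moore_pace = 2 * halvings

print(f"Cost fell by a factor of {cost_factor:,.0f}")
print(f"Time fell by a factor of about {time_factor:,.0f}")
print(f"That is about {halvings:.1f} halvings, "
      f"or roughly {years_at_moore_pace:.0f} years at a Moore's-law pace")
```

Under these assumptions the cost drop corresponds to about 18 halvings, which a two-year halving pace would need more than three decades to deliver.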

Biological databases
[Figure: sequence fragments from labs, genome assembly, algorithms, and curators feeding GenBank, UniGene, and RefSeq]

[Figure: the three collaborating sequence databases, each receiving submissions and updates: GenBank (NCBI, NIH; accessed through Entrez), EMBL (EBI; accessed through SRS), and DDBJ (CIB, NIG; accessed through getentry). Credit: Francis Ouellette, August 3rd, 1999]

So why do biologists care?

Database proliferation
Three main reasons:
1. Database proliferation: hundreds of databases at the moment.
2. More and more scientific discoveries result from inter-database analysis and mining.
3. Rising complexity of the required data combinations, e.g. translational medicine, "from bench to bedside" (genomic data vs. clinical data).
Glossary: proliferation = a great and rapid increase in numbers; grid = a network of evenly spaced horizontal and vertical lines; semantic = related to meaning.


Research at the KDBIO group
Algorithms on Strings, Trees and Graphs
Programming and Database Systems
Machine Learning
Understanding genetic regulatory networks
Sequence analysis
Genome analysis
Whole genome sequencing and re-sequencing
Gene expression analysis
Haplotype inference
Genotype-phenotype linkage
Discovery of motifs in DNA and RNA
Improving clinical diagnosis
Genotyping methods
Modeling of metabolic networks
Inference and modeling of regulation networks
Information systems

YEASTRACT www.yeastract.com

YEASTRACT users
YEASTRACT: knowledge, not data

http://geneglob.inesc-id.pt/public/home.jsf PTDC/AGR-GPL/66564/2006 (Jorge Paiva PI, IICT)

SDLink
Web-based data management system
Management and analysis of heterogeneous clinical and biological data
Linked Data: "... a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF."
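As a rough illustration of the Linked Data idea quoted above, the following minimal Python sketch stores RDF-style statements as (subject, predicate, object) triples identified by URIs, then follows links from clinical data to biological data. It is not SDLink's data model: every URI, predicate name, and helper function here is a hypothetical placeholder chosen for the example.

```python
# Hypothetical namespace; none of these URIs come from SDLink.
EX = "http://example.org/"

# Each statement is a (subject, predicate, object) triple of URIs.
triples = {
    (EX + "patient/1", EX + "hasDiagnosis", EX + "disease/d1"),  # clinical
    (EX + "patient/1", EX + "hasVariant", EX + "variant/v1"),    # link
    (EX + "variant/v1", EX + "locatedIn", EX + "gene/g1"),       # biological
}

def objects(subject, predicate):
    """Return all objects linked to `subject` through `predicate`."""
    return sorted(o for s, p, o in triples if s == subject and p == predicate)

def genes_for_patient(patient):
    """Follow patient -> variant -> gene links across the two kinds of data."""
    genes = set()
    for variant in objects(patient, EX + "hasVariant"):
        genes.update(objects(variant, EX + "locatedIn"))
    return sorted(genes)

def to_ntriples(ts):
    """Serialize URI-only triples in N-Triples style, one statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in sorted(ts))

print(genes_for_patient(EX + "patient/1"))
print(to_ntriples(triples))
```

Because every item is named by a URI, records held in different databases can point at each other without sharing a schema, which is the property that makes this style attractive for heterogeneous clinical and biological data.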

Semantic model prototype
Harnessing Genetics and Imaging to Improve Diagnosis and Management of Hypertrophic Cardiomyopathy in Portugal
PTDC/SAU-GMG/112538/2009 (submitted, Alexandra Fernandes PI, CQE IST)

Biology 2.0, Computer Science 2.0, Web 2.0, Medicine 2.0, DNA

KDBIO Group Members
8 PhDs: Ana Teresa Freitas, Arlindo Oliveira, Susana Vinga, Sara Madeira, Paulo Fonseca, Sara Silva, Alexandre Francisco, Luís Russo
3 invited researchers: João Carriço, Jonas Almeida, Marie-France Sagot
12 PhD students
11 graduate fellowships