Bioinformatics and Biostatistics in Limagrain / Biogemma

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
System Biology October 2013 Gustavo de Souza IMM, OUS.
LESSON 1: What is Genetic Research? PowerPoint slides to accompany Using Bioinformatics : Genetic Research.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
6 Mark Tester Australian Centre for Plant Functional Genomics University of Adelaide Research developments in genetically modified grains.
Bioinformatics What is bioinformatics? Why bioinformatics? The major molecular biology facts Brief history of bioinformatics Typical problems of bioinformatics:
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Gene expression analysis summary Where are we now?
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2015 Xiaole Shirley Liu Please Fill Out Student Sign In.
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
Pharmacogenomics and personalized medicines Jean-Marie Boeynaems
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
 The institute started in 1989 as a UNDP funded project called the National Agricultural Genetic Engineering Laboratory (NAGEL).  The Agricultural.
Bioinformatics.
ARC Biotechnology Platform: Sequencing for Game Genomics Dr Jasper Rees
CEITEC BRNO | CZECH REPUBLIC central european institute of technology CEITEC Genomics and proteomics at MU Jiří Fajkus.
Syngenta Biotechnology
Detecting enriched regions (Chip- seq, RIP-seq) Statistical evaluation of enriched regions Data displayed in Genome Browser Detection of enriched motifs.
Center for Human Health and the Environment
Network requirements from Ukrainian Biotechnology communities Lubov N. Shynkarenko FBB.
讲 座 提 纲讲 座 提 纲 1 什么是分子育种 2 历史回顾 3 全基因组策略 4 基因型鉴定 5 表现型鉴定 6 环境型鉴定 (etyping) 7 标记 - 性状关联分析 8 标记辅助选择 9 决策支撑系统 10 展望.
Introduction of Plant Biotechnology
Integrating the Bioinformatic Technology Group into your research programme Introduction People and Skills Examples Integrating the BTG Contacts BHRC Away.
BREEDING AND BIOTECHNOLOGY. Breeding? Application of genetics principles for improvement Application of genetics principles for improvement “Accelerated”
System Level Science and System Level Models Ian Foster Argonne National Laboratory University of Chicago Improving IAM Representations of a Science-Driven.
Harbin Institute of Technology Computer Science and Bioinformatics Wang Yadong Second US-China Computer Science Leadership Summit.
Introduction to Bioinformatics (Lecture for CS397-CXZ Algorithms in Bioinformatics) Jan. 21, 2004 ChengXiang Zhai Department of Computer Science University.
Network requirements from Ukrainian Biotechnology communities Lubov N. Shynkarenko FBB.
Genomics and Forensics
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
TOXICOGENOMICS.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
High throughput biology data management and data intensive computing drivers George Michaels.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Milanesi Luciano Catania, Italy 13/03/2007 Bioinformatics challenges in European projects in Grid. Milanesi Luciano National Research Council Institute.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Advances and challenges in computational modeling and statistical learning of biological systems Qi Liu Department of Biomedical Informatics Vanderbilt.
MarketsandMarkets Presents Bioinformatics Market worth $7.5 Billion By 2017
Genomics and the Growing World Steve Rounsley Dow Agrosciences.
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2016 Xiaole Shirley Liu.
Million Veteran Program: Industry Day Genomic Data Processing and Storage Saiju Pyarajan, PhD and Philip Tsao, PhD Million Veteran Program: Industry Day.
Published: Aug 2017 Single User PDF: US$ 2500 No. of Pages: 499
Post-GWAS and Mechanistic Analyses
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
The effect of using sequence data instead of a lower density SNP chip on a GWAS EAAP 2017; Tallinn, Estonia Sanne van den Berg, Roel Veerkamp, Fred van.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Proteomics Informatics David Fenyő
Schedule for the Afternoon
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Multi-Omics of Single Cells: Strategies and Applications
University of Wisconsin, Madison
Strategic command of living processes
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Jan – Dec RuminOmics Connecting the animal genome, the intestinal microbiome and nutrition to enhance the efficiency of ruminant.
Proteomics Informatics David Fenyő
Alisdair R. Fernie, Jianbing Yan  Molecular Plant 
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
M-H Pinard-van der Laan
Precision animal breeding
MarketsandMarkets Presents Agrigenomics Market by Application & Region - Global Forecast 2021.
Biotechnology & Bioinformatics
Presentation transcript:

Bioinformatics and Biostatistics in Limagrain / Biogemma JOBIM Conference, July 2015

An international agricultural cooperative group 4th largest seed company worldwide Nearly 2,000 farmer members Sales of nearly 2 billion Euros Nearly 9,000 employees Subsidiaries in 42 countries 13.5% of turnover re-invested in research A portfolio of strong brands

A group that specializes in seeds and cereal products Field Seeds Field Seeds Limagrain Coop Vegetable Seeds Vegetable Seeds Cereal Products Bakery Products Garden Products Cereal Ingredients

A European group open to the world 64% of sales 64% of workforce Nearly 9,000 employees 66 nationalities 69% of sales achieved outside France Subsidiaries in 42 countries 23% of sales 16% of workforce 7% of sales 12% of workforce Americas Asia & Pacific 6% of sales 8% of workforce Africa & Middle East

An innovative group 13.5% of turnover invested in research 200 M€ with collabora- tions) 13.5% 10.2%* 5.4%* 2.25%* Average industry Automobile industry Pharmaceutical industry Limagrain * Source : Leem - April 2013

BIOGEMMA, a research partnership Biotechnologies 9.5% 16% 55 % 10% Field Seeds

Biogemma Identification of genes associated with agronomic traits Development of GM varieties in cereals Development of tools and knowledge BIOINFORMATICS |

Bioinformatics for breeding Molecular Breeding Biostatistics Discover Associations Bioanalysis Explain Associations Tools Bioinformatics db Analyze NGS-based data Develop databases and tools to store and analyse biological data

HPLC Crystallo-graphy Omics analysis Phenotype Environment Chromatin Silencing Regulation of transcription miRNA, siRNA Protein modification, interaction, turnover Regulation of translation RNA stability What we measure Markers mRNA Transcription levels, DGE Protein Quantity, Activity levels Trait Phenome Regulation of expression How we Genotyping Sequencing RNA-Seq microarrays HPLC Crystallo-graphy IA, NIR, HPLC, eyeball DNA Genes, Genomes Biological material RNA mRNA, rRNA Transcriptome Enzyme Proteome Metabolome Transcription Translation Expression LD mapping, GWAS, GS

A great deal of complex information to correlate Environment Genotype Phenotype Data processing tools getting more and more sophisticated

Data analysis & processing Data Life Cycle Data production & acquisition Results interpretation & decision support field trials predicting cross value genotyping sequencing genomics LIMS, databases evaluation of individuals data retrieval quality control building predictive model statistical analyses Data analysis & processing

Data production & acquisition Sequencing NGS based: whole genome, targeted sequencing, transcriptome Deliverables: SNP, structural variations, gene expression level, genomes Genotyping High density chips 103 – 105 SNP 105 samples Automate calling / quality control Steem_Z30_rep1 Steem_Z30_rep2 Steem_Z32_rep1 Steem_Z32_rep2 Steem_Z65_rep1 Steem_Z65_rep2

Data production & acquisition Phenotypic data Automate data collection Sensors, images, NIR spectrometry… Adjustments/corrections by geostatistical methods Extraction of relevant information

Data production & acquisition Environmental data Local / internal: Sensors, airborne imagery, … Global / external: Databases, internet, satellite images, … Precise description of the growing conditions Air temperature Relative humidity Dew point

Modelling Molecular data Cost  Availability  Predict: genotype  phenotype QTL/GWAS – identify genomic regions involved genomic selection – "black box" approach

Modelling Statistical methods Linear mixed models Bayesian approaches More and more complex models GxE Epistasis  computationally intensive methods (from Van Eeuwijk et al., 2010)

Data management Integrative viewer for genomic data Databases BIG DATA: large volume of structured and unstructured data

Infrastructure Local on-the-premises computing "data-centric computing" Central enterprise resources Security NGS data analysis on BIOGEMMA HPC (912 cores) Elastic (cloud) flexibility low cost / hour CPU

Take Home Messages Bioinformatics: a major activity supporting a large range of applications in Limagrain Genomics Phenomics Enviromics Biostatistics, Modelling and Prediction Big Data (HPC, data management) Both R&D and Applied In a highly competitive and challenging research area Pied de page

More information… Pied de page

Thank you