Problem Statement and Motivation Key Achievements and Future Goals Technical Approach Investigators: Yang Dai Prime Grant Support: NSF High-throughput.

Presentation transcript:

Investigators: Yang Dai
Prime Grant Support: NSF

Problem Statement and Motivation
- High-throughput experiments generate new protein sequences with unknown function.
- In silico protein function prediction is needed.
- Protein subcellular localization is a key element in understanding function.
- Such a prediction can be made from protein sequences with machine learners.
- Feature extraction and scalability of the learner are key.

Key Achievements and Future Goals
- Developed highly sophisticated sequence coding methods.
- Developed an integrated multi-classification system for protein subcellular localization.
- Developed a preliminary multi-classification system for subnuclear localization.
- Will incorporate knowledge from other databases into the current framework.
- Will design an integrative system for protein function prediction based on information about protein localization, gene expression, and protein-protein interactions.

Technical Approach
- Use the Fast Fourier Transform to capture long-range correlation in protein sequences.
- Design a class of new kernels to capture subtle similarity between sequences.
- Use domains and motifs of proteins as coding vectors.
- Use a multi-classification system based on a deterministic machine learning approach, such as a support vector machine.
- Use a Bayesian probabilistic model.

Figure (diagram labels): sequences (MASVQLY... …HKEPGV); text file of protein description; coding vectors; machine learner; specific subcellular and subnuclear localization.
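The Fast Fourier Transform step in the Technical Approach can be illustrated with a minimal sketch. It assumes the sequence is first mapped to numbers with the Kyte-Doolittle hydropathy scale; the slide does not say which encoding was used, so that choice, the zero-padding, and the function names `fft` and `power_spectrum` are illustrative assumptions rather than the investigators' method.

```python
import cmath

# Kyte-Doolittle hydropathy values. Using this scale is an assumption;
# the slide does not name the numeric encoding.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def fft(values):
    """Recursive radix-2 Cooley-Tukey FFT; len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return [complex(values[0])]
    even = fft(values[0::2])
    odd = fft(values[1::2])
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def power_spectrum(sequence):
    """Encode a protein sequence numerically and return its power spectrum."""
    signal = [KYTE_DOOLITTLE[aa] for aa in sequence]
    mean = sum(signal) / len(signal)
    signal = [x - mean for x in signal]  # remove the constant offset
    size = 1
    while size < len(signal):  # zero-pad to the next power of two
        size *= 2
    signal += [0.0] * (size - len(signal))
    spectrum = fft(signal)
    # Keep the non-redundant half of the spectrum of a real signal.
    return [abs(c) ** 2 for c in spectrum[: size // 2 + 1]]


if __name__ == "__main__":
    print(power_spectrum("MASVQLY"))
```

Peaks at low frequency indices correspond to patterns that repeat over long stretches of the sequence, which is one way such a spectrum could serve as a fixed-length coding vector for a machine learner.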

Investigator: Jie Liang, Ph.D., Bioengineering
Prime Grant Support: National Science Foundation Career Award, National Institutes of Health R01, Office of Naval Research, and the Whitaker Foundation

Problem Statement and Motivation
The structures of proteins provide rich information about how cells work. With the success of structural genomics, we will soon have all human proteins mapped to structures. However, we need to develop computational tools to extract information from these structures to understand how cells work and how new diseases can be treated. The development of computational tools for surface matching and for function prediction will therefore open the door to many new developments for health improvement.

Key Achievements and Future Goals
We have developed a web server, CASTP (cast.engr.uic.edu), that identifies and measures protein surfaces. It has been used by thousands of scientists worldwide. We have built a protein surface library for >10,000 proteins, and have developed models to characterize cross-reactivities of enzymes. We have also developed methods for designing phage libraries for the discovery of peptide drugs, and methods for predicting structures of beta-barrel membrane proteins. Future: understand how proteins fold and assemble, and design methods for engineering better proteins and drugs.

Technical Approach
We use geometric models and fast algorithms to characterize surface properties of over thirty protein structures. We develop evolutionary models to understand how proteins evolve to acquire different functions using different combinations of surface textures. Efficient search methods and statistical models allow us to identify very similar surfaces on totally different proteins. Probabilistic models and sampling techniques help us understand how proteins work to perform their functions.

Figure (panel titles): Protein surface matching; Evolution of function.

Investigators: Hui Lu, Bioengineering
Prime Grant Support: NIH, DOL

Problem Statement and Motivation
- Proteins interact with other biomolecules to perform their functions: DNA/RNA, ligands, drugs, membranes, and other proteins.
- A high-accuracy prediction of the protein interaction network will provide a global understanding of gene regulation, protein function annotation, and the signaling process.
- The understanding and computation of protein-ligand binding have a direct impact on drug design.

Key Achievements and Future Goals
- Developed the DNA binding protein and binding site prediction protocols that have the best accuracy available.
- Developed transcription factor binding site prediction.
- Developed the only protocol that predicts protein membrane binding behavior.
- Will work on drug design based on structural binding.
- Will work on the signaling protein binding mechanism.
- Will build a complete protein-DNA interaction prediction package and a Web server.

Technical Approach
- Data mining of protein structures
- Molecular dynamics and Monte Carlo simulations
- Machine learning
- Phylogenetic analysis of interaction networks
- Gene expression data analysis using clustering
- Binding affinity calculation using statistical physics

Figure (labels): Protein-DNA complex: gene regulation, DNA repair, cancer treatment, drug design, gene therapy.

Investigators: Hui Lu, Ph.D., Bioengineering
Primary Grant Support: Chicago Biomedical Consortium, NIH

Problem Statement and Motivation
- To function efficiently, cells need to respond properly to external physical and chemical signals in their environment.
- Identifying disease states and designing drugs require a detailed understanding of the internal signaling networks that are activated in response to external stimuli.
- At the center of these processes is a particular group of proteins that translocate to the cell membrane upon external activation.

Key Achievements and Future Goals
- Developed highly accurate prediction protocols for identifying novel cases of membrane binding proteins, based on properties calculated from the molecular surface of the protein structure.
- Determining membrane binding properties of C2 domains in response to changes in ion placements and membrane lipid composition.
- Goal: To model the network dynamics to understand how changes in the membrane binding properties of certain domains change the efficiency of signal transduction in the cell.

Technical Approach
- Combine machine learning techniques with characterization of the protein surface to identify unknown membrane binding proteins.
- Atomic-scale molecular dynamics simulation of the interactions between proteins and membranes.
- Mathematical modeling is used to study the spatial and dynamic evolution of the signal transduction networks within the cell when changes in the external environment occur.

Investigators: Hui Lu, Ph.D., Robert Ezra Langlois, Ph.D., Bioengineering
Grant Support: NIH, Bioinformatics online

Problem Statement and Motivation
- Massive amounts of biomedical data are available from high-throughput measurements, such as genome sequence, proteomics, biological pathway, network, and disease data.
- Data processing has become the bottleneck of biological discovery and medical analysis.
- Problem: Protein function prediction, protein functional site prediction, protein interaction prediction, disease network prediction, biomarker discovery.

Key Achievements and Future Goals
- Developed machine learning algorithms for protein-DNA, protein-membrane, protein structure prediction, disease-causing SNP prediction, mass-spec data processing, and DNA methylation prediction.
- Developed an open-source machine learning software package, MALIBU.
- Goal: Biological network analysis and prediction.

Technical Approach
- Formulate the problem as a classification problem.
- Derive features to represent biological objects.
- Develop various classification algorithms.
- Develop multiple-instance boosting algorithms.
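The first two steps of the Technical Approach (formulate a classification problem, derive features for biological objects) can be sketched as follows. This is a toy illustration under stated assumptions: the features are plain amino-acid composition, the classifier is a nearest-centroid rule swapped in for simplicity (it is not the multiple-instance boosting approach or MALIBU), and the sequences and labels are made up for the example.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence):
    """Return the 20-dimensional amino-acid composition of a sequence.

    Assumes the sequence contains at least one standard residue.
    """
    counts = [sequence.count(aa) for aa in AMINO_ACIDS]
    total = sum(counts)
    return [c / total for c in counts]


def train_centroids(labelled_sequences):
    """Average the feature vectors of each class into one centroid per label."""
    sums, sizes = {}, {}
    for sequence, label in labelled_sequences:
        features = composition(sequence)
        acc = sums.setdefault(label, [0.0] * len(AMINO_ACIDS))
        for i, value in enumerate(features):
            acc[i] += value
        sizes[label] = sizes.get(label, 0) + 1
    return {label: [v / sizes[label] for v in acc] for label, acc in sums.items()}


def classify(sequence, centroids):
    """Assign the label whose centroid is closest in Euclidean distance."""
    features = composition(sequence)
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


# Hypothetical toy training set, invented purely for illustration.
TOY_TRAINING_SET = [
    ("KKRRKRKK", "binding"),
    ("RKKRGRKK", "binding"),
    ("LLVVAILL", "non-binding"),
    ("AVLLIVAG", "non-binding"),
]

if __name__ == "__main__":
    centroids = train_centroids(TOY_TRAINING_SET)
    print(classify("KRKKRR", centroids))
```

The same structure carries over to richer features and stronger learners: only `composition` and `classify` would need to change, while the problem formulation (sequence in, class label out) stays the same.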

Investigator: Hui Lu, Ph.D., Bioengineering
Collaborators: Julio Fernandez (Columbia University), Hongbin Li (U of British Columbia)

Problem Statement and Motivation
- Mechanical signals play a key role in physiological processes by controlling protein conformational changes.
- Uncover design principles of mechanical protein stability.
- Relationship between protein structure and mechanical response; deterministic design of proteins.
- An atomic-level understanding is needed for biological understanding and protein design principles.

Key Achievements and Future Goals
- Identified the key force-bearing patch that controls the mechanical stability of proteins.
- Discovered a novel pathway switch mechanism for tuning protein mechanical properties.
- Calculated how different solvents affect a protein's mechanical resistance.
- Goal: Computationally design protein molecules with specific mechanical properties for bio-signaling and bio-materials.

Technical Approach
- All-atom computational simulation of protein conformational changes: Steered Molecular Dynamics.
- Free energy reconstruction from non-equilibrium protein unfolding trajectories.
- Force partition calculation for mechanical load analysis.
- Modeling solvent-protein interactions for different molecules.
- Coarse-grained model with molecular dynamics and Monte Carlo simulations.
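For the free energy reconstruction step, one standard route from non-equilibrium pulling data to an equilibrium free energy difference is the Jarzynski equality, ΔF = -kT ln⟨exp(-W/kT)⟩, where W is the work done in each repeated pulling trajectory. The slide does not say which estimator was used, so the sketch below is an assumption about the general technique, and the function name `jarzynski_free_energy` is hypothetical.

```python
import math


def jarzynski_free_energy(work_values, kT):
    """Estimate dF = -kT * ln(mean(exp(-W / kT))) from repeated pulls.

    work_values: work measured along each non-equilibrium trajectory,
                 in the same energy units as kT.
    kT:          thermal energy (Boltzmann constant times temperature).
    """
    exponents = [-w / kT for w in work_values]
    shift = max(exponents)  # log-sum-exp shift for numerical stability
    mean_exp = sum(math.exp(e - shift) for e in exponents) / len(exponents)
    return -kT * (shift + math.log(mean_exp))


if __name__ == "__main__":
    # Illustrative work values only, not measured or simulated data.
    print(jarzynski_free_energy([3.0, 5.0, 7.0], kT=0.6))
```

Because the exponential average is dominated by the rare low-work trajectories, the estimate is never larger than the mean work, and in practice it needs many trajectories to converge.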