Batyr Charyyev.

Slides:



Advertisements
Similar presentations
Bayesian network for gene regulatory network construction
Advertisements

Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
DI FC UL1 Gene Function Prediction by Mining Biomedical Literature Pooja Jain Master in Bioinformatics Supervisor - Mário Jorge Costa Gaspar.
SING* and ToNC * Scientific Foundations for Internet’s Next Generation Sirin Tekinay Program Director Theoretical Foundations Communication Research National.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Mining the Medical Literature Chirag Bhatt October 14 th, 2004.
The LINDI Project Linking Information for New Discoveries UIs for building and reusing hypothesis seeking strategies. Statistical language analysis techniques.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Preparing for the Verbal Reasoning Measure. Overview Introduction to the Verbal Reasoning Measure Question Types and Strategies for Answering General.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Defining Text Mining Preprocessing Transforming unstructured data stored in document collections into a more explicitly structured intermediate format.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Flexible Text Mining using Interactive Information Extraction David Milward
Creating Metabolic Network Models using Text Mining and Expert Knowledge J.A. Dickerson, D. Berleant, Z. Cox, W. Qi, and E. Wurtele Iowa State University.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Playing Biology ’ s Name Game: Identifying Protein Names In Scientific Text Daniel Hanisch, Juliane Fluck, Heinz-Theodor Mevissen and Ralf Zimmer Pac Symp.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 3. Word Association.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Introduction to biological molecular networks
Opportunities for Text Mining in Bioinformatics (CS591-CXZ Text Data Mining Seminar) Dec. 8, 2004 ChengXiang Zhai Department of Computer Science University.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining knowledge from natural language texts using fuzzy associated concept mapping Presenter : Wu,
A Patent Document Retrieval System Addressing Both Semantic and Syntactic Properties Liang Chen*,Naoyuki Tokuda+, Hisahiro Adachi+ *University of Northern.
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
Funded by the Library of Congress.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Compiling Information and Inferring Useful Knowledge for Systems Biology by Text Mining the Literature Anália Lourenço IBB – Institute for Biotechnology.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Learning Bayesian Networks for Complex Relational Data
Mohammad Alqahtani, Dr. Eric Atwell
Business process management (BPM)
An Efficient Algorithm for Incremental Update of Concept space
Genomics research paper presentation
Biomedical Text Mining and Its Applications
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
Business process management (BPM)
STRING Large-scale data and text mining
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Personalized Social Image Recommendation
Terminology problems in literature mining and NLP
Preparing for the Verbal Reasoning Measure
Multimedia Information Retrieval
Search Techniques and Advanced tools for Researchers
Huayi Zhagn and Haiyan Liang
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
CSE 635 Multimedia Information Retrieval
Ying Dai Faculty of software and information science,
Web Mining Department of Computer Science and Engg.
Introduction to Information Retrieval
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Panagiotis G. Ipeirotis Luis Gravano
Network biology An introduction to STRING and Cytoscape
PolyAnalyst Web Report Training
Ying Dai Faculty of software and information science,
Network Models Michael Goodrich Some slides adapted from:
Presentation transcript:

Batyr Charyyev

Gene Relation Network Constructed by Mining Biomedical Literature Abstracts. As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Gene Relation Network Constructed by Mining Biomedical Literature Abstracts. As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Gene Relation Network Constructed by Mining Biomedical Literature Abstracts. Amount of data Unstructured format As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Gene Relation Network Constructed by Mining Biomedical Literature Abstracts. Amount of data Unstructured format Why it is important? Easy analyze and interpret data. Provide full data. Comprehensive view. As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Gene Relation Network Constructed by Mining Biomedical Literature Abstracts. Information retrieval Information extraction Generating relation network Three main components As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Information retrieval Source: PubMed-NCBI Technique: Number of co-occurences of phrases Challenges: Irrelevant papers. As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Information extraction Source: Abstracts Techniques: Rule based extraction, structure and semantics of sentence Challenges: Greek letters and acronyms As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Generating relation network Source: Output of our second component Relation: Activation, suppressing. Challenges: Verification of results. As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Related works “Text Mining Biomedical Literature for Constructing Gene Regulatory Networks” by Young Ling Song and Su Shing Chen “Context-rich biological network constructed by mining PubMed abstracts” by Hao Chen and Burt Sharp “How to infer gene networks from expression profiles” by Mukesh Bansal, Vincenzo Belcastro, Alberto Ambesimpiobato, Diego di Bernardo As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Text mining Biomedical Literature for Constructing Gene Regulatory Networks Goal: Automatically mine the data to extract gene regulatory information and construct gene regulatory network. Components: Text Mining Biomedical literature, Online tools, Analysis tools Source: Full-text paper Techniques: Rule-based Result: Precision and Recall As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Text mining Biomedical Literature for Constructing Gene Regulatory Networks As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Content-rich biological network constructed by mining PubMed abstracts Goal: Construct content-rich relationship networks between genes, proteins, drugs and biological concepts. Source: Abstracts of paper Techniques: Synonyms of user entered queries As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Content-rich biological network constructed by mining PubMed abstracts As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

How to infer gene networks from expression profiles Goal: Compare different algorithms and show they are indeed able to infer regulatory interactions among genes. Algorithms: Coexpression networks and clustering, Bayesian networks, Information theoretic approach. As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Coexpression networks and clustering Idea: Grouping genes with similar expression profiles in cluster As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Bayesian network Idea: Describe relationship among random genes with joint probability distribution. As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Information-theoretic approach Idea: Compute Mutual Information(MI) for each pairs of genes and put edge between them if it is greater than threshold. MI is symmetric As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

What is next? Focus on relation network component Start basic proceed incrementally Information theory model Small sample As the largest man-made complex network, the Internet grows with no central authority. Thousands of small and medium size Autonomous Systems connect individuals, businesses, universities, and agencies while focusing on optimizing their own communication efficiency and economic objectives make it challenging to maintain and control the growth

Questions