Medline Text Searching Tools – a Comparison Experiment McDermott Center for Human Growth and Development Center for Biomedical Inventions.

Slides:



Advertisements
Similar presentations
EndNote Web Reference Management Software (module 5)
Advertisements

Search Strategy and Information Retrieval By Rekha Gupta, NIC
Zoology 305 Library Databases/Indexes Lab Goals for session: 1) Meet your librarian Kevin Messner 2) Understand.
Siniša Ivković, Goran Rakočević, Prof. Veljko Milutinovic University of Belgrade School of Electrical Engineering.
NCBI/WHO PubMed/Hinari Course NCBI Literature Databases: PubMed Background.
New Features Update ISI Web of Knowledge. Copyright 2006 Thomson Corporation 2 New features added Mozilla Firefox web browser is now supported New access.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Genome databases and webtools for genome analysis Become familiar with microbial genome databases Use some of the tools useful for analyzing genome Visit.
© Wiley Publishing All Rights Reserved. How Most People Use Bioinformatics.
1.
On line (DNA and amino acid) Sequence Information Lecture 7.
ANALYSING RESEARCH – A GLOBAL PERSPECTIVE Krzysztof Szymanski – Country Manager Thomson Reuters October 2009.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Pharmacy Information Resources TTUHSC Preston Smith Library presents Rev. 08/2014.
BIOINFORMATICS Ency Lee.
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
How to use the web for bioinformatics Molecular Technologies Ethan Strauss X 1171
Bioinformatics and the Engineering Library ASEE 2008 Amy Stout.
Archives and Information Retrieval
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
How to use the web for bioinformatics Molecular Technologies February 11, 2005 Ethan Strauss X 1373
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
IST Computational Biology1 Information Retrieval Biological Databases 2 Pedro Fernandes Instituto Gulbenkian de Ciência, Oeiras PT.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Sequence/Structure Alignment Resources from NCBI Steve Bryant Protein Data Bank Rutgers University November 19, 2005.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
A Study of Cystic Fibrosis Using Web-Based Tools Anuradha Datta Murphy Graduate Student, Dept. of Molecular and Integrative Physiology, University of Illinois.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Bioinformatics.
Searching PubMed® NCBI, NLM Resources, Micromedex -GSBS TTUHSC Preston Smith Library presents Rev. 08/17/14.
STONY BROOK UNIVERSITY HEALTH SCIENCES LIBRARY CHECK OUT THE SERVICES & RESOURCES AVAILABLE TO YOU.
CANDID: A candidate gene identification tool Janna Hutz March 19, 2007.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
Part 1 – PubMed Interface, Display options, Saving, Printing, and ing results. Instructions This part of the course is a PowerPoint demonstration.
MEDLINE for Medical Research Juliet Ralph and César Pimenta Hilary Term 2007.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Presented by Dr. S. C. Jindal Librarian Central Science Library University of Delhi Delhi Information Competency.
Bioinformatics Core Facility Guglielmo Roma January 2011.
Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist in Molecular Biology.
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
EB3233 Bioinformatics Introduction to Bioinformatics.
A collaborative tool for sequence annotation. Contact:
Bioinformatics and Computational Biology
Computer Storage of Sequences
Applied Bioinformatics Week 9 Jens Allmer. Theory I Gene Expression Microarray.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
PubMed/How to Search, Display, Download & (module 4.1)
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
DISCUSSION Using a Literature-based NMF Model for Discovering Gene Functional Relationships Using a Literature-based NMF Model for Discovering Gene Functional.
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Information retrieval and sliding window programs April 5, 2011 Hand in Homework #1. Homework #2 due Tuesday, April 12. Learning objectives- Understand.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
MEDLINE®/PubMed® PubMed for Trainers, Fall 2015 U.S. National Library of Medicine (NLM) and NLM Training Center An introduction.
NCBI PubMed NCBI Literature Databases: PubMed Session #1, April 28, 2005 Session #2, April 29, 2005 Ho Chi Minh City, VietNam.
Text Similarity: an Alternative Way to Search MEDLINE James Lewis, Stephan Ossowski, Justin Hicks, Mounir Errami and Harold R. Garner Translational Research.
Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist.
Biological Databases By: Komal Arora.
Computer Software Lecture 5.
What is Bioinformatics?
Lesson 3 Bioinformatics Laboratory
PubMed Database Interface (Basic Course: Module 4 Part A)
Presentation transcript:

Medline Text Searching Tools – a Comparison Experiment McDermott Center for Human Growth and Development Center for Biomedical Inventions

Your first biomedical experiment should be on a computer - a literature search and a data search!

Databases and Tools to Exploit Them National Center for Biotechnology Information UTSW Bioinformatics Tools – GeneCards - Genome Data Base - US Patent Office - Research Genetics, Stratagene….

Bioinformatics research requires significant hardware and software investment. Hardware – ~120 workstations, 3 Apache www servers, Sun servers, 3 HP Enterprise 4/8 CPU servers, 32 CPU Linux cluster, 3 TB RAID 5 storage system, 3D laser scanners, SGI visualization stations, HP visualization stations. Software – standard genomics search tools, unique new text data mining tools, 3D analysis tools Databases – all major genetics/genomics databases, all of Medline, many custom local database.

Text data mining Genomic Annotation Gene collection identification and analysis Polymorphism Prediction A family of bioinformatics tools and their computed databases have been developed.

Applied Computational Biology Toolset. POMOUS and SNIDE – polymorphism prediction software. PANORAMA – A DNA/Protein sequence analysis and visualization tool. ARROGANT – A gene/clone collection analysis tool. eTBLAST, FRISC, TRITE, Text similarity tools IRIDESCENT – Text data mining tool ARGH – Acronym resolving software ELXR – Exon locator and extractor for resequencing. Microarray BLAST – UTSW BLAST utility for comparison against EST/cDNA sequences from UTSW microarrays. MarC-V, Signal, SNPCEQer …….

Text data mining can speed reduction of data to knowledge. DNA Sequencing Invented Human Genome Project Begun Gap Science (Genome Issue) 15 Oct. 1999

Now over 10 million articles in MEDLINE ® 400,000 new articles added each year Over 1 million are in Genetics and Molecular Biology Online biomedical literature is growing rapidly… Most Biomedical results are reported in scientific papers, that are now searchable. Can you keep up with the literature?

The differences in PubMed and eTBLAST PubMed is database driven PubMed performs a boolean search into Medline. PubMed returns hits sorted by date (and other). eTBLAST is currently written in C. eTBLAST automatically extracts common words from text and then the remainder are keywords. eTBLAST performs a similarity comparison using weighted keywords eTBLAST returns his sorted by similarity (and soon other).

Lets research a topic – Wilms’ Tumor 79) Wilms' tumour A childhood nephroblastoma (solid tumour of the kidney)affecting one in children, usually appearing within the first five years of life. A susceptibility to Wilms' tumour is associated with inheritance of defects in several different genes, including the TUMOUR SUPPRESSOR GENE WT-1 which maps to chromosome 11p13. Tumours arise from mesenchymal STEM CELLS that would normally differentiate into parts of the nephron. Around 5-10% also contain ectopic tissues such as bone and cartilage. Wilms' tumour also appears as one of the manifestations of the WAGR syndrome - a CONTIGUOUS GENE SYNDROME. The WT-1 gene has been cloned and encodes a zinc finger protein which is presumed to be a TRANSCRIPTION FACTOR.

Entrez is a search and retrieval system that integrates information from databases at NCBI. Follow That Link

Our text data mining tools – eTBLAST, FRISC, TRITE eTBLAST (2) – similarity comparison engine for electronic text using weighted keywords, concepts and grammar induction. Psi-eTBLAST is iterative. Example use of natural entry process.eTBLAST(2) Example use of natural entry process. FRISC (2) – using eTBLAST, a UTSW faculty research interests page is checked regularly against new Biomedical abstracts from Medline and ranks to cluster information that best fits interests of researcher.FRISC(2) TRITE (2) (3) (4) – using eTBLAST, topical interests will be searched regularly against new Biomedical abstracts in Medline.TRITE(2) (3) (4)

You are being asked to conduct a series of reference checks on a set of topics. Each student will be given 3 topics to research, out of a total of 120 total different topics. You are asked to research the topics, first, in the standard way using keyword-based searches using PubMed over the web, and then using a new code, eTBLAST, that performs keyword identification and searching automatically. You are asked to research each of the topics, reading their titles, abstracts and any other information to determine the relative sensitivity and selectivity of each of the methods for finding the documents. You will also be asked to compare and contrast the results of the approaches based on other criteria, like speed, user interface, etc.

What you will be doing. Please obtain floppy disk, instructions and 3 topic sheets. Find a computer, some on North Campus Library, South Campus Library, and the computer training room. Fill in the spread sheet with the results of your search on the topic. Then write a brief paragraph in word to compare and contrast the two methods. Come back to Lacynda’s office (NA2.504) to return the floppy disk. She will then log the disk in, and open it and verify that the files are there and complete. You are then free to go.