Download presentation
Presentation is loading. Please wait.
Published byEaster Washington Modified over 9 years ago
1
Copyright OpenHelix. No use or reproduction without express written consent1
2
2 XplorMed Software For Text Mining Abstracts Materials prepared by: Mary Mangan, Ph.D. www.openhelix.com Updated: Q3 2010 Version 2
3
Copyright OpenHelix. No use or reproduction without express written consent3 XplorMed Agenda XplorMed: www.ogic.ca/projects/xplormed/ Introduction & Credits Yellow Gate: PubMed Query Yellow Gate: Relations Green Gate: Stored Abstracts Red Gate: Identifiers Summary Exercises
4
Copyright OpenHelix. No use or reproduction without express written consent4 Text Mining Abstracts for Relevant Relationships XplorMed: get more from a PubMed search Text mining (statistical) PubMed abstracts XplorMed relevant word associations and context Discover new and more relevant relationships among the literature; use context to make better choices for reading
5
Copyright OpenHelix. No use or reproduction without express written consent5 Why Use XplorMed? PubMed is a great collection, but search results are often daunting…hundreds or thousands of abstracts Which to read? Will you miss some? Titles + guess? Example query: depression AND hypothyroid
6
Copyright OpenHelix. No use or reproduction without express written consent6 www.ogic.ca/projects/xplormed/ 3 Types of Queries or “Gates”: Yellow, Green, Red yellow r Input: identifiers Original site: Current site: www.bork.embl-heidelberg.de/xplormed/ yg Input: MEDLINE query Input: files of saved abstracts
7
Copyright OpenHelix. No use or reproduction without express written consent7 Overview of an XplorMed Analysis Analysis completed in a series of steps Sample: depression AND hypothyroid Many iterations possible to refine the set of abstracts 1. Start query 2. Select MeSH categories of interest 3. Find related words 4. Context, or iterate… Identify relevant abstracts
8
Copyright OpenHelix. No use or reproduction without express written consent8 Credits, References & Contact Information Developed at Peer Bork’s lab at EMBL Papers by: Carolina Perez-Iratxeta, Antonio Perez, HS Keer, Miguel Andrade and Peer Bork
9
Copyright OpenHelix. No use or reproduction without express written consent9 XplorMed Agenda XplorMed: www.ogic.ca/projects/xplormed/ Introduction & Credits Yellow Gate: PubMed Query Yellow Gate: Relations Green Gate: Stored Abstracts Red Gate: Identifiers Summary Exercises
10
Copyright OpenHelix. No use or reproduction without express written consent10 Yellow Gate: Start with a PubMed Query Yellow: plain PubMed query www.ogic.ca/projects/xplormed/
11
Copyright OpenHelix. No use or reproduction without express written consent11 Yellow Gate: Step 1, Options Text search, examples shown Or retrieve a previous search (stored 1 week) Submit: sort abstracts according to MeSH category step 1 enter text; can use Booleans searches stored click to submit marys_query1 See Entrez PubMed documentation, or OpenHelix’s PubMed tutorial
12
Copyright OpenHelix. No use or reproduction without express written consent12 Tips on Words to Use in Searching Word: use the “lemma” form of a word Lemma: Nouns: the singular form = gene [not genes] Stop words: a list of non-helpful words, which are ignored http://www.ogic.ca/projects/xplormed/stopwords.txt and, the, or, … analyze, between, only, … TreeTagger (Schmid) www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html
13
Copyright OpenHelix. No use or reproduction without express written consent13 Sample Query: Depression AND Hypothyroid Yellow gate to use keywords in a PubMed search depression AND hypothyroid Step 1: just enter the text, and click NEXT ACTION enter textclick button
14
Copyright OpenHelix. No use or reproduction without express written consent14 MeSH Terms - Organizing your Results MeSH categories (Medical Subject Headings) National Library of Medicine (US NIH) Terms assigned to literature by professionals http://www.nlm.nih.gov/mesh/meshhome.html MeSH terms assigned to a record:
15
Copyright OpenHelix. No use or reproduction without express written consent15 Yellow Gate Step 1 Results Query: depression AND hypothyroid Step 1 results shown MeSH categories; same abstract can be in multiple categories
16
Copyright OpenHelix. No use or reproduction without express written consent16 Yellow Gate, Next Step: Categories of Interest You can use the whole set of abstracts, or Select a subset of interesting categories with checkboxes Sample: Diseases, Chemicals and Drugs, Psychiatry and Psychology date range, if desired
17
Copyright OpenHelix. No use or reproduction without express written consent17 Yellow Gate: Resulting Words Related words from abstracts displayed Ranked by score How much the word appears with others
18
Copyright OpenHelix. No use or reproduction without express written consent18 Association Score Perez-Iratxeta et al, BioTechniques 32(6): 1380 Computing fuzzy associations for the analysis of biological literature Relatedness: Essentially, the ratio of the number of abstracts that contain 2 words, compared to the number where either one or the other occurs. Keywords: Essentially, more relevant words have more strong relations to other words. You can count the co-occurances and create a score. http://www.ncbi.nlm.nih.gov/pubmed/12074170?dopt=Abstract
19
Copyright OpenHelix. No use or reproduction without express written consent19 Yellow Gate: Resulting Words Related words from abstracts displayed Ranked by Score Click word for context To PubMed
20
Copyright OpenHelix. No use or reproduction without express written consent20 XplorMed Agenda XplorMed: www.ogic.ca/projects/xplormed/ Introduction & Credits Yellow Gate: PubMed Query Yellow Gate: Relations Green Gate: Stored Abstracts Red Gate: Identifiers Summary Exercises
21
Copyright OpenHelix. No use or reproduction without express written consent21 Yellow Gate: Other Options Click [R] for relations to other words Click [X] for co- relationships
22
Copyright OpenHelix. No use or reproduction without express written consent22 Examined Shared Words and Context Using [X] If words appear in the same abstract, results shown Words in same sentence: sentence is BLUE If words are immediately adjacent, MAGENTA Links to complete abstract at PubMed
23
Copyright OpenHelix. No use or reproduction without express written consent23 Yellow Gate: Next Step = Find Chains From word list set, compute chains Chains are an ordered set of words Alpha: ↑ for fewer connections Score: ↑ for fewer relations Alpha, strength value Score, threshold Click here
24
Copyright OpenHelix. No use or reproduction without express written consent24 Chains of Related Words 2 chains found in this example Checkbox to proceed, rank by chains Large star denotes a review article
25
Copyright OpenHelix. No use or reproduction without express written consent25 Chains, with Extra Features Also with the chains, you can get other data types added to your output: OMIM, SwissProt, or SMART Diagrams will highlight available linked items
26
Copyright OpenHelix. No use or reproduction without express written consent26 Chains, Other Options Collect MeSH terms related to these abstracts Diseases shown, others available Must scroll down past abstracts to see these results
27
Copyright OpenHelix. No use or reproduction without express written consent27 Iterate Run XplorMed again on your results… Select YES to add “neighbors” of the papers you chose Go to the next level
28
Copyright OpenHelix. No use or reproduction without express written consent28 XplorMed Agenda XplorMed: www.ogic.ca/projects/xplormed/ Introduction & Credits Yellow Gate: PubMed Query Yellow Gate: Relations Green Gate: Stored Abstracts Red Gate: Identifiers Summary Exercises
29
Copyright OpenHelix. No use or reproduction without express written consent29 Green Gate: Start with Saved Abstracts If you already have a saved set Medline, EndNote, XML, or XplorMed format Upload file to begin file name/ location Click here on XplorMed homepage
30
Copyright OpenHelix. No use or reproduction without express written consent30 Input Abstract File Options: Medline Send to: File as “MEDLINE” or “XML” format PubMed Advanced Search: “microtubule AND muscle” Example: microtubule AND muscle, Limit to 1 year www.pubmed.gov
31
Copyright OpenHelix. No use or reproduction without express written consent31 Sample Query with My Saved File 1. Locate pubmed-results.txt file with Browse button 2. Indicate format used 3. Click “Sort abstracts…” button to submit 1 3 2
32
Copyright OpenHelix. No use or reproduction without express written consent32 Green Gate: Results with My Sample Saved Set Outcome of green gate search with saved abstracts Click for related words, proceed as in Yellow Gate searches proceed
33
Copyright OpenHelix. No use or reproduction without express written consent33 XplorMed Agenda XplorMed: www.ogic.ca/projects/xplormed/ Introduction & Credits Yellow Gate: PubMed Query Yellow Gate: Relations Green Gate: Stored Abstracts Red Gate: Identifiers Summary Exercises
34
Copyright OpenHelix. No use or reproduction without express written consent34 Red Gate: Start with Identifiers You can use a variety of IDs to collect abstracts Examples shown: SwissProt, OMIM, more… Click here on XplorMed homepage
35
Copyright OpenHelix. No use or reproduction without express written consent35 Red Gate: Example Starting with an OMIM ID Sample query: OMIM 608516, Major Depressive Disorder, MDD 608516
36
Copyright OpenHelix. No use or reproduction without express written consent36 Red Gate: Results Collection of abstracts, categorized Proceed with subsequent steps as for yellow, green gates proceed
37
Copyright OpenHelix. No use or reproduction without express written consent37 XplorMed Agenda XplorMed: www.ogic.ca/projects/xplormed/ Introduction & Credits Yellow Gate: PubMed Query Yellow Gate: Relations Green Gate: Stored Abstracts Red Gate: Identifiers Summary Exercises
38
Copyright OpenHelix. No use or reproduction without express written consent38 Text Mining in Abstracts: Refine Your Searches Input: MEDLINE query www.ogic.ca/projects/xplormed/ Input: Identifiers Input: files of abstracts
39
Copyright OpenHelix. No use or reproduction without express written consent39 Keywords, Context & Relationships Pinpoint only relevant abstracts
40
Copyright OpenHelix. No use or reproduction without express written consent40 XplorMed Agenda XplorMed: www.ogic.ca/projects/xplormed/ Introduction & Credits Yellow Gate: PubMed Query Yellow Gate: Relations Green Gate: Stored Abstracts Red Gate: Identifiers Summary Exercises
41
Copyright OpenHelix. No use or reproduction without express written consent41
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.