WP5 – Platform - Objective To design, build, establish, run, and sustain the Platform, and take primary responsibility for dissemination of the project.

Slides:



Advertisements
Similar presentations
Search Techniques. It is imperative students use proper techniques when searching information on a computer system. It is imperative students use proper.
Advertisements

Introduction to Mendeley. What is Mendeley? Mendeley is a reference manager allowing you to manage, read, share, annotate and cite your research papers...
Search Techniques Boolean Logic and Keyword Searching.
1 2/14/05CS120 The Information Era Searching the Web Don’t we already know how to do this?
Quiz & Library Day Jared Peet. Warm Up We will begin our quiz as soon as class starts. Please remove EVERYTHING from your desk EXCEPT a pen. You will.
PubMed and its search options Jan Emmerich, Sonja Jacobi, Kerstin Müller (5th Semester Library Management)
Advanced Searching Engineering Village.
M.L.A. CITATION WORKSHOP FINDING PRINT RESOURCES ON YOUR TOPIC -books -anthologies -magazines articles - newspaper articles.
Engineering Village ™ ® Basic Searching On Compendex ®
Learn how to search for information the smart way Choose your own adventure!
Chapter 5 Searching for Truth: Locating Information on the WWW.
Web Searching. Web Search Engine A web search engine is designed to search for information on the World Wide Web and FTP servers The search results are.
Planned Giving Design Center. What is the Planned Giving Design Center? National network of websites dedicated to advancing philanthropy.
Chapter 5 Searching for Truth: Locating Information on the WWW.
Search Engines By: Big Cat Jaime DeBartolo, Rachel Adams, Michelle Knapp.
To begin—get their attention: New Spice, Study Like a Scholar
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
The internet is the richest source of genealogical information today. The amount, scope, and availability of data is staggering, even incomprehensible.
Basic Web Applications 2. Search Engine Why we need search ensigns? Why we need search ensigns? –because there are hundreds of millions of pages available.
Natural Resource Program Center Dissolving Data Boundaries Search Mar /17/2011 Dan Kocol Functional Analyst I&M.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
Advanced Searching with Created by Britny Lanunyon, UNT Practicum Student, September 9, 2009.
Microsoft ® Office Excel 2003 Training Using XML in Excel SynAppSys Educational Services presents:
Surfing With Google Google Search Features. In addition to providing easy access to billions of web pages, Google has many special features to help you.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Created by Branden Maglio and Flynn Castellanos Team BFMMA.
A GENDA WHY: learn how to research better by evaluating your questions for maximum results Watch KEYWORD PowerPoint presentation Practice with in-class.
1 UNIT 13 The World Wide Web. Introduction 2 Agenda The World Wide Web Search Engines Video Streaming 3.
1 UNIT 13 The World Wide Web. Introduction 2 The World Wide Web: ▫ Commonly referred to as WWW or the Web. ▫ Is a service on the Internet. It consists.
COMP 143 Web Development with Adobe Dreamweaver CC.
 Every word matters. Generally, all the words you put in the query will be used.  Search is always case insensitive. A search for [ new york times ]
TEN TIPS ON HOW TO SEARCH EBSCO DATABASES
The Palantir Platform… …Changes in 2.3
Jacynthe Touchette, MSI JGH Health Sciences Library
Information Retrieval in Practice
CONTENT MANAGEMENT SYSTEM CSIR-NISCAIR, New Delhi
Objective % Select and utilize tools to design and develop websites.
Education 499-R01 Search Basics.
Weebly Elements, Continued
Forms and Reports 09.
Internet Searching: Finding Quality Information
Introduction to In-Text Citations
Lesson 6: Databases and Web Search Engines
Software Documentation
Objective % Select and utilize tools to design and develop websites.
Tagging documents made easy, using machine learning
Searching for and Accessing Information
ITE 130 Web Searching.
Search Techniques and Advanced tools for Researchers
Eric Sieverts University Library Utrecht Institute for Media &
CAB Abstracts, Medline & Zoological Record
ZANZIBAR UNIVERSITY LIBRARY SERVICES Introduction
Hi and welcome to the Order Centre – Ordering training.
Alternative Internet Marketing Techniques
Effective Research and Integration Techniques
Research & M.L.A. Citation Workshop 2016
ZANZIBAR UNIVERSITY LIBRARY SERVICES Introduction
Searching for Truth: Locating Information on the WWW
ZANZIBAR UNIVERSITY LIBRARY SERVICES Introduction
Lesson 6: Databases and Web Search Engines
Chapter 11 user support.
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Searching for Truth: Locating Information on the WWW
Unit 3 Introduction.
Download from Zotero Home Page
Research & M.L.A. Citation Workshop 2016
Searching for Truth: Locating Information on the WWW
PubMed/Limits and Advanced Search (module 4.2)
Search for Article Citation
Presentation transcript:

WP5 – Platform - Objective To design, build, establish, run, and sustain the Platform, and take primary responsibility for dissemination of the project outputs. This package consists of a number of elements creating what is envisioned as a ‘virtual knowledge network’, which for practical reasons will build around the project website as a knowledge node.

What questions might a user pose to childhealthresearch.eu What is available that is applicaple to Europe on 1)Measles immunisation of transient populations 2)Parental discord and teenage self harm' 3)Nuitritional imbalance in low income families 4)Internet addiction - Problem or Falacy 5)Safe playspace and childrens wellbeing in urban settings 6)Researchers with experience interviewing children about eating habits 7)Comparative study into immunisation uptake in European countries

Website subjects ● Web site ● Collaborative tools ● Information ● Upload a document ● Auto analysis ● Classifiers – taxonomy ● Funding ● Capacity ● Document repository ● Metadata store ● Metadata harvesting ● Find a document ● Classifiers ● Free text? ● What is returned

Two search functions Repositories - Site ● Site – full text search – general site, not the subject of this discussion ● Repository ● Simple ● Search for Title, author, publisher, taxonomy term ● Text search for standard word combinations (), +, -, | ● Advanced Search – Search Builder like Pubmed ● Select from each Taxonomy axis ● Combine items with AND, OR, AND NOT

● The Basic search help article covers all the most common issues, but sometimes you need a little bit more power. This document will highlight the more advanced features of Google Web Search. Have in mind though that even very advanced searchers, such as the members of the search group at Google, use these features less than 5% of the time. Basic simple search is often enough. As always, we use square brackets [ ] to denote queries, so [ to be or not to be ] is an example of a query; [ to be ] or [ not to be ] are two examples of queries. ● * Phrase search ("") ● By putting double quotes around a set of words, you are telling Google to consider the exact words in that exact order without any change. Google already uses the order and the fact that the words are together as a very strong signal and will stray from it only for a good reason, so quotes are usually unnecessary. By insisting on phrase search you might be missing good results accidentally. For example, a search for [ "Alexander Bell" ] (with quotes) will miss the pages that refer to Alexander G. Bell. ● * Search within a specific website (site:) ● Google allows you to specify that your search results must come from a given website. For example, the query [ iraq site:nytimes.com ] will return pages about Iraq but only from nytimes.com. The simpler queries [ iraq nytimes.com ] or [ iraq New York Times ] will usually be just as good, though they might return results from other sites that mention the New York Times. You can also specify a whole class of sites, for example [ iraq site:.gov ] will return results only from a.gov domain and [ iraq site:.iq ] will return results only from Iraqi sites. ● * Terms you want to exclude (-) ● Attaching a minus sign immediately before a word indicates that you do not want pages that contain this word to appear in your results. The minus sign should appear immediately before the word and should be preceded with a space. For example, in the query [ anti-virus software ], the minus sign is used as a hyphen and will not be interpreted as an exclusion symbol; whereas the query [ anti-virus -software ] will search for the words 'anti-virus' but exclude references to software. You can exclude as many words as you want by using the - sign in front of all of them, for example [ jaguar -cars -football -os ]. The - sign can be used to exclude more than just words. For example, place a hyphen before the 'site:' operator (without a space) to exclude a specific site from your search results. ● * Fill in the blanks (*) ● The *, or wildcard, is a little-known feature that can be very powerful. If you include * within a query, it tells Google to try to treat the star as a placeholder for any unknown term(s) and then find the best matches. For example, the search [ Google * ] will give you results about many of Google's products (go to next page and next page -- we have many products). The query [ Obama voted * on the * bill ] will give you stories about different votes on different bills. Note that the * operator works only on whole words, not parts of words. ● * Search exactly as is (+) ● Google employs synonyms automatically, so that it finds pages that mention, for example, childcare for the query [ child care ] (with a space), or California history for the query [ ca history ]. But sometimes Google helps out a little too much and gives you a synonym when you don't really want it. By attaching a + immediately before a word (remember, don't add a space after the +), you are telling Google to match that word precisely as you typed it. Putting double quotes around a single word will do the same thing. ● * The OR operator ● Google's default behavior is to consider all the words in a search. If you want to specifically allow either one of several words, you can use the OR operator (note that you have to type 'OR' in ALL CAPS). For example, [ San Francisco Giants 2004 OR 2005 ] will give you results about either one of these years, whereas [ San Francisco Giants ] (without the OR) will show pages that include both years on the same page. The symbol | can be substituted for OR. (The AND operator, by the way, is the default, so it is not needed.) ● Exceptions ● Search is rarely absolute. Search engines use a variety of techniques to imitate how people think and to approximate their behavior. As a result, most rules have exceptions. For example, the query [ for better or for worse ] will not be interpreted by Google as an OR query, but as a phrase that matches a (very popular) comic strip. Google will show calculator results for the query [ 34 * 87 ] rather than use the 'Fill in the blanks' operator. Both cases follow the obvious intent of the query. Here is a list of exceptions to some of the rules and guidelines that were mentioned in this and the Basic Search Help article:

Simple Search

Search results Filtered out keyword (NOT immigrants)

Search results Grouped by language

Search results Grouped by year of publication

'Build a Search' Example from another application Drag axis onto workspace – select items – add boolean operator – drag next axis etc

Add a publication ● Upload file ● Auto analyse ● Provides information to assist the classifier ● Can drag from 'auto analysis' to Classification form ● Manual 'tag' based on Taxonomy selectors ● At least one selection per major axis

Three sources of 'papers' or research 1 Adding a paper from your local PC 2 Linking to a paper stored on another system – may sometimes require User Id and Password to access 3 Referencing the metadata of a paper – from our own metadata store – allows us to extend the classification from Riche

Auto Analysis Classification workspace Document view after upload

Auto Analysis Classification workspace Document view – accept some suggested metadata

Publication metadata view

Language ?? ● European multilingual thesaurus on health promotion in 12 languages.

Auto analysis and suggested classifiers ➢ Term extraction can be performed to provide quick insight on what a document is about. ➢ On a large site with a lot of content and tags (or subjects in the plone lingo) it might be difficult to assign tags to new content. In this case, a trained classifier could provide useful suggestions to an editor responsible for tagging content. ➢ Clustering can help you organize unclassified content into groups.

POS taggers, utilities for classifying words in a document as Parts Of Speech. Two are provided at the moment, a Penn TreeBank tagger and a trigram tagger. Both can be trained with some other language than english which is what we do here.Parts Of Speech Term extractors, utilities responsible for extracting the important terms from some document. The extractor we use here, assumes that in a document only nouns matter and uses a POS tagger to find those mostly used in a document. For details please look at the code and the tests. Content classifiers, utilities that can tag content in predefined categories. Here, a naive Bayes classifier is used. Basically, the classifier looks at already tagged content, performs term extraction and trains itself using the terms and tags as an input. Then, for new content, the classifier will provide suggestions for tags according to the extracted terms of the content.naive Bayes Clusterers, utilities that without prior knowledge of content classification can group content into groups according to feature similarity. At the moment NLTK's k-means clusterer is used.k-means How it works?

Document repository ● Sometimes we must store the document

What is the MOAI Server? MOAI is an open access server platform for institutional repositories. The server aggregates content from disparate sources, transforms it, stores it in a database, and (re)publishes the content, in one or many OAI feeds. Each feed has its own configuration. The server has a flexible system for combining records into sets and uses these sets in the feed configuration. MOAI also comes with a simple yet flexible authentication scheme that can easily be customized. Besides providing authentication for the feeds, the authentication also controls access to the assets. MOAI is a standalone system that can be used in combination with any repository software that comes with an OAI feed such as Fedora Commons, EPrints or DSpace. It can also be used directly with an SQL database or just a folder of XML files. Interaction with other systems and websites Feeds from MOAI can be picked up by any system or search engine that understands OAI metadata. If the system is a content management system and has harvesting capabilities, the feed data can be stored, presented, and searched within a website. Silva, a powerful CMS for organizations that manage complex sites, has OAI Pack extensions that provide these capabilities.