Exploring Scholarly Data with Rexplore

Slides:



Advertisements
Similar presentations
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Advertisements

Data Science for Business: Semantic Verses Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
What are the characteristics of academic journals
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Features and Uses of a Multilingual Full-Text Electronic Theses and Dissertations (ETDs) System Yin Zhang Kent State University Kyiho Lee, Bumjong You.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Innovation in Search? Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
1. Scopus Update November 2004 American University of Beirut Presented by:Amanda Hart Date: 11 November 2004.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Mastering the Internet, XHTML, and JavaScript Chapter 7 Searching the Internet.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Information Retrieval: Human-Computer Interfaces and Information Access Process.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Guillaume Rivalle APRIL 2014 MEASURE YOUR RESEARCH PERFORMANCE WITH INCITES.
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE KIEV, 31 JANUARY.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
1 A Discriminative Approach to Topic- Based Citation Recommendation Jie Tang and Jing Zhang Presented by Pei Li Knowledge Engineering Group, Dept. of Computer.
Next generation library catalogs and the integration of gazetteer information for geographical research Julie Sweetkind-Singer Assistant Director of Geospatial,
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
SemSearch: A Search Engine for the Semantic Web Yuangui Lei, Victoria Uren, Enrico Motta Knowledge Media Institute The Open University EKAW 2006 Presented.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
Strategies for Conducting Research on the Internet Angela Carritt User Coordinator, Oxford University Library Services Angela Carritt User Education Coordinator,
Announcements Literature search lab on Wednesday (focus on your project) Keep track of your searching to document on the search log…for each search instance:
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Media Arts and Technology Graduate Program UC Santa Barbara MAT 259 Visualizing Information Winter 2006George Legrady1 MAT 259 Visualizing Information.
Database collection evaluation An application of evaluative methods S519.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Francesco Osborne KMi, The Open University, United Kingdom April 2016 Two roads to semantic publishing.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Data Mining for Expertise: Using Scopus to Create Lists of Experts for U.S. Department of Education Discretionary Grant Programs Good afternoon, my name.
Microsoft Academic Search Search | Explore | Discover
Ricardo EIto Brun Strasbourg, 5 Nov 2015
Summon® 2.0 Discovery Reinvented
Bibliometrics toolkit: Thomson Reuters products
CCNT Lab of Zhejiang University
Simile poems for kids by Lawraine Guichard
Personalized Social Image Recommendation
User Interface HEP Summit, DESY, May 2008
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Discovery Paradigms Interest Group Report on Activities and Outputs Anita de Waard, Siri Jodha Singh Khalsa Fotis Psomopoulis Mingfang Wu.
Optimize your research performance using SciVal
Overview & Applications Welcome!
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
An Efficient method to recommend research papers and highly influential authors. VIRAJITHA KARNATAPU.
Data Warehousing and Data Mining
Introduction into Knowledge and information
An ecosystem of contributions
Geospatial and Problem Specific Semantics Danielle Forsyth, CEO and Co-Founder Thetus Corporation 20 June, 2006.
NSDL Data Repository (NDR)
Exploratory search: New name for an old hat?
IL Step 3: Using Bibliographic Databases
Introduction of KNS55 Platform
Searching and browsing through fragments of TED Talks
Internet Search Tools Bonnie R. MacGregor San Jose State University School of Library Information and Science LIBR 204: Reference Services Pathfinder Presentation.
Magnet & /facet Zheng Liang
Objectives, activities, and results of the database Lituanistika
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Conducting a STEM Literature Review
Web archives as a research subject
Web Mining Research: A Survey
Information Retrieval and Web Design
Citation databases and social networks for researchers: measuring research impact and disseminating results - exercise Elisavet Koutzamani
Presentation transcript:

Exploring Scholarly Data with Rexplore Francesco Osborne1,2, Enrico Motta1, Paul Mulholland1 1Knowledge Media Institute, The Open University, 2Dept. of Computer Science, University of Torino

Outline Introduction State of the art Overview of Rexplore Empirical Evaluation Conclusions

Introduction Understanding what goes on in a research area is no easy task A variety of entities, such as publications, publication venues, researchers, research groups, events Relationships which exist between them Different categories of users Rexplore, which integrates statistical analysis, semantic technologies, and visual analytics To investigate research trends effectively at different levels of granularity To relate authors ‘semantically’ To perform fine-grained academic expert search along multiple dimensions

State of the art Providing an interface to a specific repository of bibliographic data Integrating multiple data sources to provide access to a richer set of data Some widely used academic search engine: Google Scholar(GS), FacetedDBLP, Microsoft Academic Search(MAS), CiteSeer, Saffron

Gap Analysis No semantic characterization of research areas Systems tend to use keywords as proxies for research areas Lack of granular analysis E.g.: MAS can visualize publication trends in ”Wrold Wide Web” and “Databases”, but cannot provide this feature for “Semantic Web” Digital library bias 缺少精确的分析 数字图书馆偏爱

Overview of Rexplore Detect and make sense of the important trends in one or more research areas Identify researchers and analyze their academic trajectory and performance in one or multiple areas, according to a variety of fine-gained requirements Discover and explore a variety of dynamic relations between researchers, between topics, and between researchers and topics Support ranking of specific sets of authors, generated through multi-dimensional filters, according to various metrics

Rexplore Architecture Using a combination of statistical methods and background knowledge Statistical methods and background

Means for Achieving Ontology population with Klink characterizes research areas and their relationships skos:broaderGeneric: “Semantic Web Service”-”Semantic Web” and “Web Service” contributesTo: “Ontology Engineering”-”Semantic Web” relatedEquivalent: “Ontology Matching”-”Ontology Alignment” Geographic Enrichment Universities, Research Labs, Hospitals Maps the affiliation to GeoNames -e.g., “University of Turin” and “University of Torino”

Means for Achieving Topic Analysis General information about the topic Access to relevant authors and publications The topic navigator Visual analytics on broaderGeneric and contributesTo topics Visual analytics on authors’ migration pattens from other topics to and from the topic in question

Means for Achieving Author Analysis General bio information Authors’ scores according to different bibliometric measures Topic analysis Co-author analysis Pattern analysis Graph view

Means for Achieving Faceted Search and Data Browsing Filter: name or a part of it, career range, topics of interest, venues in which they published Rank: number of publications, number of citations, H-Index, G-Index, HT-Index, GT-Index, number of publications/citations in a topic or set of topics, number of publications/citations in a venue or set of venues

Means for Achieving The Graph View

Experimental Setup

Results 17 PhD students and researchers 50 of the 51 tasks using Rexplore, with a 98% success rate(complete the task within 15 min) 8/9 subjects were asked to work with GS/MAS Only 3 people completed a task with MAS

Results No domain specific expertise is needed to use Rexplore to make sense of a particular research area Experts in Bibliometric and Learning Analytics would do better SUS: 75/100, ≥72% of the 500 tested systems 94%: the system are well integrated 82%: would be happy to use Rexplore for their work

Feedback 94%: “very effective” 18%: “easy/natural/intuitive” The Most useful features: Faceted filter (59%) The visualization/charts (47%) The graph view (47%) The semantic characterization of topics (41%) The main weakness: Visual complexity (41%) Not always well-evidenced Navigation context (35%)

Feedback To Suggest new features: “minor interface change” (23%) A natural language interface for formulating complex searched The ability to retrieve and search full text of a publication from within Rexplore Did not need any additional features (23%)

Conclusions Rexplore arguably affords a major advantage over other tools in its ability to support: The visualization of trends at a very fine level of granularity Methods to identify ‘semantic’ relations between authors Fine-grained multi-dimensional academic expert search Future work: Improve the minor interface Add to the number of navigation filters Release a version of the tool with comprehensive data coverage for use by the scientific community

Thank you