How to publish in a format that enhances literature-based discovery?

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Chapter 5: Introduction to Information Retrieval
The Web of data with meaning... By Michael Griffiths.
Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Properties of Text CS336 Lecture 3:. 2 Information Retrieval Searching unstructured documents Typically text –Newspaper articles –Web pages Other documents.
1 Information Retrieval and Web Search Introduction.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
ELN – Natural Language Processing Giuseppe Attardi
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Practical RDF Chapter 1. RDF: An Introduction
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
GLOSSARY COMPILATION Alex Kotov (akotov2) Hanna Zhong (hzhong) Hoa Nguyen (hnguyen4) Zhenyu Yang (zyang2)
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Key Foundational Layers Classification, Vocab, Web Stds –Classification Content Model (Taxonomy) –Web Std Tech Requirements W3C, cloud, online, offline.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Semantic Technologies & GATE NSWI Jan Dědek.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Department of computer science and engineering Two Layer Mapping from Database to RDF Martin Švihla Research Group Webing Department.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Iana Atanassova Research: – Information retrieval in scientific publications exploiting semantic annotations and linguistic knowledge bases – Ranking algorithms.
Document Databases for Information Management Gregor Erbach FTW, Wien DFKI, Saarbrucken ETL, Tsukuba
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Toward Semantic Search: RDFa based facet browser Jin Guang Zheng Tetherless World Constellation.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
An Ontology-based Automatic Semantic Annotation Approach for Patent Document Retrieval in Product Innovation Design Feng Wang, Lanfen Lin, Zhou Yang College.
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
KYOTO (ICT ) Knowledge Yielding Ontologies for Transition-Based Organization Intelligent Content and Semantics The First KYOTO Workshop February.
Information Retrieval in Practice
? Searching the WWW today document retrieval keyword based search user
Information Organization: Overview
Visual Information Retrieval
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Information Retrieval and Web Search
Natural Language Processing (NLP)
Information Retrieval and Web Search
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Information Retrieval and Web Search
Text Analytics Giuseppe Attardi Università di Pisa
Social Knowledge Mining
RichAnnotator: Annotating rich (XML-like) documents
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
CSE 635 Multimedia Information Retrieval
CS246: Information Retrieval
Natural Language Processing (NLP)
Web Mining Research: A Survey
Jonathan Griffin, Managing Director, IFIS Publishing &
Information Organization: Overview
Information Retrieval and Web Search
Information Retrieval
Detailed metadata is as easy as 1, 2, 3
Natural Language Processing (NLP)
Presentation transcript:

How to publish in a format that enhances literature-based discovery? All Course Materials : sci.ai/fsci2017 Join Community : sci.ai/community @sci_ai

Key question of our work : How to extract molecules of knowledge from research results so that new discoveries can be built from it? ? ?

Knowledge Graph of the Literature-Based Discovery

Defining molecules of knowledge in the text means: 1. Conceptualizing terms “cyclooxygenase-2” is a http://identifiers.org/uniprot/P35354 2. Identifying relationships between terms

Information Representation and Operations with Information

Linguistic Levels Linguistics Levels Corresponding Algorithms of Reading and Understanding in Computational Linguistics Pragmatics Sentiment analysis Topic segmentation and recognition Anaphora resolution Question answering Semantics Named entity recognition (NER) Word sense disambiguation Relationship extraction Syntax Parse (syntax) tree Morphology Stemming Lemmatization Part-of-Speech (POS) Tagging * Line between Pragmatics and Semantics is very blurred https://www.boundless.com/psychology/textbooks/boundless-psychology-textbook/language-10/introduction-to-language-60/the-structure-of-language-234-12769/

Basics Information Retrieval Operates with Syntactical Representation Document and query are represented in vector space model: If there is term, then corresponding value in the vector is non-zero Relevant document ~ higher syntactic (string form) similarity to the query 1. https://en.wikipedia.org/wiki/Vector_space_model 2. https://nlp.stanford.edu/IR-book/html/htmledition/a-first-take-at-building-an-inverted-index-1.html

Semantic Biomedicine Semantic Web Features: 1. Web of Data 2. The same language for data exchange. 3. Relationships and Ontologies. Royer L., Linse B., Wächter T., Furch T., Bry F., Schroeder M. (2007) Querying Semantic Web Contents. In: Baker C.J.O., Cheung KH. (eds) Semantic Web. Springer, Boston, MA https://link.springer.com/content/pdf/10.1007%2F978-0-387-48438-9.pdf

Literature-based discovery Knowledge extraction Reasoning Hypothesis Generation

How to publish in a format that enhances literature-based discovery? That is how search engines see your paper.

Standard vs XML-centered Publishing Process [Bazargan K. A complete end-to-end publishing system based on JATS.https://www.ncbi.nlm.nih.gov/books/NBK279828/]

Semanticized Paper Publishing Workflow

Semanticization: conceptualizing terms and defining relationships

Practice. Single paper semanticizing Step 1. Write research in Google Docs (support of other text editors is upcoming) Step 2. Send for automatic semanticization Step 3. Validate results in app.sci.ai

Practice. Validation of the semanticization results. Validate biomedical concepts in the text. 1. It is / it is not a bio object 2. Confirm / Reject proposed term-to-concept in ontology relationship 3. Add custom ontology concept

Practice. Validate Facts Extracted From The Text Validate and label new facts: 1. Confirm / Reject Facts. 2. Create New Facts.

Practice. Single paper knowledge graph

Practice. Export semanticized paper in JATS and HTML

Recipients of the Publishing Formats Humans Machines HTML PDF Printed JATS RDF RDF/XML CSV / XML / JSON Data API That is how search engines see your paper.

Extended JATS with biomedical metadata http://demo.sci.ai/jats/full-example.xml

Extended JATS validation http://demo.sci.ai/jats/full-example.xml JATS specific validator. XML validator against specific tag set https://www.ncbi.nlm.nih.gov/pmc/tools/xmlchecker/ Generic RDF/XML validator https://www.w3.org/RDF/Validator/

HTML + RDFa with Biomedical Microdata

HTML Page Layout Downloadable version of the paper in JATS, RDF/XML, RDF etc. HTML with hints for reading https://doi.org/10.3389/fncel.2017.00074

How Google reads HTML+RDFa https://search.google.com/structured-data/testing-tool/u/0/#url=http%3A%2F%2Fdemo.sci.ai%2Fsgc%2Fhdac6.html

Practice. Publishing preprint. Annotating with hypothes.is

How would you use semantic data layer in your research communication? e-mail: roman.gurinovich@sci.ai All Course Materials : sci.ai/fsci2017 Join Community : sci.ai/community

Bonus. Semantic Biomedicine Fundamentals

Bonus. Semantic Biomedicine Applications Applications and Technologies

Nanopublications Mons, Barend and Velterop, Jan. "Nano-Publication in the e-science era." Paper presented at the meeting of the International Semantic Web Conference, 2009. https://www.w3.org/wiki/images/4/4a/HCLS$$ISWC2009$$Workshop$Mons.pdf http://nanopub.org

Bonus Practice. Authors Authentication and Labeling with ORCID ID