Jonathan Griffin, Managing Director, IFIS Publishing &

Presentation transcript:

Re-invigorating a middle-aged publisher with machine learning, AI and open data
Jonathan Griffin, Managing Director, IFIS Publishing & Jignesh Bhate, CEO, Molecular Connections
Obviously I am a middle-aged publisher, BUT so is IFIS

Who are IFIS?
  Educational charity founded 50 years ago
  Three trade associations concerned about difficulties of locating relevant research

What does IFIS do?
  Publish an abstracting & indexing database for food science community
  Team of food scientists curate content
  Used by universities worldwide

How are things going?
  The difficulties of locating relevant research are greater than they were
  Huge increases in scientific output
  High levels of innovation

How are things going?
  There is now an extensive range of search tools
  Time to retire?

How are things going?
  Google Scholar is the most widely used tool in food science
  Google Scholar is seriously flawed
    Text string searches (not AI)
    Large numbers of irrelevant results
    Relevant results missed
    Articles from predatory journals not filtered out
  Faculty members are concerned
  … So, it’s not time to retire!

How are we reinvigorating our activities?
  Training & education to promote best practice
  Diversifying portfolio to balance risk
  Deploying latest technologies to further improve quality & relevance of information search
  PROBLEM: We are an editorial-driven organization & lack technical expertise
  SOLUTION: Partnership

Molecular Connections (India's leading informatics company) + IFIS (charity based in a barn in SE England) = Productive partnership

FSTA (abstracting & indexing database)
  Increase in number of records
  Accuracy of search results (enhanced thesaurus)
  Cost savings
Escalex
  New online service in an adjacent market
  Solves problems arising from using web to locate legislation
  Nominated for industry award last year
New product pipeline
  Analytics
  Subsets of data

Copyright © 2018 Molecular Connections Pvt. Ltd.
SCOPE & CHALLENGES
SCOPE
  Processing regulatory & compliance information pertaining to Food Sciences/Industry of different regions and languages – Real time
CHALLENGES
  Quick search and indexing on huge datasets (Text)
  Handle unstructured text across different types of datasets (Documents, WEB APIs)
  Managing updates (Up to date information)
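
The "quick search and indexing" challenge can be illustrated with a minimal inverted index. This is only a sketch under stated assumptions: the slides do not say how Molecular Connections indexes text, and the function names and the sample regulatory snippets below are hypothetical, invented for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase a text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every token in the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    # Start from the first token's postings and intersect with the rest.
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical regulatory snippets, not taken from any real dataset.
DOCS = {
    "reg-1": "Maximum residue limits for pesticides in cereals",
    "reg-2": "Labelling rules for allergens in packaged food",
    "reg-3": "Maximum levels of contaminants in packaged cereals",
}
INDEX = build_index(DOCS)
```

Looking up a term is then a dictionary access rather than a scan of every document, which is why this structure is a common starting point for search over large text collections. Keeping it up to date would mean re-indexing a document whenever its text changes.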

Workflow
[Diagram labels: Legacy curated content; Web content; ML; AI; Big data platforms; Linked Data; PLUS Domain Expertise; New Product]

Legacy Curated Content
Different file format
Million + Abstracts
WEB CONTENT
10 Million + Metadata
50 + years of Legacy content

MC’s proprietary platforms
Description: End to end flexible content parser
  ML/AI: ML based content segmentation and structure recognition modules
  Word, Excel, LaTeX, PDFs, XMLs, Social media texts
  Standards: TEI/XML/APIs
Description: A high throughput semantic fingerprinting system
  ML/AI: Topic Modelling; ML based Entity Extraction; Feature driven Classification; Ontology based tagging/indexing
  Standards: APIs
  Heuristics/Domain Expertise: Yes
Description: A Named entity recognition platform
  ML/AI: Conditional Random Fields (ML) based models with plug and play ensemble capabilities; Feedback ingestion and logs (Active training in AI terms)
  Standards: APIs
  Heuristics/Domain Expertise: Yes
Description: A complete ontology management solution
  ML/AI: ML modules that identify missing ‘Concepts’ and in parallel suggest candidate concepts; Parse and mine large amounts of resources for candidate or lead generation in real time
  Standards: SKOS/RDF-XML/OWL/APIs
  Heuristics/Domain Expertise: Yes
Description: A visual summary and analytics studio
  ML/AI: Plug and play NERs and ontologies
  Standards: Embed, exchange formats
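
Two of the ideas on this slide, ontology based tagging and the suggestion of candidate concepts missing from an ontology, can be sketched in a few lines. This is not MC's proprietary method: plain dictionary matching and bigram counting stand in for the ML modules, and the thesaurus entries, stopword list, abstracts and function names are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical mini-thesaurus: preferred label -> variant forms.
THESAURUS = {
    "lactic acid bacteria": ["lab", "lactic acid bacterium"],
    "fermentation": ["fermenting", "fermented"],
}

# Hypothetical stopword list used when counting candidate phrases.
STOPWORDS = frozenset({"with", "and", "of", "the", "in"})


def tokenize(text):
    """Lowercase a text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tag(text, thesaurus):
    """Return sorted preferred labels whose label or variants occur in the text."""
    padded = " " + " ".join(tokenize(text)) + " "
    found = set()
    for label, variants in thesaurus.items():
        for form in [label] + variants:
            # Space padding gives whole-word (or whole-phrase) matching.
            if " " + form + " " in padded:
                found.add(label)
                break
    return sorted(found)


def candidate_concepts(texts, thesaurus, min_count=2, stopwords=STOPWORDS):
    """Suggest frequent two-word phrases that the thesaurus does not cover."""
    known = set()
    for label, variants in thesaurus.items():
        known.add(label)
        known.update(variants)
    counts = Counter()
    for text in texts:
        tokens = tokenize(text)
        for first, second in zip(tokens, tokens[1:]):
            if first in stopwords or second in stopwords:
                continue
            phrase = first + " " + second
            if phrase not in known:
                counts[phrase] += 1
    return [phrase for phrase, n in counts.most_common() if n >= min_count]


# Hypothetical abstracts, invented for illustration.
ABSTRACTS = [
    "Fermented milk with lactic acid bacteria and high pressure processing",
    "High pressure processing of fermented juice",
]
```

Here `tag(ABSTRACTS[0], THESAURUS)` maps the variant "fermented" back to the preferred label "fermentation", while `candidate_concepts(ABSTRACTS, THESAURUS)` surfaces recurring phrases such as "high pressure" that an editor could review as possible new concepts.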

New Product development engine
Benefits
  Multiple New Products
  Content Slicing
  Granular Analytics
  Superior visualization
  Better Discoverability
[Diagram labels: Linked Data; New Product]

Enhancing existing datasets
  Automated data processing significantly increased capacity
  AI-enhanced tools used to move from print-centric to enhanced digital thesaurus, enabling more accurate search results
  Linked Data
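
One way a digital thesaurus can make search results more accurate is query expansion: the search term is widened to its synonyms and narrower terms before matching. The sketch below is an assumption about the general technique, not a description of how FSTA works; the thesaurus entries, records and function names are hypothetical.

```python
# Hypothetical thesaurus entry with synonyms and narrower terms.
THESAURUS = {
    "sweeteners": {
        "synonyms": ["sweetening agents"],
        "narrower": ["aspartame", "stevia"],
    },
}

# Hypothetical record titles, invented for illustration.
RECORDS = {
    "r1": "Stevia extracts in soft drinks",
    "r2": "Sweeteners and dental health",
    "r3": "Salt reduction in bread",
}


def expand_query(term, thesaurus):
    """Return the term plus its synonyms and narrower terms, without repeats."""
    term = term.lower()
    entry = thesaurus.get(term)
    if entry is None:
        return [term]
    expanded = [term]
    for related in entry.get("synonyms", []) + entry.get("narrower", []):
        if related not in expanded:
            expanded.append(related)
    return expanded


def search(records, term, thesaurus):
    """Return ids of records whose text contains any of the expanded terms."""
    terms = expand_query(term, thesaurus)
    return [
        record_id
        for record_id, text in records.items()
        if any(t in text.lower() for t in terms)
    ]
```

With an empty thesaurus, `search(RECORDS, "sweeteners", {})` behaves like a plain text string search and finds only "r2"; with the thesaurus it also returns "r1", the record about stevia that never uses the word "sweeteners".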

Pipeline
  AI, ML & linked data enable us to take existing datasets to develop a new product pipeline
  Content collections
  Analytics
  Linked Data

Questions?

Thank You!!