RightFind™ XML for Mining- One Cross-Publisher Initiative to Empower Text Mining Roy S Kaufman, Managing Director, New Ventures, CCC.

Slides:



Advertisements
Similar presentations
RightSphere ® Basic A quick tutorial. Global Rights Broker 1/3/20142 Not-for-profit founded in 1978 Solutions for the seamless sharing of knowledge Manage.
Advertisements

1 of 16 Information Access The External Information Providers © FAO 2005 IMARK Investing in Information for Development Information Access The External.
Usage Statistics in Context: related standards and tools Oliver Pesch Chief Strategist, E-Resources EBSCO Information Services Usage Statistics and Publishers:
Ezra T. Ernst Chief Executive Officer Swets Information Services, Inc. The Long Tail and its’ application To Scholarly Information.
CASE STUDY Intelligent Subrogation Community health plan saves more than $2 million in less than a year with cloud-based coordination of benefits and subrogation.
11© 2011 Hitachi Data Systems. All rights reserved. HITACHI DATA DISCOVERY FOR MICROSOFT® SHAREPOINT ® SOLUTION SCALING YOUR SHAREPOINT ENVIRONMENT PRESENTER.
IAEA International Atomic Energy Agency ICSTI 2013 Annual Members’ Meeting March 2013.
Lund Online 07/10/2009 Ingolf Kaspar, Regional Sales Manager EBSCO Publishing.
Making sense of the data jumble Trinity College Library Dublin’s Discovery Solution Experience Arlene Healy & Charles Montague Digital Systems and Services.
Overview and capabilities MAY We are online marketing experts We are connecting the dots and delivering results We create powerful online marketing.
Product Offering Overview CONFIDENTIAL AND PROPRIETARY Copyright ©2004 Universal Business Matrix, LLC All Rights Reserved The duplication in printed or.
Copyright Compliance Sharing in a Digital World London Info International, Nov 2014 Kate Alzapiedi Business Development Director RightsDirect Stephen.
Licensing News Content in a Digital World Newspaper and Periodicals Working Group IFRRO World Congress, October 25, 2011 Presented by: Edward Colleran.
The Eyeblaster ACM Advertising Campaign Management.
Managing the information explosion Binesh Lad. 20% 80% Structured Content Everything else.
All Search Platforms are Created Equal … Myth or Reality Presented by Matt Dunie President, CSA
RefWorks Your Personal Online Database And Bibliography Creator.
Tutorial EBSCO Discovery Service for Corporate Users support.ebsco.com.
Case Study SummaryChallenges Cisco WebEx, the world market leader in online web conference, has been working with Link Translation since 2009 to support.
Accumulus Delivers Enterprise Class Subscription Billing and Automation Solutions for Gaming, Retail, and More on the Scalable Microsoft Azure Platform.
IT Enablement Approaches Large Business may have hundreds of processes to be enabled by IT. Several Types of Application may be deployed –Departmental.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Taking the Library Back from Google Abe Lederman, President and CTO October 18-20, 2007.
Accurate  Consistent  Compliant Contact: i4i the structured content company the structured content company.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
GameChanger’s Rate Quote Issue Solution is Deployed to Microsoft Azure for a Fast, Flexible Direct to Consumer Insurance Sales Solution MICROSOFT AZURE.
OFFICE 365 APP BUILDER PROFILE: Druva
NLA media access – update
Moshe Shechter | Alma Product Manager
SAP Trade Repository Reporting by Virtusa
Elsevier Operative Techniques - Netter Process Flow
WHY VIDEO SURVELLIANCE
WHY VIDEO SURVELLIANCE
Device Maintenance and Management, Parental Control, and Theft Protection for Home Users Made Easy with Remo MORE and Power of Azure MICROSOFT AZURE APP.
Data Platform and Analytics Foundational Training
The effort-saving, cost-cutting, low-overhead, cloud capture platform.
Meemim's Microsoft Azure-Hosted Knowledge Management Platform Simplifies the Sharing of Information with Colleagues, Clients or the Public MICROSOFT AZURE.
PLOS Facilitating Text & Data Mining The Role the Publisher Can Play
Simple and intuitive fare conditions
Gain Global Exposure: Partner with EBSCO to Promote your Scholarship
Free Cloud Management Portal for Microsoft Azure Empowers Enterprise Users to Govern Their Cloud Spending and Optimize Cloud Usage and Planning MICROSOFT.
Firefish Software for Professional Recruiters Stays Available Around the Clock from Any Device and Anywhere by Using the Microsoft Azure Platform Partner.
Wonderware Online Cost-Effective SaaS Solution Powered by the Microsoft Azure Cloud Platform Delivers Industrial Insights to Users and OEMs MICROSOFT AZURE.
Attention! In order to print this two-page flyer, please follow these steps: 1) Personalize the text and logo area with your custom copy and logo. 2) Delete.
Nicole Steen-Dutton, ClickDimensions
Speaker’s Name, SAP Month 00, 2017
Mastering automation to optimise quality
Presentation Title.
Order Management For Shippers.
The Sitecore® Experience Platform™ on Microsoft Azure
Rapid fire performance testing of 250 websites
Built on the Powerful Microsoft Azure Platform, iSwarm Helps Businesses Analyze Social Media Conversations, then Connect with Individuals MICROSOFT AZURE.
Be Better: Achieve Customer Service Excellence and Create a Lean RMA and Returns Process with Renewity RMA and the Power of Microsoft Azure MICROSOFT AZURE.
EPIC INFOTECH CONSULTING GROUP
Revolutionized, Automated Cash and Gratuity Management for the Hospitality Industry, Thanks to Microsoft Azure MICROSOFT AZURE APP BUILDER PROFILE: Evention.
CASE STUDY Intelligent Subrogation
Introducing Qwory, a Business-to-Business Search Engine That’s Powered by Microsoft Azure and Detects Vital Contact Information for Businesses MICROSOFT.
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
Adra ACCOUNTS: Transaction Matching Software Powered by the Microsoft Azure Cloud That Helps Optimize the Accounting and Finance Processes MICROSOFT AZURE.
Collaborative Business Solutions
Keep Your Digital Media Assets Safe and Save Time by Choosing ImageVault to be Your Digital Asset Management Solution, Hosted in Microsoft Azure Partner.
ADAM on Microsoft Azure Streamlines Access and Control of Full Function Digital Asset and Product Content Management for All Workers MICROSOFT AZURE ISV.
DOCUMENTAL SOLUTIONS Market Analysis Intelligence & Tools
Single Cell’s Progenitor Powered by Microsoft Azure Improves Organisational Efficiency with Strategic Procurement, Contract Management, and Analytics MICROSOFT.
WHY VIDEO SURVELLIANCE
WHY VIDEO SURVELLIANCE
Jonathan Griffin, Managing Director, IFIS Publishing &
Contract Management Software from ContraxAware Simplify Your Contract Management Process.
OU BATTLECARD: Oracle Database 12c R2
Presentation transcript:

RightFind™ XML for Mining- One Cross-Publisher Initiative to Empower Text Mining Roy S Kaufman, Managing Director, New Ventures, CCC

Copyright, simplified. Remove this Global content and licensing solutions that make copyright work for everyone Corporate researchers sharing journal articles to support drug discovery Publishers seeking permission to use third-party content in new works Course creators preparing materials for student readings 950+ million rights, 12,000+ rightsholders, 35,000 customers in 140 countries Based in Danvers MA USA, with international subsidiary, RightsDirect, based in Amsterdam with presence in Tokyo One of EContent’s “100 Companies that Matter Most” in digital content for last 7 years Named one of Outsell’s “10 to Watch” in the search, aggregation and syndication segment Copyright Clearance Center, or CCC, is a global rights broker that manages more than 600 million individual rights. CCC was started more than 30 years ago as a not-for-profit organization. CCC has relationships with similar organizations, or RROs, around the world, through which we obtain valuable non-US titles for inclusion in our licenses. CCC is dedicated to the progress of collective licensing efforts around the world and is an active member of the International Federation of Reproduction Rights Organisations (IFRRO). CCC serves as a thought leader on copyright-related issues, providing licensing solutions that serve both copyright holders and the people who use their content. For 7 straight years, CCC has been named to Econtent ‘s list “100 Companies That Matter Most” in the digital content industry. CCC also joined Google, Yahoo, Microsoft and 6 other organizations named by research specialist Outsell as one of the “10 to Watch” in search, aggregation and syndication.

Making Copyright Work Rightsholders Content Users Licensing Solutions Rights Management Content Delivery Copyright Education 950+ million rights from: Publishers Authors Agents Creators 35,000 companies Workers worldwide 1,200 colleges and universities Publishers and Authors

CCC in the World of Text Mining this goes to Eefke Our product is like High Octane gasoline for Text Miners. Companies already have a text mining tool but it runs poorly with out gas. You don’t need to become proficient at selling text mining, but you’ll need to know all about gasoline…

Text Mining Today – Example Workflow Manual work Text mining tools Search Get permission Download PDFs Convert PDFs Import into text mining software Search Search Get permission Get permission Download PDFs Download PDFs Convert PDFs Convert PDFs Run queries Import into text mining software Import into text mining software View results Perform search Obtain permission from publishers to mine full text for commercial use Requires automated tool or custom software to download in bulk Requires text mining permission from multiple publishers Requires content storage and feed management PDF is converted to a “blob of text” No tags Loss of metadata Low fidelity of content References induce noise Requires structuring text into XML Article text does not have “fields” Combining content from multiple sources takes time to normalize the metadata Here’s an example of a text mining workflow based on the information gathered in our research with text miners in the commercial life sciences. Recapping the challenges to researchers: Difficult to obtain full-text XML Difficult to integrate content into text mining platforms Multiple sets of terms, conditions and file formats Hard to negotiate and manage multiple publisher feeds No single solution addresses these issues until now…Our service is used to automate that laborious manual process on the left so that you can get better results faster with your text mining soluiton.

Introducing CCC’s XML for Mining Service Build a collection of full-text articles in XML format for mining CCC’s Text Mining Service CCC’s text mining service expands the capability of companies’ text mining efforts beyond article abstracts and Open Access articles, allowing researchers to search and download the full-text from a single source, eliminating the need to manually find, acquire, license and convert articles from disparate publishers and other online sources. Enables researchers to quickly and efficiently create collections of full-text articles, from multiple publishers, in XML format for text mining. CCC’s text mining service is specifically designed to allow users to access and obtain machine readable content formatted in XML for loading into text mining systems such as Linguamatics I2E or IBM Watson. CCC uses the JATS format for its XML files, enabling mining tools to easily ingest the content from our system. Text Mining Software

CCC Integration with I2E Too Detailed Automatically index in Linguamatics I2E Index Directly CCC’s Text Mining Service The integrated CCC and Linguamatics I2E solution enables researchers to spend less time gathering and formatting content into a mineable form so they can spend more time querying and analyzing results. Linguamatics I2E

Benefits of CCC’s Text Mining Service Improves the results of your text mining efforts Saves time and money for corporations; large, small, established and start-up Ensures copyright compliance Improves the results of your text mining efforts Enables text miners to go beyond the abstract level to search, download and mine full-text articles in XML format from both company subscriptions as well as unsubscribed published material. CCC’s service gives you more accurate and richer results enabling you to make discoveries that can only be found in the full text. Ensures copyright compliance Because all of the content in the service is pre-authorized for commercial text mining, you get the peace-of-mind that your text mining projects comply with copyright, minimizing your organization’s infringement risk. Saves time and money Aggregates full text article content and normalizes metadata from multiple publishers into a secure cloud for fast and easy access, reducing the time and costs associated with article conversions, content management, and negotiations with publishers. CCC’s service accelerates access to article collections for mining, giving text mining professionals more time to focus on analysis and discovery.

Features of CCC’s Text Mining Service Enables search within sections of the body text Identifies keyword hits in article excerpts to ensure a good match Enables the discovery of relevant unsubscribed content Provides uniform terms and conditions for mining Integrates with text mining tools. Employs API for additional workflow integrations

Thank you! Roy S Kaufman rkaufman@copyright.com http://orcid.org/0000-0002-7192-6578