Impact of Information Architecture on Content Digitization and SEO ASIDIC Spring 2007 Meeting S. Gurke SVP, Knovel Corp.
Topics Transforming e-book collection into reference database Why transform? Impact on content digitization Impact on search engine optimization (SEO) Impact on pricing and revenue
Transforming E-Book Collection into Reference Database Collection –Formatted content is outside database –Book presentation –Search full text and metadata indexes Database (XML) –Unformatted content is inside database –Database presentation (chunks) –Search tagged content chunks with metadata
Why Transform? Better reflect use patterns Streamline search and improve relevancy ranking Increase usage
Content Digitization Now Content –Text (PDF, HTML) –Metadata (TOC, etc.) –Interactive tables, graphs, equations Process (outsourced) –Scanning/OCRing –Keying –Special techniques (e.g., graph calibration) Tools –CMS –SQL database
Challenges Conversion from relational to XML database –Metadata –Interactive content –Text Chunking and tagging text –Who does it? –TOC and Subject Index as chunk metadata
Impact on SEO Exposing secure content Making metadata work –Title and author –TOC and Subject Index Benefits for users –Finding information made easier –Comprehensive search –Improved relevancy
Impact on Pricing and Revenue Pricing models –Subscription –Usage based Transaction By the drink –Hybrid/Enterprise Ad-based STM revenue Usage based royalties