Database Technologies for E-Commerce Rakesh Agrawal IBM Almaden Research Center.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Automatic Timeline Generation from News Articles Josh Taylor and Jessica Jenkins.
PNS: Personalized Multi-Source News Delivery Georgios Paliouras(1), Mouzakidis Alexandros(1), Christos Ntoutsis(2), Angelos Alexopoulos(3), Christos Skourlas(2)
Chapter 10: Designing Databases
Business Development Suit Presented by Thomas Mathews.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
T.Sharon-A.Frank 1 Internet Resources Discovery (IRD) Shopping Agents.
Stefania Bergamasco, Cecilia Colasanti An integrated approach to turn statistics into knowledge combining data warehouse, controlled vocabularies and advanced.
Cloud platforms Lead to Open and Universal access for people with Disabilities and for All WP Federating repositories of Solutions.
Merging Taxonomies. Assertion Creation and maintenance of large ontologies will require the capability to merge taxonomies This problem is similar to.
Search Engines and Information Retrieval
Chapter 12: Web Usage Mining - An introduction
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Making Semantic Web Real: Some Building Blocks Rakesh Agrawal IBM Almaden Research Center.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Meerkat Overview David Robb CSCI 7818: Topics in Software Engineering Fall 2001.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Search Engines
Application Standards for ‘Push’ Content and Streaming Media Hadi Partovi Microsoft Corporation.
Databases & Data Warehouses Chapter 3 Database Processing.
Business Overview Who Is ROCKETinfo?. The Business Rocketinfo is a Web 2.0 Company focusing on providing Web-based information. The goal is to provide.
Xpantrac connection with IDEAL Sloane Neidig, Samantha Johnson, David Cabrera, Erika Hoffman CS /6/2014.
MSDSonline HQ: Viewer Site Tour Main Menu Getting to your Company List Searching within your Company List How to View and Print an MSDS How to Print a.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
CISTI Source & SiteSearch OCLC User Meeting 2001 Danielle Langlois & Carol Serroul May 9, 2001.
1 New IT Initiatives at the National University of Singapore Libraries Sylvia Yap Oct 2003.
E-Commerce Systems Chapter 8
Joomla! Day France SEBLOD Version 2.0 for Joomla! 1.6.
Search Engines and Information Retrieval Chapter 1.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Copyright © 2009 Pearson Education, Inc. Slide 6-1 Chapter 6 E-commerce Marketing Concepts.
1 The BT Digital Library A case study in intelligent content management Paul Warren
MSF Requirements Envisioning Phase Planning Phase.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
ChemStation Integration with ECM November 7, 2006 Integration of ChemStation with OpenLAB ECM Life Sciences Solutions Unit Susanne Kramer, Application.
Meeting with PIRSA, June 2010 © 2009 BRGM & Intrepid December 2006 Ray Seikel (Intrepid Geophysics) Jetstream V4.2 – Briefing for PIRSA.
New Features in Release 9.2 (July 27, 2009). 2 Release 9.2 New Features Updated Shopping Experience Home/Shop page Shop at the top search New Hosted Supplier.
Chapter 1 Introduction to Data Mining
DIGITAL ASSET MANAGEMENT FOR NEXTGEN APPS PROVIDER Business Problem In today’s growing and connected world, it has become a necessity for almost everyone.
Melissa Armstrong – Sponsor Dr. Eck Doerry – Mentor Greg Andolshek Alex Koch Michael McCormick Department of Computer Science SolutionProblemDesign User.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
Edwin Ombego Software Developer Web Portals Key Concepts Your Logo.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
 Background & Overview  Business Model & Value Proposition  Consumer & Purchase Analysis  The E-Commerce Value Chain  Technical & Design Aspects.
 Digital PR combines traditional PR with content marketing, social media and search engine optimization  Converts static news into conversations by.
NCR Confidential NCR RETAIL ONLINE Ecommerce Made Simple 1.
Mining real world data Web data. World Wide Web Hypertext documents –Text –Links Web –billions of documents –authored by millions of diverse people –edited.
Presented by Rae Huang January 23 rd, 2009 GSB Website Redevelopment Project Outline.
CIS 210 Systems Analysis and Development Week 8 Part II Designing Distributed and Internet Systems,
DATA RESOURCE MANAGEMENT
Cloud platforms Lead to Open and Universal access for people with Disabilities and for All WP Federating repositories of Solutions.
CS562 Advanced Java and Internet Application Introduction to the Computer Warehouse Web Application. Java Server Pages (JSP) Technology. By Team Alpha.
E-Commerce Systems Chapter 8 Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Copyright © 2007, Oracle. All rights reserved. Managing Items and Item Catalogs.
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents/services
SAP BODS Online Training and Placement in USA Online | classroom| Corporate Training | certifications | placements| support CONTACT US: MAGNIFIC TRAINING.
Basics Components of Web Design & Development Basics, Components, Design and Development.
A Presentation Presentation On JSP On JSP & Online Shopping Cart Online Shopping Cart.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
SmartCode Brad Argue INLS /19/2001.
We specialize in scalable Commerce solutions for suppliers and retailers that simultaneously work together.
WEB 237 Education for Service-- snaptutorial.com.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
ASP.NET Module Subtitle.
Presentation transcript:

Database Technologies for E-Commerce Rakesh Agrawal IBM Almaden Research Center

Outline  Overview of the Pangea B2B marketplace  Technology dives:  Interactive parametric search  Catalog integration  Storage & retrieval of e-commerce data  Research opportunities

Pangea Architecture

Product Selection

Brand and Loyalty Creation

Features (1)  Eureka:  Interactive parametric search  Searches based on example products  Similarity search for product substitution  Dynamic Content generation/presentation (on the fly from database):  Catalog Hierarchy pages with product counts  Side-by-Side Product Comparison  Product Listings/Details  Shopper groups through preference layer

Features (2)  Server side Searching:  Functional description search using classification  Key word & Part Number Search  Restriction of the search to sub-trees of the category hierarchy (pushed down to the database)  Real time Price and Availability Engine:  Ability to crawl, interrogate, extract price & availability data from various suppliers/distributors in real time and present them in side-by-side comparison format  Easily add new distributors/suppliers  Completely client side implementation to prevent blocking

Features (3)  Design Worksheet (Generalized shopping cart):  List of items: an item could be a specific part, alternative set of parts, specifications, other design worksheets (nesting)  Completely integrated with all relevant components (search, content, price, etc.)  Aggregate constraints (e.g. total price)  Multiple projects saved on server  Share projects with other (authorized) users  Mining for suggestions

Features (4)  Design data warehouse creation and maintenance  Crawling technology for extracting part information from websites of various manufacturer/distributor/suppliers  Data extraction from Manufacturer Spec Sheets (pdf files)  Classification technology to build, merge, manage and populate category hierarchies.

Features (5)  Soft content creation and maintenance  Crawling to acquire articles/news/postings from various web news sources, usenet newsgroups, etc.  Agents to cluster, classify, extract concepts, identify associations.  Personalized news/data presentation (based on user defined profile, channels, etc.)  Complete interleaving and integration with part content.  Automatic generation of summary pages for manufacturers (e.g. related news articles, URLs, product categories).

Prototype  Data for nearly 2 million parts, in 2000 categories  Focused crawling of more than 175 manufacturers for datasheets/ application notes/manuals (160,000)  1/4 Terabyte database