Web Mining and Semantic Web Web Mining and Semantic Web Pınar Şenkul Dept. of Computer Engineering.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

TU e technische universiteit eindhoven / department of mathematics and computer science Modeling User Input and Hypermedia Dynamics in Hera Databases and.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Chase Repp.  knowledge discovery  searching, analyzing, and sifting through large data sets to find new patterns, trends, and relationships contained.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Data Mining Techniques Cluster Analysis Induction Neural Networks OLAP Data Visualization.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Data Mining By Archana Ketkar.
Building Knowledge-Driven DSS and Mining Data
Data Mining – Intro.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Overview of Web Data Mining and Applications Part I
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining on the Web via Cloud Computing COMS E6125 Web Enhanced Information Management Presented By Hemanth Murthy.
Chapter 4: Organizing and Manipulating the Data in Databases
Chapter 4-1. Chapter 4-2 Database Management Systems Overview  Not a database  Separate software system Functions  Enables users to utilize database.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
Introduction to Biometrics Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #3 Information Management and Data Mining August 29, 2005.
Marko Grobelnik Jozef Stefan Institute ( Ljubljana, Slovenia.
Agent Model for Interaction with Semantic Web Services Ivo Mihailovic.
Chapter 4: Organizing and Manipulating the Data in Databases
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Data Mining By Dave Maung.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Multi-Relational Data Mining: An Introduction Joe Paulowskey.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Mining real world data Web data. World Wide Web Hypertext documents –Text –Links Web –billions of documents –authored by millions of diverse people –edited.
Foundations of Business Intelligence: Databases and Information Management.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
MIS2502: Data Analytics Advanced Analytics - Introduction.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
12. DISTRIBUTED WEB-BASED SYSTEMS Nov SUSMITHA KOTA KRANTHI KOYA LIANG YI.
Data Mining – Intro.
MIS2502: Data Analytics Advanced Analytics - Introduction
Introduction C.Eng 714 Spring 2010.
Web Ontology Language for Service (OWL-S)
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehousing and Data Mining
Supporting End-User Access
Web Mining Department of Computer Science and Engg.
Database System Concepts and Architecture
Presentation transcript:

Web Mining and Semantic Web Web Mining and Semantic Web Pınar Şenkul Dept. of Computer Engineering

Data mining finding interesting trends and patterns in very large databases extract information which is not explicit from the data and implicitly specified also related to artificial intelligence (knowledge discovery and machine learning) and statistics

Data Mining Applications Sequential patterns Association rules Clustering and classification

Discovering Association Rules Rule: “If i then j” Support: the percentage of records containing both items i and j. Confidence: the percentage of records containing item j among the records containing item i.

Multi-Table Association Rule Mining Integrating data from multiple tables into a single table through some preprocessing such as using joins and aggregation –can cause loss of semantics and information Inductive Logic Programming (ILP) –applies directly to the data that is stored on multiple tables

ILP Systems Top-down vs. Bottom-up Well-known systems: –CLAUDIEN, ICL, WARMR, TILDE, PROGOL, ALEPH, TERTIUS, FDEP,FOCL,GOLEM, MERLIN, FLIP & SMILES, ATRE, ACL, CHILLIN, FORTE, GBR, LIVE

ILP Given: –a set of examples, E –background knowledge, BK –produce a set of relations (clauses) using BK that describe E. Strong language bias : precise syntactical description of acceptable clauses

Current Problems ILP-Based Association Rule Discovery System –No negative examples –Purely relational –Using support & confidence –Efficient but restricted with language biases

Application Areas Has a verity of application areas Possibly –Web mining –Semantic Web/Ontology Mining

Web Mining Using data mining techniques on web data web content mining web structure mining web usage mining

Web Usage Mining Discovering usage patterns from the web in order to better understand and serve the needs of users and web-based applications. It includes two important phases: data preprocessing pattern discovery

Semantic Web Mining Discovering better patterns by using semantic information Discovering/enhancing the semantic information by the using the extracted patterns.

Web Services and Web Service Composition Problem: construction of a complex service from the existing individual atomic services according to user's needs and requirements. The set of existing services is highly dynamic Individual service selection should be ● dynamic ● fulfill user's requirements

Web Service Composition ● A uniform framework for modeling and satisfaction of the constraint ● Flexibility for constraint specification ● Automatic selection of concrete services

Example ● Wedding Anniversary Celebration – buy a bouquet of flowers – buy earrings or buy a one-day weekend trip – make dinner reservation Constraints: budget ≤ $1000 duration < 5 hours quality(restaurant) ≥ 3-star

Architecture

Some Problems How to using semantic knowledge for different levels of composition process Web mining for service recommendation utilizing semantic information

Some of the FP6-IST projects on data mining –Semantic Interoperability and Data Mining in Biomedicine Semantic Interoperability and Data Mining in BiomedicineSemantic Interoperability and Data Mining in Biomedicine Project Acronym: SEMANTICMINING Action Line: IST eHealth Contract Type: NETWORK OF EXCELLENCE –Semantic Interaction with Music Audio Contents Semantic Interaction with Music Audio ContentsSemantic Interaction with Music Audio Contents Project Acronym: SIMAC Action Line: IST Semantic-based knowledge systems Contract Type: SPECIFIC TARGETED RESEARCH PROJECT –Data Mining Tools and Services for Grid Computing Environments Data Mining Tools and Services for Grid Computing EnvironmentsData Mining Tools and Services for Grid Computing Environments Project Acronym: DATAMININGGRID Action Line: IST Grid based systems for complex problem solving Contract Type: SPECIFIC TARGETED RESEARCH PROJECT –Inductive Queries for Mining Patterns and Models Inductive Queries for Mining Patterns and ModelsInductive Queries for Mining Patterns and Models Project Acronym: IQ Action Line: IST FET - Open Contract Type: SPECIFIC TARGETED RESEARCH PROJECT –GRID ENABLED REMOTE INSTRUMENTATION WITH DISTRIBUTED CONTROL AND COMPUTATION GRID ENABLED REMOTE INSTRUMENTATION WITH DISTRIBUTED CONTROL AND COMPUTATIONGRID ENABLED REMOTE INSTRUMENTATION WITH DISTRIBUTED CONTROL AND COMPUTATION Project Acronym: GRIDCC Action Line: IST Research Networking Test beds Contract Type: SPECIFIC TARGETED RESEARCH PROJECT

Contact Info Pinar SENKUL Ismail Hakki TOROSLU