Mining Web Logs to Improve Website Organization
Ramakrishnan Srikant and Yinghui Yang
Professor: Wan-Shiou Yang

Presentation transcript:

An algorithm to automatically find pages in a website whose location is different from where visitors expect to find them.

Introduction
The key insight is that visitors will backtrack if they don't find the page where they expect it. The point from which they backtrack is the expected location for this page.
Expected locations with a significant number of hits are presented to the website administrator, who can add navigation links from the expected location to the target page.

Model of Visitor Search pattern

Identifying Target Pages
Analysis of well-known websites:
Amazon: there is a clear separation between content pages and index pages, such as Reference itemsets.
Yahoo: websites are listed on the internal nodes of its hierarchy, not just on the leaf nodes.

Website & Search Pattern

Test: Find Expected Location
for i := 2 to n-2 begin
    if (P[i-1] = P[i+1]) or (no link from P[i] to P[i+1])
        add P[i] to B
end
Trace of the test for each i:
i=2: P1 = P3, or no link P2 -> P3? P2 is not added to B
i=3: P2 = P4, or no link P3 -> P4? P3 is added to B
i=4: P3 = P5, or no link P4 -> P5? P4 is not added to B
i=5: P4 = P6, or no link P5 -> P6? P5 is added to B
i=6: P5 = P7, or no link P6 -> P7? P6 is added to B
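The loop above can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name find_backtrack_points, the has_link callback, and the small example site are hypothetical and only illustrate the test on P[i-1], P[i], and P[i+1].

```python
from typing import Callable, List, Sequence


def find_backtrack_points(
    path: Sequence[str],
    has_link: Callable[[str, str], bool],
) -> List[str]:
    """Collect the pages P[i], for 1-based i from 2 to n-2, that pass the test.

    A page is added to B when the visitor returned to the page seen just
    before it (P[i-1] = P[i+1]) or when the site has no link from P[i] to
    P[i+1].
    """
    n = len(path)
    backtrack_points: List[str] = []
    # 1-based i in 2..n-2 corresponds to 0-based k in 1..n-3.
    for k in range(1, n - 2):
        prev_page, page, next_page = path[k - 1], path[k], path[k + 1]
        if prev_page == next_page or not has_link(page, next_page):
            backtrack_points.append(page)
    return backtrack_points


# Hypothetical site structure and visitor path, for illustration only.
LINKS = {("A", "B"), ("B", "C"), ("B", "D"), ("D", "T")}


def example_has_link(src: str, dst: str) -> bool:
    return (src, dst) in LINKS


# The visitor goes A -> B -> C, returns to B, then reaches target T through D.
example_path = ["A", "B", "C", "B", "D", "T"]
print(find_backtrack_points(example_path, example_has_link))
```

In this example the visitor looked for T under C and then went back to B, so C is reported as the expected location.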

Algorithm: Find Expected Location

Limitations
When the website doesn't have a clear separation between content pages and index pages, it can be hard to distinguish target pages from other pages.
Another limitation is that only visitors who successfully find a target page generate an expected location for that page.

Optimizing the Set of Navigation Links
We consider three approaches for recommending additional links to the website administrator:
1. FirstOnly: Easy and simple.
2. OptimizeBenefit: Order and all elements.
3. OptimizeTime: Reduce time for both.

FirstOnly
The algorithm recommends the frequent first expected locations (the pages that occur frequently in E1) to the website administrator, ignoring any subsequent expected locations the visitor may have considered.
Disadvantage: it addresses only a small part of what visitors needed, since the later expected locations are ignored.
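A minimal Python sketch of this idea, assuming each record is the ordered list of expected locations [E1, E2, ...] that one visitor generated for a single target page. The function name first_only, the min_support threshold, and the sample records are hypothetical; the slides do not state how "frequent" is decided.

```python
from collections import Counter
from typing import List, Sequence


def first_only(records: Sequence[Sequence[str]], min_support: int) -> List[str]:
    """Recommend pages that appear often as the FIRST expected location.

    Only E1 of each record is counted; E2, E3, ... are ignored.
    min_support is an assumed cutoff for what counts as frequent.
    """
    first_counts = Counter(record[0] for record in records if record)
    # most_common() lists pages from highest to lowest count.
    return [page for page, count in first_counts.most_common()
            if count >= min_support]


# Hypothetical records for one target page.
example_records = [["A", "B"], ["A"], ["B", "A"], ["C"], ["A", "C"]]
print(first_only(example_records, min_support=2))
```

Here A is the first expected location in three of the five records, so it is the only page that reaches the assumed support of 2.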

Example FirstOnly

Algorithm: FirstOnly

OptimizeBenefit
This is a greedy algorithm that attempts to maximize the benefit to the website of adding additional links.
In each pass, it finds the page with the maximum benefit, adds it to the set of recommendations, nulls out all instances of this page and the pages that succeed it, and recomputes the benefit.
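The greedy passes can be sketched in Python as below. The slides do not define how benefit is measured, so this sketch assumes a hypothetical list position_benefit that gives the benefit of a link from the first, second, ... expected location, and a hypothetical stopping threshold min_benefit. It is an illustration of the greedy structure, not the authors' implementation.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Sequence


def optimize_benefit(
    records: Sequence[Sequence[str]],
    position_benefit: Sequence[float],
    min_benefit: float,
) -> List[str]:
    """Greedily pick pages with the highest total benefit.

    position_benefit[j] is the assumed benefit of linking from the
    (j+1)-th expected location; it must be at least as long as the
    longest record.
    """
    working: List[List[Optional[str]]] = [list(r) for r in records]
    recommendations: List[str] = []
    while True:
        # Recompute the benefit of every page that is still present.
        benefit: Dict[str, float] = defaultdict(float)
        for record in working:
            for j, page in enumerate(record):
                if page is not None:
                    benefit[page] += position_benefit[j]
        if not benefit:
            break
        best = max(benefit, key=lambda page: benefit[page])
        if benefit[best] < min_benefit:
            break
        recommendations.append(best)
        # Null out the chosen page and every page that follows it.
        for record in working:
            if best in record:
                k = record.index(best)
                record[k:] = [None] * (len(record) - k)
    return recommendations


# Hypothetical records and benefit weights, for illustration only.
example_records = [["A", "B"], ["A"], ["B", "A"], ["C", "B"]]
print(optimize_benefit(example_records, position_benefit=[1.0, 0.5],
                       min_benefit=1.5))
```

In the first pass A has the highest benefit (2.5). After its entries and their successors are nulled out, B reaches exactly 1.5 and is also recommended; C stays below the assumed threshold.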

Example OptimizeBenefit

Algorithm: OptimizeBenefit

OptimizeTime
The goal of the algorithm is to minimize the number of backtracks the visitor has to make.
Saving time for each record (visitor) makes the website perform better for its visitors.
The algorithm is also a greedy search and is quite similar to OptimizeBenefit.
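A hedged Python sketch of such a greedy search follows. The slides do not give the time-saved formula, so the sketch assumes that a link from the j-th of m remaining expected locations saves m - j + 1 backtracks for that visitor. The names optimize_time and min_saved are hypothetical. Because nulled entries always form the tail of a record, the sketch truncates the record instead of storing nulls.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def optimize_time(records: Sequence[Sequence[str]], min_saved: int) -> List[str]:
    """Greedily pick pages that save the most backtracks overall.

    Assumption: linking from the j-th (1-based) of m remaining expected
    locations saves m - j + 1 backtracks for that record.
    """
    remaining: List[List[str]] = [list(r) for r in records]
    recommendations: List[str] = []
    while True:
        saved: Dict[str, int] = defaultdict(int)
        for record in remaining:
            seen = set()
            for j, page in enumerate(record):  # j is 0-based here
                if page not in seen:  # count a repeated page only once
                    saved[page] += len(record) - j
                    seen.add(page)
        if not saved:
            break
        best = max(saved, key=lambda page: saved[page])
        if saved[best] < min_saved:
            break
        recommendations.append(best)
        # Drop the chosen page and everything after it in each record.
        for record in remaining:
            if best in record:
                del record[record.index(best):]
    return recommendations


# Hypothetical records, for illustration only.
example_records = [["A", "B"], ["B"], ["C", "B", "A"]]
print(optimize_time(example_records, min_saved=2))
```

With these records B saves four backtracks in the first pass and is recommended. After truncation, A and C each save only one, which is below the assumed threshold, so the search stops.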

Example OptimizeTime

Algorithm: OptimizeTime

Algorithm: OptimizeTime&Profit
We can add a weight Pi_num that emphasizes the pages the web designer especially wants to recommend.
P := page with the highest value of TimeSaved(Pi) * Pi_num
This gives a list of recommendations that reflects the web designer's focus.
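The weighted selection can be illustrated with a short Python sketch. It assumes the TimeSaved values have already been computed per page, and that Pi_num is supplied as a dictionary of designer weights with a default of 1.0 for pages the designer did not weight. The function name, the default, and the sample numbers are all hypothetical.

```python
from typing import Dict, List


def rank_by_weighted_saving(
    time_saved: Dict[str, float],
    designer_weight: Dict[str, float],
    default_weight: float = 1.0,
) -> List[str]:
    """Order pages by TimeSaved(Pi) * Pi_num, highest first.

    designer_weight plays the role of Pi_num; pages without an entry
    get default_weight (an assumption, not stated in the slides).
    """
    def score(page: str) -> float:
        return time_saved[page] * designer_weight.get(page, default_weight)

    return sorted(time_saved, key=score, reverse=True)


# Hypothetical values: the designer doubles the importance of page C.
example_time_saved = {"A": 3, "B": 4, "C": 3}
example_weights = {"C": 2.0}
print(rank_by_weighted_saving(example_time_saved, example_weights))
```

Without the weight, B would rank first. With C weighted by 2.0, C scores 6 and moves to the top, which is the web-designer focus the slide describes.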