Ankush Gola Riley Thomasson Sam Payne Aaron Himelman Akshaya Uttamadoss.

Slides:



Advertisements
Similar presentations
The VAO is operated by the VAO, LLC. Web SAMP in VAO Data Discovery Tool Dean Hinshaw, NASA/HEASARC.
Advertisements

Google News Personalization: Scalable Online Collaborative Filtering
Google News Personalization Scalable Online Collaborative Filtering
Copyright Online British Art Gallery All rights reserved Online British Art Gallery Gallery Presentation.
Mining di Dati Web Web Community Mining and Web log Mining : Commody Cluster based execution Romeo Zitarosa.
10/24/2010 Distributed Software Development. 10/24/2010 Content Recommendation and Reuse Current state – Beta Prototype Hamid Riaz, Loredana Baračić.
University of Illinois Visualizing Text Loretta Auvil UIUC February 25, 2011.
Query Specific Fusion for Image Retrieval
Finding Similar Music Artists for Recommendation Presented by :Abhay Goel, Prerak Trivedi.
Web Categorization Crawler – Part I Mohammed Agabaria Adam Shobash Supervisor: Victor Kulikov Winter 2009/10 Final Presentation Sep Web Categorization.
DYNAMIC ELEMENT RETRIEVAL IN A STRUCTURED ENVIRONMENT MAYURI UMRANIKAR.
The Web is perhaps the single largest data source in the world. Due to the heterogeneity and lack of structure, mining and integration are challenging.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
By Morris Wright, Ryan Caplet, Bryan Chapman. Overview  Crawler-Based Search Engine (A script/bot that searches the web in a methodical, automated manner)
Web Mining Research: A Survey
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
Towards Implementing Better Movie Recommendation Systems Rahul Thathoo, Zahid Khan Volume of items available for sale increasing rapidly due to low barriers.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
The Social Web: A laboratory for studying s ocial networks, tagging and beyond Kristina Lerman USC Information Sciences Institute.
University of Kansas Data Discovery on the Information Highway Susan Gauch University of Kansas.
Library 10 – Information Competency Search Engines.
Semantic Scrolling How to navigate structured documents.
New “Collaborate” Button Integrate UI directly into the browser. Preferred target: Firefox Easiest browser to extend in terms of UI.
Google Chrome Your Customized Google Buddy April 2012 John Riley and Denise Tate-Kuhler.
Library of Efficient Data types and Algorithms (LEDA)
The College of Saint Rose CSC 460 / CIS 560 – Search and Information Retrieval David Goldschmidt, Ph.D. from Programming Collective Intelligence by Toby.
1 Anonshare 2.0 P2P Anonymous Browsing History Share Frank Chiang Terry Go Rui Ma Anita Mathew.
DVAEYC IREG Meeting 2 June 2010 Tamara Manik-Perlman, Azavea (formerly Avencia) CommonSpace: A Shared Transitspace Collaboration Platform.
Improving the Catalogue Interface using Endeca Tito Sierra NCSU Libraries.
A Survey on Social Network Search Ranking. Web vs. Social Networks WebSocial Network Publishing Place documents on server Post contents on social network.
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
Intelligent Database Systems Lab Presenter: MIN-CHIEH HSIU Authors: NHAT-QUANG DOAN ∗, HANANE AZZAG, MUSTAPHA LEBBAH 2013 NN Growing self-organizing trees.
Web Categorization Crawler Mohammed Agabaria Adam Shobash Supervisor: Victor Kulikov Winter 2009/10 Design & Architecture Dec
No Title, yet Hyunwoo Kim SNU IDB Lab. September 11, 2008.
Recommendation system MOPSI project KAROL WAGA
Objective Understand concepts used to web-based digital media. Course Weight : 5%
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
Data Mining By Dave Maung.
Mackenzie Hazelwood.  Stores and allows users to share documents  Document types include word, spreadsheet, drawings, and power points  Allows owner.
Map-driven Learning in Enterprise Systems DevLearn DevLearn 2013.
1 Computing Trust in Social Networks Huy Nguyen Lab seminar April 15, 2011.
The Filter Bubble By: William Santora. The Book The title of my book is The Filter Bubble: What the Internet Is Hiding From You. It was written by Eli.
1 SEMEF : A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks Delroy Cameron Masters Thesis Computer Science, University of Georgia.
GBIF Governing Board 20 Module 6B: New GBIF Tools II 2013 Portal and NPT Startup Daniel Amariles IT Leader, National Biodiversity Information System of.
The ECOST Web-based platform for data providers and for data users.
Group 3 CMPE Community Project. What is CMPE Community project? CMPE Community project aims to create a social web application to create an environment,
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
User Modeling for Personal Assistant
Strategies for Spatial Joins
Gedas Adomavicius Jesse Bockstedt
Proposal for Term Project
User working in web-browser
Distance Computation “Efficient Distance Computation Between Non-Convex Objects” Sean Quinlan Stanford, 1994 Presentation by Julie Letchner.
Data Structures and Database Applications Binary Trees in C#
Website URL
A* Path Finding Ref: A-star tutorial.
A Restaurant Recommendation System Based on Range and Skyline Queries
6. Implementation of Vector-Space Retrieval
Magnet & /facet Zheng Liang
The New LexisNexis® Statistical
Information & Democracy
Information & Democracy
Lecture 10 Graph Algorithms
Information & Democracy
Democracy and Information
Democracy and Information
Information & Democracy
Recommender Systems and Fast Algorithms
Presentation transcript:

Ankush Gola Riley Thomasson Sam Payne Aaron Himelman Akshaya Uttamadoss

Motivation Inconvenient to understand browsing history Weak content discovery tools

Why Use Histograph? Visualize how you use the web using graphs Discover new content tailored to your browsing habits

Demo: Typical User Flow histograph.us

System Overview Ext Server DB Front End

Discover Friends Graph Demo Suggestions

System Overview + Friends Ext Server DB Front End FB

User Options Time Filtering URL Filtering Upvote/Downvote

URL Recommendation Algorithm Collaborative filtering algorithm based URL frequencies at each level. Hierarchical URL tree designed for efficiency.

URL Recommendation Algorithm Recommend URLS for User A Construct a tree of the User A’s URLs, another user’s URLs. Recursively traverse tree and accumulate scores. If node is leaf, store in a Rank Table. Repeat for each other user in the system. Filter out websites User A has already seen.

Hierarchical Tree User A google 40% reddit 30% … User B google 33% reddit 20% … reddit/gifs 67% 1 2 Level reddit/funny 33% reddit/politics 50% reddit/ sports 50%

Scoring Final scores for URL is accumulated by traversing tree: score at each level is prop to Bhattacharrya Distance, your frequency, weights from up/down vote.

Challenges in Design Chrome Extension Limitations Extremely Large Datasets (~40,000 nodes per user) – Processing – Efficient Display – Hierarchical tree uses hashing to find children

Future Steps Content Aware Suggestions Get More Users Improved Server Platform More Options – Time period selection – Algorithm adjustment Search Functionality Firefox extension Map Reduce

Questions Sam Payne: – – Riley Thomasson: – Ankush Gola – Aaron Himelman – Akshaya Uttamadoss –