Yoda: An Accurate and Scalable Web-based Recommendation Systems Cyrus Shahabi, Farnoush Banaei-Kashani, Yi-Shin Chen, and Dennis McLeod Integrated Media.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Recommender Systems & Collaborative Filtering
Item Based Collaborative Filtering Recommendation Algorithms
Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.
Collaborative Filtering Sue Yeon Syn September 21, 2005.
1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
2. Introduction Multiple Multiplicative Factor Model For Collaborative Filtering Benjamin Marlin University of Toronto. Department of Computer Science.
Using a Trust Network To Improve Top-N Recommendation
Modeling the Internet and the Web School of Information and Computer Science University of California, Irvine E-Commerce.
EigenTaste: A Constant Time Collaborative Filtering Algorithm Ken Goldberg Students: Theresa Roeder, Dhruv Gupta, Chris Perkins Industrial Engineering.
1 Collaborative Filtering and Pagerank in a Network Qiang Yang HKUST Thanks: Sonny Chee.
Top-N Recommendation Algorithm Based on Item-Graph
Chapter 8 Collaborative Filtering Stand
Agent Technology for e-Commerce
Customizable Bayesian Collaborative Filtering Denver Dash Big Data Reading Group 11/19/2007.
1 ISI’02 Multidimensional Databases Challenge: representation for efficient storage, indexing & querying Examples (time-series, images) New multidimensional.
Modeling User Rating Profiles For Collaborative Filtering
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
FACE RECOGNITION, EXPERIMENTS WITH RANDOM PROJECTION
Sparsity, Scalability and Distribution in Recommender Systems
Collaborative Filtering CMSC498K Survey Paper Presented by Hyoungtae Cho.
Analysis of Recommendation Algorithms for E-Commerce Badrul M. Sarwar, George Karypis*, Joseph A. Konstan, and John T. Riedl GroupLens Research/*Army HPCRC.
Recommender systems Ram Akella November 26 th 2008.
Collaborative Recommendation via Adaptive Association Rule Mining KDD-2000 Workshop on Web Mining for E-Commerce (WebKDD-2000) Weiyang Lin Sergio A. Alvarez.
CS 277: Data Mining Recommender Systems
Item-based Collaborative Filtering Recommendation Algorithms
+ Social Bookmarking and Collaborative Filtering Christopher G. Wagner.
Recommender Systems on the Web: A Model-Driven Approach Gonzalo Rojas – Francisco Domínguez – Stefano Salvatori Department of Computer Science University.
Content-Based Recommendation Systems Michael J. Pazzani and Daniel Billsus Rutgers University and FX Palo Alto Laboratory By Vishal Paliwal.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Item Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karpis, Joseph KonStan, John Riedl (UMN) p.s.: slides adapted from:
LOGO Recommendation Algorithms Lecturer: Dr. Bo Yuan
A Hybrid Recommender System: User Profiling from Keywords and Ratings Ana Stanescu, Swapnil Nagar, Doina Caragea 2013 IEEE/WIC/ACM International Conferences.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Recommender Systems David M. Pennock NEC Research Institute contributions: John Riedl, GroupLens University of Minnesota.
1 Social Networks and Collaborative Filtering Qiang Yang HKUST Thanks: Sonny Chee.
Authors: Rosario Sotomayor, Joe Carthy and John Dunnion Speaker: Rosario Sotomayor Intelligent Information Retrieval Group (IIRG) UCD School of Computer.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
Collaborative Filtering  Introduction  Search or Content based Method  User-Based Collaborative Filtering  Item-to-Item Collaborative Filtering  Using.
Badrul M. Sarwar, George Karypis, Joseph A. Konstan, and John T. Riedl
The Effect of Dimensionality Reduction in Recommendation Systems
Collaborative Data Analysis and Multi-Agent Systems Robert W. Thomas CSCE APR 2013.
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction Jieming Zhu, Yu Kang, Zibin Zheng and Michael R. Lyu The.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Fuzzy integration of structure adaptive SOMs for web content.
Collaborative Filtering Zaffar Ahmed
Department of Automation Xiamen University
Recommendation Algorithms for E-Commerce. Introduction Millions of products are sold over the web. Choosing among so many options is proving challenging.
Venue Recommendation: Submitting your Paper with Style Zaihan Yang and Brian D. Davison Department of Computer Science and Engineering, Lehigh University.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Advisor-Advisee Relationships from Research Publication.
Anomaly Detection. Network Intrusion Detection Techniques. Ştefan-Iulian Handra Dept. of Computer Science Polytechnic University of Timișoara June 2010.
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
10/03/59 1 Recommendation Systems Sunantha Sodsee Information Technology King Mongkut’s University of Technology North Bangkok.
Personalizing the Web Todd Lanning Project 1 - Presentation CSE 8331 Dr. M. Dunham.
Collaborative Filtering: Searching and Retrieving Web Information Together Huimin Lu December 2, 2004 INF 385D Fall 2004 Instructor: Don Turnbull.
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
1 Top-N Recommendation Algorithm Based on Item-Graph Allen, Zhenjiang LIN CSE, CUHK June 7, 2007.
Recommender systems 06/10/2017 S. Trausan-Matu.
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
WSRec: A Collaborative Filtering Based Web Service Recommender System
Collaborative Filtering
Lin Lu, Margaret Dunham, and Yu Meng
ITEM BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHEMS
Privacy-Aware Tag Recommendation for Image Sharing
Presentation transcript:

Yoda: An Accurate and Scalable Web-based Recommendation Systems Cyrus Shahabi, Farnoush Banaei-Kashani, Yi-Shin Chen, and Dennis McLeod Integrated Media Systems Center and Computer Science Department, University of Southern California {shahabi, banaeika, yishinc,

Outline Motivation Related Work Content-based Filtering Collaborative Filtering Offline Process: Clustering, Voting, Aggregation Online Process: Classification & Aggregation Performance Evaluation Conclusion & Future Work

Motivation The amount of data is enormous on the Web Users suffer from information overload Recommendation systems can personalize and customize the Web environment in real-time Similar to Amazon.com “real-time” recommendations (people who bought this book also purchased …) Different approach (vs. association-rule mining) Challenges: Scalability : As the # of items and users grow, the system stay efficient Sparsity: Not enough information available on the user

Related Work: Content-Based Filtering From the Information Retrieval community [Maes1994] [Shardanand and Maes 1995] [Balabanovi and Shoham 1997] Based on a comparison between the feature vectors of items (e.g., artist, style) in the database and the user’s interest list Major weakness [Balabanovi and Shoham 1997] Content limitation: only can be applied to few kinds of content, can only capture certain aspects of the content Over-specialization: users can only obtain information based on the content of their profiles

Related Work: Collaborative Filtering(CF) Employ a user’s item evaluations (not the actual content) to find other similar users: nearest-neighbor algorithm [Resnick et al. 1994] Three major weaknesses Scalability: time complexity O(U*I) (I : #items, U: #users) Clustering [Breese et al. 2000] Bayesian network [Kitts et al. 2000] Sparsity: profile matrix (i.e., # of user evaluated items) is sparse SVD [Sarwar et al. 2000] Synonymy: latent association between items is not considered Content analysis [Balabanovi and Shoham 1997] Categorization [Kohrs and Merialdo 2000]

Fuzzy Aggregation Fuzzy Aggregation Clusters Offline Process PPED Similarity Measure and Clustering PPED Similarity Measure and Clustering User Navigation Behaviors User 1 User 2 User 3 User 4 User 5 User U-6 User U-5 User U-4 User U-3 User U-2 User U-1 User U User 6 Voting Favorite PVs (Rock= High Classical= Low Pop= Low Rap= High) Item Database Cluster Wish-list

Voting Mechanism Favorite PVs (Rock= High Classical= Low Pop= Low Rap= High Blues= Low) Rock Classical Pop Rap Blues High Low Mid High Low Property Values Voting Rock Classical Blues H M L H M L H M L C p,f (k) M pf =Max{ C p,f (k) } f in F

Ranking Items Item Database Cluster Wish-List Fuzzy Aggregation Fuzzy Aggregation fmax{ … } Favorite PVs (Rock= High Classical= Low Pop= Low Rap= High Blues= Low) F p (k) (High*High), (Mid*Low), (Low*Low) V k (i) Rock Classical Pop Rap Blues High Low Mid Mid Low Property Values Locality Sensitive Hashing algorithm

Favorite PVs (Rock= High Classical= Low Pop= Low Rap= High Blues= Low) Rock Classical Pop Rap Blues High Low Mid Mid Low Property Values Fuzzy Aggregation Fuzzy Aggregation fmax{ } Optimized Equation Why optimized: time complexity O(#P*I) (#P: # of properties, I: # of items) Intuition: the v k (i) value comes from the maximum value among M high (k) f (High*High), (Low*Mid)

Optimized Equation Time complexity: O(f*I) I=#items f=#fuzzy terms Satisfy a triangular norm form Time complexity can be further reduced to O(N) (N: constant number) by Fagin’s A 0 algorithm [Fagin 1996]

PPED Similarity Measure PPED Similarity Measure Fuzzy Aggregation Clusters Online Process Current User’s Navigation Behavior A List of Similarity Values User Wish-List Cluster Wish-lists

Optimized Method Original Time complexity: O(K*I) K=#clusters I=#items Time complexity of optimized method: O(f*I) f=#fuzzy terms Time complexity can be further reduced to O(N) (N: constant number) by Fagin’s A 0 algorithm [Fagin 1996]

Experimental Methodology Clustering Generate Item Database User Set Clusters Similarity Matrix cluster user Cluster Favorite PVs Ranking of Items in Clusters Generate User Navigation Behaviors Assign Property Values to Items: Item-PV = f(Cluster-PV, noise) noise ~ item-rank

Experimental Methodology Clusters Similarity Matrix cluster user Cluster Favorite PVs Ranking of Items in Clusters User Navigation Behaviors Item Database User Set H L M N F F L L M N F F L M N F F L M N F F L M N F F L L M N F F L Assign evaluation values to items Item-Rating = f(Cluster-Ranking, weight) weight ~ user-cluster similarities

Experimental Methodology Item Database User Set User Navigation Behaviors H L M N F F L L M N F F L M N F F L M N F F L M N F F L L M N F F L Training Testing Current SessionRecommendation

Accuracy Comparison Number of Items Harmonic Mean Improvement Nearest Neighbor MethodYodaImprovement

Processing Time Comparison Number of Users CPU Time (milliseconds/user) YodaBNN: Basic Nearest Neighbor Method Processing Time= CPU +IO In BNN process: #Items = 5000; #Users = 1000 In Yoda process: #Items in each cluster wish-list = 250 #Clusters = 18

Conclusion Yoda scales as the # of users/items grow Higher accuracy Future Work Compare other techniques Run more experiments with real data Incorporate the content-based filtering mechanism into the user clustering & classification phases Incorporate the user profiles

Reference [Shardanand and Maes 1995] U. Shardanand and P. Maes, Social Information Filtering: Algorithm for automating ''Word of Mouth'', proceedings on Human factors in computing systems, Denver,CO,USA, p , May, 1995 [Maes 1994] Pattie Maes, Agents that reduce work and information overload, Communications of the ACM, 37(7), p.30-40, 1994 [Balabanovi and Shoham 1997]Marko Balabanovi and Yoav Shoham, Fab: content- based, collaborative recommendation, Communications of the ACM, 40(3), p , 1997 [Resnick et al. 1994] P. Resnick and N. Iacovou and M. Suchak and P. Bergstrom and J. Riedl, GroupLens: An Open Architecture for Collaborative Filtering of Netnews, Proceedings of ACM conference on Cumputer-Supported Cooperative Work, Chapel Hill, NC, p , 1994 [Sarwar et al. 2000] B. Sarwar and G. Karypis and J. Konstan and J.Riedl, Application of Dimensionality Reduction in Recommender System -- A Case Study, ACM WebKDD 2000 Web Mining for E-Commerce Workshop, 2000 [Kohrs and Merialdo 2000] A. Kohrs and B. Merialdo, Using category-based collaborative filtering in the Active WebMuseum, Proceedings of IEEE International Conference on Multimedia and Expo, 1, p , 2000

Reference [Kitts et al. 2000] Brendan Kitts and David Freed and Martin Vrieze, Cross-sell: a fast promotion-tunable customer-item recommendation method based on conditionally independent probabilities, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, Boston, MA USA, p , August, 2000 [Breese et al. 2000] J. Breese and D. Heckerman and C. Kadie, Empirical Analysis of Predictive Algorithms for Collaborative Filtering, Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, Madison, WI USA, p.43-52, July, 1998 Shahabi C., A.M. Zarkesh, J. Adibi, and V. Shah: Knowledge, Discovery from Users Web Page Navigation, Proceedings of the IEEE, RIDE97 Workshop, April, Shahabi C., F. Banaei-Kashani, J. Faruque, and A. Faisal: Feature Matrices: A Model for Efficient and Anonymous Web Usage Mining, EC-Web 2001, Germany, September 2001 Fagin R.: Combining Fuzzy Information from Multiple Systems, Proceedings of Fifteenth ACM Symposyum on Principles of Database Systems, Montreal, pp , Shahabi C., and Y. Chen: A Unified Framework to Incorporate Soft Query into Image Retrieval Systems, International Conference on Enterprise Information Systems, Setubal, Portugal, July 2001