Content Management & Hashtag Recommendation IN P2P OSN By Keerthi Nelaturu.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Social media for business by Frank Flores Hash Cloud Studio A Creative Marketing Agency 200 Industrial Rd. Suite 155 San Carlos, CA (650)
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Entity-Centric Topic-Oriented Opinion Summarization in Twitter Date : 2013/09/03 Author : Xinfan Meng, Furu Wei, Xiaohua, Liu, Ming Zhou, Sujian Li and.
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Finding Topic-sensitive Influential Twitterers Presenter 吴伟涛 TwitterRank:
One Theme in All Views: Modeling Consensus Topics in Multiple Contexts Jian Tang 1, Ming Zhang 1, Qiaozhu Mei 2 1 School of EECS, Peking University 2 School.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
NHnetWORKS December 14,  Facebook is a global Social Networking website that is operated and privately owned by Facebook, Inc.  Users can add.
Web 2.0: Concepts and Applications 5 Connecting People.
Web 2.0: Concepts and Applications 5 Connecting People.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
6/2/ An Automatic Personalized Context- Aware Event Notification System for Mobile Users George Lee User Context-based Service Control Group Network.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
A S URVEY OF TRUST MANAGEMENT AND ITS APPLICATIONS S UPERVISED BY : D R. Y AN W ANG Ravendra Singh Student-id:
Caimei Lu et al. (KDD 2010) Presented by Anson Liang.
Gossip-based Search Selection in Hybrid Peer-to-Peer Networks M. Zaharia and S. Keshav D.R.Cheriton School of Computer Science University of Waterloo,
 Manmatha MetaSearch R. Manmatha, Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts, Amherst.
Computing Trust in Social Networks
MusicSense: Contextual Music Recommendation using Emotional Allocation Modeling Rui Cai, Chao Zhang, Chong Wang, Lei Zhang, and Wei-Ying Ma Proceedings.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
A Social Help Engine for Online Social Network Mobile Users Tam Vu, Akash Baid WINLAB, Rutgers University May 21,
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
A Privacy Preserving Efficient Protocol for Semantic Similarity Join Using Long String Attributes Bilal Hawashin, Farshad Fotouhi Traian Marius Truta Department.
Projects ( ) Ida Mele. Rules Students have to work in teams (max 2 people). The project has to be delivered by the deadline that will be published.
1/18 Hypertwitter Collaborative Knowledge Engineering via Twitter Messages Зорана Васиљевић 09/3256
1 Speaker : 童耀民 MA1G Authors: Ze Li Dept. of Electr. & Comput. Eng., Clemson Univ., Clemson, SC, USA Haiying Shen ; Hailang Wang ; Guoxin.
As news analysis tool SNATZ TECHNOLOGY. Main terms used in presentation Term – a phrase, which system uses for training NLP algorithms. Summary – a phrase,
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
Advanced Software Engineering PROJECT. 1. MapReduce Join (2 students)  Focused on performance analysis on different implementation of join processors.
Learning to Classify Short and Sparse Text & Web with Hidden Topics from Large- scale Data Collections Xuan-Hieu PhanLe-Minh NguyenSusumu Horiguchi GSIS,
Popularity-Aware Topic Model for Social Graphs Junghoo “John” Cho UCLA.
Improved search for Socially Annotated Data Authors: Nikos Sarkas, Gautam Das, Nick Koudas Presented by: Amanda Cohen Mostafavi.
Recommendation system MOPSI project KAROL WAGA
User Profiling based on Folksonomy Information in Web 2.0 for Personalized Recommender Systems Huizhi (Elly) Liang Supervisors: Yue Xu, Yuefeng Li, Richi.
Laboratory for InterNet Computing CSCE 561 Social Media Projects Ryan Benton October 8, 2012.
Pete Bohman Adam Kunk. Real-Time Search  Definition: A search mechanism capable of finding information in an online fashion as it is produced. Technology.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
2010 Summer Training Series Teaching & Learning: Professional Development Institute Don Blake, NEA.
Recommending Twitter Users to Follow Using Content and Collaborative Filtering Approaches John HannonJohn Hannon, Mike Bennett, Barry SmythBarry Smyth.
Chapter 6: Information Retrieval and Web Search
Finding the Hidden Scenes Behind Android Applications Joey Allen Mentor: Xiangyu Niu CURENT REU Program: Final Presentation 7/16/2014.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Probabilistic Models for Discovering E-Communities Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha The Pennsylvania State University WWW.
PCI th Panhellenic Conference in Informatics Clustering Documents using the 3-Gram Graph Representation Model 3 / 10 / 2014.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Topic Modeling using Latent Dirichlet Allocation
1 A Biterm Topic Model for Short Texts Xiaohui Yan, Jiafeng Guo, Yanyan Lan, Xueqi Cheng Institute of Computing Technology, Chinese Academy of Sciences.
Concept-based P2P Search How to find more relevant documents Ingmar Weber Max-Planck-Institute for Computer Science Joint work with Holger Bast Torino,
Link Distribution on Wikipedia [0407]KwangHee Park.
Query Suggestions in the Absence of Query Logs Sumit Bhatia, Debapriyo Majumdar,Prasenjit Mitra SIGIR’11, July 24–28, 2011, Beijing, China.
An overlay for latency gradated multicasting Anwitaman Datta SCE, NTU Singapore Ion Stoica, Mike Franklin EECS, UC Berkeley
Don’t Follow me : Spam Detection in Twitter January 12, 2011 In-seok An SNU Internet Database Lab. Alex Hai Wang The Pensylvania State University International.
Guided By Ms. Shikha Pachouly Assistant Professor Computer Engineering Department 2/29/2016.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Mining Tag Semantics for Social Tag Recommendation Hsin-Chang Yang Department of Information Management National University of Kaohsiung.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
Presented by: Siddhant Kulkarni Spring Authors: Publication:  ICDE 2015 Type:  Research Paper 2.
Inferring User Interest Familiarity and Topic Similarity with Social Neighbors in Facebook INSTRUCTOR: DONGCHUL KIM ANUSHA BOOTHPUR
1 Text Categorization  Assigning documents to a fixed set of categories  Applications:  Web pages  Recommending pages  Yahoo-like classification hierarchies.
Grow Your Business with Social Marketing
Collaborative Deep Learning for Recommender Systems
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Topic Modeling for Short Texts with Auxiliary Word Embeddings
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
Community-based User Recommendation in Uni-Directional Social Networks
Knowledge Sharing Mechanism in Social Networking for Learning
Presentation transcript:

Content Management & Hashtag Recommendation IN P2P OSN By Keerthi Nelaturu

Challenges with current Social Networks Personal data left with Service Provider even when Social graph is removed Control of the User-generated content with Service Provider “Targeted Advertising” Data being sold to third-party companies without user’s approval (Ex: Facebook beacon) Solution: Peer-to-Peer(P2P) Architecture instead of Centralized Architecture

Peer-to-Peer Social Network Utilizing advantages of P2P computing in-order to avoid issues with current OSN. Client needs to install Java Desktop Application BATON overlay for the P2P Architecture (Balanced binary Tree Overlay Network) Relational Database – PostgreSQL Open Source Amazon EC2 Cloud platform for Servers Bootstrap Peer Server Peer Client Peer

Content organization in our P2P Social Network Bootstrap Peer Server Peer Bootstrap Peer Functionalities New user joining / leaving Auto fail-over Auto scaling Server Peer Functionalities Schema Mapping Data loader Data Indexer Access Control

Content organization in our P2P Social Network Categories of Content Posts Comments Blogs Articles Group Discussions Attachments Privacy Control Options Public Self Friends Only

About Hashtag Introduced by Twitter network in August 2007 String of words or characters preceded by “#” symbol Used for : 1.Forming groups 2.Highlight events or disasters and any information being posted by users under one context 3.Organize personal content

Common issues with Hashtags Ambiguous tags Ex: “apple” – may refer to fruit or mac product Heterogeneous tags Ex: Multi-lingual, acronyms, synonyms, plurals Expertise of the user adding hashtag Solution: Provide organization and relationship between hashtags Reference: HyperTwitter: Collaborative Knowledge Engineering via Twitter Messages – Martin Hepp

Hashtag Syntax For getting additional information from user Consolidating multiple synonymous hashtags Letting others know about relations between tags Syntax:

Hashtag Recommendation System

Recommendation Systems Content-based – Items already used by a user Collaborative Filtering – based on interests from similar users Hybrid

Related Work Authors: Su Mon Kywe & Tuan-Anh Hoang et.al Personalized hashtag recommendation Select hashtags from similar users and similar tweets and rank them TF-IDF scheme used for measuring weights of content and cosine similarity for comparison

Related Work Authors: Eva Zangerle, Wolfgang Gassler & Gunther Specht Find most similar messages in a crawled data set for the tweet entered by the user Retrieving the set of hashtags used within these most similar messages Ranking the computed set of hashtag recommendation candidates TF-IDF scheme used for measuring similarity between tweets

Related Work Authors: Fréderic Godin, Viktor Slavkovikj, Wesley De Neve Topic model for recommendation – LDA (Latent Dirichlet Allocation) Recommend general hashtag topics by identifying keywords Do not consider user preferences

Drawbacks with previous models TF-IDF works good for short messages Suggested tags are sparse – 77% of hashtags used only once, 94% not used more than five times Mostly used when recommending a maximum of five hashtags Using topic model, keywords for hashtags are recommended

Our Recommendation System Based on probabilistic topic model – LDA Both content-based and collaborative filtering methods For recommending hashtags and also content

About LDA – Latent Dirichlet Allocation Hidden topic model Discover topics from large set of documents Gives topic-word distribution Gibbs Sampling algorithm t1 tn w1wm weight

About LDA – Latent Dirichlet Allocation Training Phase Inference Phase doc1 docn t1tm weight

Offline Sampling Collection of Training Dataset JGibbLDA Content From Network JGibbLDA Final Doc- Topic Distribution JGibbLDA : TrainingInference

Content Types LDA User- generated content Content from Friends Content from Similar Users Location based Overall Popularity Keywords

Content Types User Generated Content Pass on all of the user’s data to LDA Identify Topics of Interest (TOI) Recommend hashtags from docs with TOI greater than threshold Content from Friends Pass on publicly available data from Friends to LDA Identify documents with similar TOI using Cosine similarity Recommend hashtags from docs with similar TOI

Data Retrieval Considerations in P2P OSN Overall Popular hashtags stored in bootstrap peer LDA models sampled for global data also in bootstrap peer Only “public” content is collected for options Friends and Similar Users New User will receive recommendations based on Overall Popularity and Location based

Conclusion & Future Work Simulation model for evaluation – Yet to! Users to choose their own hashtag recommendation approach Include hashtag-topic distribution along with topic- word distribution Embed Ecommerce capabilities into the application

Thank you!