The Matrix: Using Intermediate Features to Classify and Predict Friends in a Social Network Michael Matczynski 6.338 Status Report April 14, 2006.

Slides:



Advertisements
Similar presentations
A PowerPoint Presentation
Advertisements

CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1.
Social Media Mining Chapter 5 1 Chapter 5, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010.
INFORMATION REVELATION AND PRIVACY IN ONLINE SOCIAL NETWORKS (THE FACEBOOK CASE) Rachid Hamcha IMM.
Collaborative Filtering in iCAMP Max Welling Professor of Computer Science & Statistics.
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
Scaling Distributed Machine Learning with the BASED ON THE PAPER AND PRESENTATION: SCALING DISTRIBUTED MACHINE LEARNING WITH THE PARAMETER SERVER – GOOGLE,
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
To Join or Not to Join: The Illusion of Privacy in Social Networks with Mixed Public and Private User Profiles By Elena Zheleva, Lise Getoor Presented.
CES 514 – Data Mining Lecture 8 classification (contd…)
Recommendations via Collaborative Filtering. Recommendations Relevant for movies, restaurants, hotels…. Recommendation Systems is a very hot topic in.
A Model of Communities Multi-Layered Networks John Stevens Introduction This poster describes a multi-layered model of a community. This model builds on.
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
ISP 433/633 Week 7 Web IR. Web is a unique collection Largest repository of data Unedited Can be anything –Information type –Sources Changing –Growing.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
PageRank Identifying key users in social networks Student : Ivan Todorović, 3231/2014 Mentor : Prof. Dr Veljko Milutinović.
HCC class lecture 22 comments John Canny 4/13/05.
UT DALLAS Erik Jonsson School of Engineering & Computer Science FEARLESS engineering Security and Privacy in Social Networks Raymond Heatherly Data Security.
Big data analytics with R and Hadoop Chapter 5 Learning Data Analytics with R and Hadoop 데이터마이닝연구실 김지연.
COMMUNITIES IN MULTI-MODE NETWORKS 1. Heterogeneous Network Heterogeneous kinds of objects in social media – YouTube Users, tags, videos, ads – Del.icio.us.
Enkh-Amgalan Baatarjav Jedsada Chartree Thiraphat Meesumrarn University of North Texas.
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Social Networking and On-Line Communities: Classification and Research Trends Maria Ioannidou, Eugenia Raptotasiou, Ioannis Anagnostopoulos.
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
Kristina Lerman Aram Galstyan USC Information Sciences Institute Analysis of Social Voting Patterns on Digg.
Artificial Neural Networks (ANN). Output Y is 1 if at least two of the three inputs are equal to 1.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Enkh-Amgalan Baatarjav Jedsada Chartree Thiraphat Meesumrarn University of North Texas.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
Random Walk with Restart (RWR) for Image Segmentation
NEURAL NETWORKS FOR DATA MINING
Searching and Browsing Using Tags Nikos Sarkas Social Information Systems Seminar DCS, University of Toronto, Winter 2007.
Problems in large-scale computer vision David Crandall School of Informatics and Computing Indiana University.
The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign User Profiling in Ego-network: Co-profiling Attributes and Relationships.
Healthcare Process Modelling by Rule Based Networks Han Liu First Year PhD Student Alex Gegov, Jim Briggs, Mohammed Bader PhD Supervisors.
Nicholas D. Lane, Hong Lu, Shane B. Eisenman, and Andrew T. Campbell Presenter: Pete Clements Cooperative Techniques Supporting Sensor- based People-centric.
Social networking. Social networking services A Social networking services : is online community used between people who share the same interests, so.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Preventing Private Information Inference Attacks on Social Networks.
1 Lecture 6 Neural Network Training. 2 Neural Network Training Network training is basic to establishing the functional relationship between the inputs.
KNN CF: A Temporal Social Network kNN CF: A Temporal Social Network Neal Lathia, Stephen Hailes, Licia Capra University College London RecSys ’ 08 Advisor:
Characterization of a Computational Grid as a Complex System Lovro Ilijasic ( Lorenza Saitta
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
Project 1 : Phase 1 22C:021 CS II Data Structures.
Design and Analysis of Algorithms Introduction to graphs, representations of a graph Haidong Xue Summer 2012, at GSU.
Soft Computing Lecture 15 Constructive learning algorithms. Network of Hamming.
Computing and Information Sciences Kansas State University ANNIE Conference November 10, 2008 Predicting Links and Link Change in Friends Networks: Supervised.
Applying Link-based Classification to Label Blogs Smriti Bhagat, Irina Rozenbaum Graham Cormode.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Alan Mislove Bimal Viswanath Krishna P. Gummadi Peter Druschel.
1 CS 430 / INFO 430: Information Retrieval Lecture 20 Web Search 2.
To Join or Not to Join: The Illusion of Privacy in social Networks with Mixed Public and Private User Profiles Paper by Elena Zhelea and Lise Getoor.
Big data classification using neural network
Security and Privacy in Social Networks
CPS : Information Management and Mining
Semi-Supervised Clustering
Distributed voting application for handheld devices
Dieudo Mulamba November 2017
Unsupervised Learning and Autoencoders
Using Friendship Ties and Family Circles for Link Prediction
COS 518: Advanced Computer Systems Lecture 12 Mike Freedman
Prepared by: Mahmoud Rafeek Al-Farra
Security and Privacy in Social Networks
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
CSE 491/891 Lecture 25 (Mahout).
Security and Privacy in Social Networks
Recommender Systems Group 6 Javier Velasco Anusha Sama
UCF-REU in Computer Vision
Presentation transcript:

The Matrix: Using Intermediate Features to Classify and Predict Friends in a Social Network Michael Matczynski Status Report April 14, 2006

Vision In order to successfully classify users in a social network such as facebook.com, we should leverage intermediate features. In order to successfully classify users in a social network such as facebook.com, we should leverage intermediate features.

Steps 1. Gather profile, friend, and group data from all MIT users on facebook.com 2. Build graph 3. Develop PageRank algorithm to determine profile popularity 4. Generate intermediate features from profiles 5. Develop algorithm to identify similarities between all users 6. Develop online interface for users

1. Gather Data Gathered data from 11,744 MIT profiles Gathered data from 11,744 MIT profiles Profile data (major, living group, etc) Profile data (major, living group, etc) Friend information (to build the graph) Friend information (to build the graph)

2. Build Graph Due to privacy settings, not all friend information is available Due to privacy settings, not all friend information is available Nonetheless, because a friendship link is undirected, the friends of users with strict privacy settings can mostly be deduced Nonetheless, because a friendship link is undirected, the friends of users with strict privacy settings can mostly be deduced

3. PageRank Algorithm Google’s PageRank Algorithm determines important nodes of a graph by using each link as a vote for that particular node Google’s PageRank Algorithm determines important nodes of a graph by using each link as a vote for that particular node Run Time: Run Time: <1sec / iteration <1sec / iteration PageRank converges within 20 iterations PageRank converges within 20 iterations Results: Due to the undirected nature of social networks, PageRank is highly correlated with number of friends  Not that useful Results: Due to the undirected nature of social networks, PageRank is highly correlated with number of friends  Not that useful

4. Generate Intermediate Features from Profiles Classical Musicians John Williams Fans College Democrats College Republicans Napolean Dynamite is a Retarded Movie Pirates of the Caribbean +Music+Music +Politics, +Liberal +Politics, -Liberal +Movies, -Napolean +Movies, +Pirates

5. Identify Similar Users Modified PageRank Algorithm Modified PageRank Algorithm One network for each attribute (ie. Music) One network for each attribute (ie. Music) Resulting PageRank would indicate clusters of similar interest Resulting PageRank would indicate clusters of similar interest Neural Networks Neural Networks Train neural network with known friends and learn about similarities / classifications Train neural network with known friends and learn about similarities / classifications

6. Online Interface If interesting results emerge, develop an online interface so members of the MIT community can learn about themselves If interesting results emerge, develop an online interface so members of the MIT community can learn about themselves

Next Steps Generate intermediate features Generate intermediate features Determine classification algorithm Determine classification algorithm Parallel computation Parallel computation