Mingzhen Mo and Irwin King

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

January 23 rd, Document classification task We are interested to solve a task of Text Classification, i.e. to automatically assign a given document.
+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Transfer Learning for WiFi-based Indoor Localization
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Co-Training and Expansion: Towards Bridging Theory and Practice Maria-Florina Balcan, Avrim Blum, Ke Yang Carnegie Mellon University, Computer Science.
Pseudoinverse Learning Algorithm for Feedforward Neural Networks Guo, Ping Department of Computer Science & Engineering, The Chinese University of Hong.
Jierui Xie, Boleslaw Szymanski, Mohammed J. Zaki Department of Computer Science Rensselaer Polytechnic Institute Troy, NY 12180, USA {xiej2, szymansk,
Efficient Convex Relaxation for Transductive Support Vector Machine Zenglin Xu 1, Rong Jin 2, Jianke Zhu 1, Irwin King 1, and Michael R. Lyu 1 4. Experimental.
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Non-fixed and Asymmetrical Margin Approach to Stock Market Prediction using Support Vector Regression Haiqin Yang, Irwin King and Laiwan Chan Department.
Semi-Supervised Clustering Jieping Ye Department of Computer Science and Engineering Arizona State University
1 PageSim: A Link-based Similarity Measure for the World Wide Web Zhenjiang Lin, Irwin King, and Michael, R., Lyu Computer Science & Engineering, The Chinese.
A Study of the Relationship between SVM and Gabriel Graph ZHANG Wan and Irwin King, Multimedia Information Processing Laboratory, Department of Computer.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin The Chinese.
Structure based Data De-anonymization of Social Networks and Mobility Traces Shouling Ji, Weiqing Li, and Raheem Beyah Georgia Institute of Technology.
1 Efficiently Learning the Accuracy of Labeling Sources for Selective Sampling by Pinar Donmez, Jaime Carbonell, Jeff Schneider School of Computer Science,
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
Preserving Link Privacy in Social Network Based Systems Prateek Mittal University of California, Berkeley Charalampos Papamanthou.
Hierarchical Distributed Genetic Algorithm for Image Segmentation Hanchuan Peng, Fuhui Long*, Zheru Chi, and Wanshi Siu {fhlong, phc,
Page 1 Ming Ji Department of Computer Science University of Illinois at Urbana-Champaign.
Crowdsourcing for Spoken Dialogue System Evaluation Ling 575 Spoken Dialog April 30, 2015.
Xiangnan Kong,Philip S. Yu Department of Computer Science University of Illinois at Chicago KDD 2010.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Question Routing in Community Question Answering: Putting Category in Its Place 1 The Chinese University of Hong Kong, Shatin, N.T., Hong Kong 2 AT&T Labs.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
Project by: Cirill Aizenberg, Dima Altshuler Supervisor: Erez Berkovich.
Optimal Dimensionality of Metric Space for kNN Classification Wei Zhang, Xiangyang Xue, Zichen Sun Yuefei Guo, and Hong Lu Dept. of Computer Science &
Support Vector Machines and Gene Function Prediction Brown et al PNAS. CS 466 Saurabh Sinha.
Hongbo Deng, Michael R. Lyu and Irwin King
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
ICONIP 2010, Sydney, Australia 1 An Enhanced Semi-supervised Recommendation Model Based on Green’s Function Dingyan Wang and Irwin King Dept. of Computer.
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
Privacy Preserving in Social Network Based System PRENTER: YI LIANG.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Semi-Supervised Learning with Graph Transduction Project 2 Due Nov. 7, 2012, 10pm EST Class presentation on Nov. 12, 2012.
CopyCatch: Stopping Group Attacks by Spotting Lockstep Behavior in Social Networks (WWW2013) BEUTEL, ALEX, WANHONG XU, VENKATESAN GURUSWAMI, CHRISTOPHER.
Personality Classification: Computational Intelligence in Psychology and Social Networks A. Kartelj, School of Mathematics, Belgrade V. Filipovic, School.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
SemiBoost : Boosting for Semi-supervised Learning Pavan Kumar Mallapragada, Student Member, IEEE, Rong Jin, Member, IEEE, Anil K. Jain, Fellow, IEEE, and.
Computer Science and Engineering Jianye Yang 1, Ying Zhang 2, Wenjie Zhang 1, Xuemin Lin 1 Influence based Cost Optimization on User Preference 1 The University.
Experience Report: System Log Analysis for Anomaly Detection
A Collaborative Quality Ranking Framework for Cloud Components
Semi-Supervised Clustering
CACTUS-Clustering Categorical Data Using Summaries
Privacy-Preserving Clustering
Sofus A. Macskassy Fetch Technologies
Prepared by: Mahmoud Rafeek Al-Farra
WSRec: A Collaborative Filtering Based Web Service Recommender System
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Video Summarization by Spatial-Temporal Graph Optimization
COMBINED UNSUPERVISED AND SEMI-SUPERVISED LEARNING FOR DATA CLASSIFICATION Fabricio Aparecido Breve, Daniel Carlos Guimarães Pedronette State University.
Dieudo Mulamba November 2017
Machine Learning Week 1.
Community Distribution Outliers in Heterogeneous Information Networks
Soft Error Detection for Iterative Applications Using Offline Training
Zhenjiang Lin, Michael R. Lyu and Irwin King
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
GANG: Detecting Fraudulent Users in OSNs
Zhedong Zheng, Liang Zheng and Yi Yang
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
Three steps are separately conducted
Sofia Pediaditaki and Mahesh Marina University of Edinburgh
Web Page Classification with Heterogeneous Data Fusion
Using Clustering to Make Prediction Intervals For Neural Networks
Towards Maximum Independent Sets on Massive Graphs
Machine Learning.
Presentation transcript:

Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering, The Chinese University of Hong Kong

Problem Hidden Information ? Some Technique Users’ Profile Friendship Group & Network Friendship Users’ Profile Some Technique ICONIP 2010, Sydney

Objective Build a model with proper algorithm to predict the unknown information Better utilize community information Previous Works Graph Theory [G. Flake et al, SIGKDD 2000] Supervised Learning [E. Zheleva et al, WWW2009] Semi-Supervised Laerning [M. Mo et al, IJCNN2010] ICONIP 2010, Sydney

Contributions A novel model and algorithms are proposed Help to understand the security level in social networks. Community Consistency Community-Based Graph (CG) SSL Model 3 Local Consistency Global Consistency Basic Graph-Based SSL with Harmonic Function Local and Global Consistency (LGC) Graph SSL Model 1 Model 2 ICONIP 2010, Sydney

Model Preparation Online Social Network: G(V, E) Two Sets: labeled data & unlabeled data P3 P3 W1,3 W3,5 Y1 Y5 P1 P1 W3,4 P5 P5 W1,2 W4,5 P2 P2 P4 P4 W2,4 ICONIP 2010, Sydney

Local & Global Consistency (LGC) Learning Model Objective is the Laplace Matrix of community info . , and Local & Global Consistency (LGC) Learning Community Term Improve….. ICONIP 2010, Sydney

Model Generating Clustering vertices “Distance” is measured by Group and Network info. Mark down each cluster in a matrix _ , nc is the number of clusters ICONIP 2010, Sydney

Model Algorithms Closed form algorithm Simple and time-saving Input Process Output ICONIP 2010, Sydney

Model Algorithms Iterative algorithm Able to deal with large-scale data Input Process False True Output ICONIP 2010, Sydney

Experiments Datasets Objectives Evaluation One synthetic dataset: TwoMoons Two real-world datasets: StudiVZ & Facebook Objectives classification in TwoMoons Predict university names in StudiVZ & Facebook Evaluation Accuracy and Confidence ICONIP 2010, Sydney

Experiments Datasets Improve….. ICONIP 2010, Sydney

Experiments Results – Accuracy of Prediction (a) TwoMoons (b) StudiVZ (c) Facebook ICONIP 2010, Sydney

Experiments CG SSL performs best in most cases The curves of CG SSL are not monotone increasing The accuracies on Facebook dataset are less than the others ICONIP 2010, Sydney

Conclusion Community-based Graph SSL model describes the real world exactly CG SSL predicts the hidden information of online social networks with higher accuracy and confidence The security of users’ information is no longer secure ICONIP 2010, Sydney

Thank you Q & A ICONIP 2010, Sydney