1 Topic Distributions over Links on Web Jie Tang 1, Jing Zhang 1, Jeffrey Xu Yu 2, Zi Yang 1, Keke Cai 3, Rui Ma 3, Li Zhang 3, and Zhong Su 3 1 Tsinghua.

Slides:



Advertisements
Similar presentations
iRobot: An Intelligent Crawler for Web Forums
Advertisements

TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Bellwork If you roll a die, what is the probability that you roll a 2 or an odd number? P(2 or odd) 2. Is this an example of mutually exclusive, overlapping,
Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. *See PowerPoint Lecture Outline for a complete, ready-made.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 116.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 40.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 28.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 38.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Growing Every Child! The following slides are examples of questions your child will use in the classroom throughout the year. The questions progress from.
Chapter 1 Image Slides Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
and 6.855J Cycle Canceling Algorithm. 2 A minimum cost flow problem , $4 20, $1 20, $2 25, $2 25, $5 20, $6 30, $
Exploring Traversal Strategy for Web Forum Crawling Yida Wang, Jiang-Ming Yang, Wei Lai, Rui Cai, Lei Zhang and Wei-Ying Ma Chinese Academy of Sciences.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULTIPLYING MONOMIALS TIMES POLYNOMIALS (DISTRIBUTIVE PROPERTY)
ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING Think Distributive property backwards Work down, Show all steps ax + ay = a(x + y)
Addition Facts
Year 6 mental test 10 second questions Numbers and number system Numbers and the number system, fractions, decimals, proportion & probability.
ZMQS ZMQS
© S Haughton more than 3?
15. Oktober Oktober Oktober 2012.
1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.
Linking Verb? Action Verb or. Question 1 Define the term: action verb.
Squares and Square Root WALK. Solve each problem REVIEW:
We are learning how to read the 24 hour clock
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
Chapter 5 Test Review Sections 5-1 through 5-4.
SIMOCODE-DP Software.
1 First EMRAS II Technical Meeting IAEA Headquarters, Vienna, 19–23 January 2009.
Addition 1’s to 20.
25 seconds left…...
Test B, 100 Subtraction Facts
Week 1.
Vanderbilt Business Objects Users Group 1 Linking Data from Multiple Sources.
Visions of Australia – Regional Exhibition Touring Fund Applicant organisation Exhibition title Exhibition Sample Support Material Instructions 1) Please.
We will resume in: 25 Minutes.
1 Unit 1 Kinematics Chapter 1 Day
Highlights From the Survey on the Use of Funds Under Title II, Part A
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
Learning to Recommend Questions Based on User Ratings Ke Sun, Yunbo Cao, Xinying Song, Young-In Song, Xiaolong Wang and Chin-Yew Lin. In Proceeding of.
RollCaller: User-Friendly Indoor Navigation System Using Human-Item Spatial Relation Yi Guo, Lei Yang, Bowen Li, Tianci Liu, Yunhao Liu Hong Kong University.
Mining Triadic Closure Patterns in Social Networks
1 Zi Yang, Wei Li, Jie Tang, and Juanzi Li Knowledge Engineering Group Department of Computer Science and Technology Tsinghua University, China {yangzi,
Finding Topic-sensitive Influential Twitterers Presenter 吴伟涛 TwitterRank:
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
1 A Topic Modeling Approach and its Integration into the Random Walk Framework for Academic Search 1 Jie Tang, 2 Ruoming Jin, and 1 Jing Zhang 1 Knowledge.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
1 A Discriminative Approach to Topic- Based Citation Recommendation Jie Tang and Jing Zhang Presented by Pei Li Knowledge Engineering Group, Dept. of Computer.
Example 16,000 documents 100 topic Picked those with large p(w|z)
1 Linmei HU 1, Juanzi LI 1, Zhihui LI 2, Chao SHAO 1, and Zhixing LI 1 1 Knowledge Engineering Group, Dept. of Computer Science and Technology, Tsinghua.
Local Linear Matrix Factorization for Document Modeling Institute of Computing Technology, Chinese Academy of Sciences Lu Bai,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Advisor-Advisee Relationships from Research Publication.
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
1 Zi Yang Tsinghua University Joint work with Prof. Jie Tang, Prof. Juanzi Li, Dr. Keke Cai, Jingyi Guo, Chi Wang, etc. July 21, 2011, CASIN 2011, Tsinghua.
1 Zi Yang Tsinghua University Joint work with Prof. Jie Tang, Prof. Juanzi Li, Dr. Keke Cai, Jingyi Guo, Chi Wang, etc. July 21, 2011, CASIN 2011, Tsinghua.
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Jinwen Guo, Shengliang Xu, Shenghua Bao, and Yong Yu
Presentation transcript:

1 Topic Distributions over Links on Web Jie Tang 1, Jing Zhang 1, Jeffrey Xu Yu 2, Zi Yang 1, Keke Cai 3, Rui Ma 3, Li Zhang 3, and Zhong Su 3 1 Tsinghua University 2 Chinese University of Hong Kong 3 IBM, China Research Lab Dec. 7 th 2009

2 Motivation Web users create links with significantly different intentions Understanding of the category and the influence of each link can benefit many applications, e.g., –Expert finding –Collaborator finding –New friends recommendation –…

3 Original citation networkSemantic citation network Examples – Topic distribution analysis over citations Researcher A an in-depth understanding of the research field? VS.

4 Problem: Link Semantic Analysis Topic modeling over links Citation context words Link semantics

5 Outline Previous Work Our Approach –Pairwise Restricted Boltzmann Machines (PRBMs) Experimental Results Conclusion & Future Work

6 Previous Work Link influence analysis Citation influence topic [Dietz, 07]; Social influence analysis [Crandall, 08; Tang, 09]; Graphical model Probabilistic LSI [Hofmann, 99], Latent Dirichlet Allocation [Blei, 03], Restricted Boltzmann machines [Welling, 01] Social network analysis Social network analysis [Wasserman, 94] Web community discovery [Newman, 04] Small world networks [Watts, 18]

7 Outline Previous Work Our Approach –Pairwise Restricted Boltzmann Machines (PRBMs) Experimental Results Conclusion & Future Work

8 Pairwise Restricted Boltzmann Machines (PRBMs) Link context words Topic distribution Link category Latent variables defined over the link to bridge the two pages Pairwise Restricted Boltzmann Machines (PRBMs) Example

9 Formalization of PRBMs Formalization PRBMs Obj. Func: with

10 Model Learning Generative learning Discriminative learning Hybrid learning Obj. Func: Expectation w.r.t. the data distribution Expectation w.r.t. the distribution defined by the model We use the Contrast Divergence to learn the model distribution P M

11 Link Semantic Analysis Link category annotation –First we calculate –Then we estimate the probability p(c|e) by a mean field algorithm Link influence estimation –Estimate influence by KL divergence –An alternative way is to generate the influence score by a Gaussian distribution, thus

12 Outline Previous Work Our Approach –Pairwise Restricted Boltzmann Machines (PRBMs) Experimental Results Conclusion & Future Work

13 Experimental Setting Data sets –Arnetminer data: 978,504 papers, 14M citations –Wikipedia: 14K article pages and 25 K links Evaluation measures –Link categorization accuracy –Topical analysis Baselines: –SVM+LDA –SVM+RBM

14 Accuracy of Link Categorization gPRBM: our approach with generative learning dPRBM: our approach with discriminative learning hPRBM: our approach with hybrid learning

15 Category-Topic Mixture

16 Example Analysis

17 Outline Previous Work Our Approach –Pairwise Restricted Boltzmann Machines (PRBMs) Experimental Results Conclusion & Future Work

18 Conclusion & Future Work Concluding remarks –Investigate the problem of quantifying link semantics on the Web –Propose a Pairwise Restricted Boltzmann Machines to solve this problem Future Work –Semantic analysis over social relationships –Correlation between the link semantics and the information propagation

19 Thanks! Q&A HP: