Co-evolution of network structure and content Lada Adamic School of Information & Center for the Study of Complex Systems University of Michigan.

Slides:



Advertisements
Similar presentations
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Advertisements

Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Self-introduction Name:  鲍鹏 (Peng Bao) Research Interests:  Popularity Prediction, Information Diffusion, Social Network , etc… Grade:  In the third.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Analysis and Modeling of Social Networks Foudalis Ilias.
Comparison of Social Networks by Likhitha Ravi. Outline What is a social network? Elements of social network Previous studies What is missing in previous.
Relationship Mining Network Analysis Week 5 Video 5.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
We Know #Tag: Does the Dual Role Affect Hashtag Adoption? Lei Yang 1, Tao Sun 2, Ming Zhang 2, Qiaozhu Mei 1 1 School of Information, the University.
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Social networks and information sharing Lada Adamic.
Blogosphere  What is blogosphere?  Why do we need to study Blog-space or Blogosphere?
Advanced Topics in Data Mining Special focus: Social Networks.
Social Networks: Advertising, Pricing and All That Zvi Topol & Itai Yarom.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
U. Michigan participation in EDIN Lada Adamic, PI E 2.1 fractional immunization of networks E 2.1 time series analysis approach to correlating structure.
1 Robert Engle UCSD and NYU July WHAT IS LIQUIDITY? n A market with low “transaction costs” including execution price, uncertainty and speed n.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Leveraging Big Data: Lecture 11 Instructors: Edith Cohen Amos Fiat Haim Kaplan Tova Milo.
Models of Influence in Online Social Networks
JP Nijjer – Head Analyst. 2 Everyone Loves Dilbert Introduction Time Value of Money Financial Products StocksConclusion UTEFA.
Social Network Analysis via Factor Graph Model
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
LANGUAGE NETWORKS THE SMALL WORLD OF HUMAN LANGUAGE Akilan Velmurugan Computer Networks – CS 790G.
Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution 3.0 License.
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Sponsor: Dr. K.C. Chang Tony Chen Ehsan Esmaeilzadeh Ali Jarvandi Ning Lin Ryan O’Neil Spring 2010.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
TWITTER What is Twitter, a Social Network or a News Media? Haewoon Kwak Changhyun Lee Hosung Park Sue Moon Department of Computer Science, KAIST, Korea.
Google News Personalization: Scalable Online Collaborative Filtering
A Graph-based Friend Recommendation System Using Genetic Algorithm
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 25, 2012.
Social Network Metrics. Types of network metrics Group level – Density – Components Isolates – Cliques – Centralization Degree Closeness Betweenness –
Structural Properties of Networks: Introduction Networked Life NETS 112 Fall 2015 Prof. Michael Kearns.
Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012.
Lecture 13: Network centrality Slides are modified from Lada Adamic.
Web Intelligence Complex Networks I This is a lecture for week 6 of `Web Intelligence Example networks in this lecture come from a fabulous site of Mark.
1 Chapter 20 Benching the Equity Players Portfolio Construction, Management, & Protection, 4e, Robert A. Strong Copyright ©2006 by South-Western, a division.
1 Agribusiness library LESSON : Applying Trading Techniques.
Slides are modified from Lada Adamic
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
1 Agribusiness Library Lesson : Options. 2 Objectives 1.Describe the process of using options on futures contracts, and define terms associated.
Network Community Behavior to Infer Human Activities.
Measuring Behavioral Trust in Social Networks
Social Interactions and Commerce (in Second Life) April 4, 2011.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Statistics Unit 9 only requires us to do Sections 1 & 2. * If we have time, there are some topics in Sections 3 & 4, that I will also cover. They tie in.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Chapter 2 Data in Science. Section 1: Tools and Models.
Twitter Community Discovery & Analysis Using Topologies
Informatics tools in network science
© Vipin Kumar IIT Mumbai Case Study 2: Dipoles Teleconnections are recurring long distance patterns of climate anomalies. Typically, teleconnections.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.
Graph clustering to detect network modules
Structural Properties of Networks: Introduction
Structural Properties of Networks: Introduction
Gephi Gephi is a tool for exploring and understanding graphs. Like Photoshop (but for graphs), the user interacts with the representation, manipulate the.
Graph Analysis by Persistent Homology
Generative Model To Construct Blog and Post Networks In Blogosphere
Section 7.12: Similarity By: Ralucca Gera, NPS.
#VisualHashtags Visual Summarization of Social Media Events using Mid-Level Visual Elements Sonal Goel (IIIT-Delhi), Sarthak Ahuja (IBM Research, India),
Peer-to-Peer and Social Networks Fall 2017
Department of Computer Science University of York
Modelling and Searching Networks Lecture 2 – Complex Networks
Presentation transcript:

Co-evolution of network structure and content Lada Adamic School of Information & Center for the Study of Complex Systems University of Michigan

Outline  Co-evolution of network structure and content  Can the structure of Twitter and virtual world interactions reveal something about their content?   Can the structure of a commodity futures trading network reveal something about information flowing into the market? 

3 What is the relationship between network structure and information diffusion?

Is information flowing over the network? Or is information shaping the network?

Can the shape of the network reveal properties of information  Big news! Giant microbes!

Can the shape of the network reveal properties of information  Little news. How’s the weather?

Related work on time evolving graphs  Densification over time (Leskovec et al. 2005)  Community structure over time (Leicht et al. 2007, Mucha et al. 2010)  Change in structure (ability to “compress” network) signals events (Graphscope by Sun et al. 2007)  Disease propagation & timing (Moody 2002, Liljeros 2010)  Enron (B. Aven, 2011)

What’s different here  We look at network dynamics at relatively short time scales and construct time series  A range of network metrics, instead of just community structure  Information novelty and diversity as opposed to tracking single events / pieces of information

Can the network reveal…  If everyone is talking about the same thing, or if there is just background chatter.  If what they are talking about is novel?

1 st context: virtual worlds  Networks: asset transfers (gestures, landmarks) and transactions (e.g. rent, object purchases)  Content: assets being transferred 10

Study transfers in the context of 100 groups with highest numbers of transfers 11

Second context: Twitter Network  microblogging : < 140 characters / tweet  Network links read from tweets  Reply or mention: by putting in front of the username  Retweet : repeat something someone else wrote on twitter, preceded by the letters RT in front of their username

Selecting Twitter communities to track   For each “researcher” gather tweets of accounts they follow

Highly dynamic networks  Segmentation:  Twitter: every 800 tweets  median segment duration 1.5 days  SecondLife: every 50 asset transfers  median segment duration 8.4 days % of edges repeated Segments elapsed

Conductance: capturing potential for information flow A B A B A B low conductance medium conductance high conductance  Temporal conductance (summed over all pairs):  High if pairs of nodes share edges, or many short, indirect paths Koren, North, Volinsky, KDD, 2006

Network expectedness  Define expectedness:  Average conductance of all neighbor pairs at time t,  based on conductance of pair at time t-1 16 expected unexpected

Conductan ce and expectedn ess as a toy network evolves d network configuratio n at t = 0 possible configuratio ns at t = 1 conductance = 4 expectedness = 1.5 edge jaccard = 1 conductance = 4.5 expectedness = edge jaccard = conductance = 6 expectedness = 0.5 edge jaccard = 0.25

SecondLife: network structure and content overlap t,t+1 overlap t-1,t  diversity t, (t+1) standard network metrics are not indicative of information properties conductance and expectedness are  diversity t-1, t

Conductance & diversity of information  High conductance brings higher content diversity  Repeat network patterns bring less diversity and less novelty  but… similarity and novelty are positively correlated (  = 0.19) Social and transaction network of top sellers in SL

Twitter: textual diversity and novelty  Semantic metrics Metric TypeComputation Methods Contemporary Metrics (average cosine similarity of words in Tweets) between connected node pairs in the graph between indirectly-connected node pairs, i.e., non-neighbors with an undirected path of length > 1 between them between isolated pairs (in different components) Novelty Metric (Language Model distance) between two sets of tweets associated with Twitter networks captured at different times

Twitter: network structure and information diversity network structure content similarity

Inferring Network Semantic Information  Question: Does the network structural information help to improve the prediction performance of the characteristics of information exchanged? Kernel Regression Prediction Model Semantic variables Topological variables Semantic variables

Example: Inferring the average similarity score between isolated pairs  Don’t need to use other textual variables (e.g. similarity between indirectly connected pairs) when sufficient topological information available  Reason: topological variables account for much of the pattern in the text! The input variables of curve c i start from X i and increase each time by adding the variable labeled on x-axis.

Network structure and information novelty  Greater novelty in edges corresponds to greater novelty in content shared  For nodes that are interacting (citing or being cited):  Higher conductance and expectedness correlates with less information novelty

Information in trading networks  CFTC = Commodity futures trading commission  stated mission: protect market users and the public from fraud, manipulation, and abusive practices  futures contracts started out as contracts for agricultural products, but expanded to more exotic contracts, including index futures Collaboration with Celso Brunetti, Jeff Harris, and Andrei Kirilenko

Data  6.3 million transactions in Aug in the Sept. E-mini S&P futures contract  price discovery for the index occurs mostly in this contract (Hasbrouck (2003))  data includes: date & time, executing broker, opposite broker, buy or sell, price, quantity  sample in transaction windows of 240 transactions executing brokeropposite broker quantity: 10 price: $171.25

matching algorithm  limit order book 27 buy 30 contracts at $ sell 10 contracts at $ sell 20 contracts at $ sell 5 contracts at $ buy 20 contracts at $ buy 50 contracts at $ buy 30 contracts at $ buy 20 contracts at $171.50

not social, not intentional, not persistent 28

Financial variables Rate of return: Last price to first price in logs (close-to-open) Volatility: Range – log difference between max and min price Duration: Total period duration - time in seconds between the start and end of each sampling period Proxy for arrival of new information Volume: Trading volume – number of contracts traded

What can we learn from network structure? e.g. centralization? low in-centralizationhigh in-centralization 30 low indegree high indegree high outdegree low outdegree

overview of network variables  # nodes, # edges  clustering coefficient, LSCC, reciprocity  CEN = gini in-degree – gini out-degree  INOUT =  (indegree of node, outdegree of same node)  AI (asymmetric information) 31

Correlations between network and financial variables High Centralization: market dominance - a dominant trader buys from many small sellers – low duration, low volume

Negative assortativity: large sellers sell to small buyers and vice versa – low duration, higher volume Correlations between network and financial variables

High av. degree & largest strongly connected component: no news - many buyers and sellers – high duration, high volume Correlations between network and financial variables

Rate of return: positive correlation with centralization Volatility & duration: correlated with standard deviation of degree, average deg. and the total number of edges (E). Volume: Correlated with a few network variables, sign varies.

Conclusion  Network structure alone is revealing of the diversity and novelty information content being transmitted  Results depend on the scope and relative position of the activity in the network

Future work  Sensitivity to inclusion of non-interactive or across-community interactions  Applying novelty & conductance metrics to financial time series  Continuous formulation of novelty and other network metrics (because segmentation is problematic)  Roles of individual nodes  Thanks:  Edwin Teng Liuling Gong Avishay Livne  Information network academic research center

Questions?