Socio-Digital Influence Attack Models and Deterrence (FA9550-15-1-0003) PI: Tim Weninger (University of Notre Dame; Computer Science and Engineering) AFOSR.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Chapter 3 Histograms Histogram is a summary graph showing a count of the data falling in various ranges. Purpose: To graphically summarize and display.
Chapter 3 Review Two Variable Statistics Veronica Wright Christy Treekhem River Brooks.
University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks Taehyong Kim Computer Science and Engineering State University.
SIA: Secure Information Aggregation in Sensor Networks Bartosz Przydatek, Dawn Song, Adrian Perrig Carnegie Mellon University Carl Hartung CSCI 7143: Secure.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
UNIVERSITY OF JYVÄSKYLÄ Yevgeniy Ivanchenko Yevgeniy Ivanchenko University of Jyväskylä
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Notes  Data are presented as a pair of overlying bars, the outer, wider bar representing the period 1st Oct 2007 to 30th September 2008, and the inner,
Summary of Quantitative Analysis Neuman and Robson Ch. 11
8/10/2015Slide 1 The relationship between two quantitative variables is pictured with a scatterplot. The dependent variable is plotted on the vertical.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Application of SAS®! Enterprise Miner™ in Credit Risk Analytics
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
A Statistical Analysis Example of A Full Functional Utilization of An Engineering Calculator Li-Fei Huang Dept. of App. Statistics.
Research and Statistics AP Psychology. Questions: ► Why do scientists conduct research?  answer answer.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
Quantitative Skills 1: Graphing
Chapter 12 – Probability and Statistics 12.7 – The Normal Distribution.
A Statistical Analysis of Seedlings Planted in the Encampment Forest Association By: Tony Nixon.
Stats/Methods I JEOPARDY. Jeopardy CorrelationRegressionZ-ScoresProbabilitySurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Treatment Learning: Implementation and Application Ying Hu Electrical & Computer Engineering University of British Columbia.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Simple Linear Regression SECTION 9.1 Inference for correlation Inference for.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Computer Science Department, Peking University
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Tracking Critical-Mass Outbreaks in Social Contagions (FA DEF)
Model City Model City A study of the value of simulation and modeling as a tool for urban planners, politicians, and citizens A research proposal submitted.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Classification Ensemble Methods 1
Early Identification of Introductory Major's Biology Students for Inclusion in an Academic Support Program BETHANY V. BOWLING and E. DAVID THOMPSON Department.
SUMMARIZING QUANTITATIVE DATA.
Randy S. Baker Director, Credit Risk Management F&A Committee – Preview for CWG / MCWG October 20, 2009 Potential Credit Risk Model Quarterly Update: Q
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Charts Overview PowerPoint Prepared by Alfred P.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
1 Thinking Critically with Psychology/Research and Experimental Psychology Chapter 1.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
BPS - 5th Ed. Chapter 231 Inference for Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Correlation and Linear Regression
Exploring Group Differences
Statistics Use of mathematics to ORGANIZE, SUMMARIZE and INTERPRET numerical data. Needed to help psychologists draw conclusions.
CHAPTER 12 More About Regression
Disseminating Research Findings Shawn A. Lawrence, PhD, LCSW SOW 3401
Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis
Socio-Digital Influence Attack Models and Deterrence (FA )
CHAPTER 26: Inference for Regression
Basic Statistical Terms
AP Statistics, Section 3.3, Part 1
Department of Computer Science University of York
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
José Vergara, Natsuko Rivera, Román Rossi-Pool, Ranulfo Romo  Neuron 
Annabelle C. Singer, Loren M. Frank  Neuron 
Liu D. Liu, Christopher C. Pack  Neuron 
Volume 32, Issue 1, Pages (October 2001)
Higher National Certificate in Engineering
Module 2 Research Methods
Volume 29, Issue 5, Pages e4 (March 2019)
Presentation transcript:

Socio-Digital Influence Attack Models and Deterrence (FA ) PI: Tim Weninger (University of Notre Dame; Computer Science and Engineering) AFOSR Program Review: T rust & Influence May 11-15, 2015, USAF Academy, CO.

The Idea: Determine the influence and impact that social news and networks have on user behavior. …and find countermeasures against social media manipulation.

Hivemind Experiment What effects does early voting have on later voting? Does quality matter?

Hivemind Experiment Final scores for artificially, randomly up-treated posts, down-treated posts, and scores for untreated posts in the control group are shown. Red inner error bars show the standard error of the mean; black outer error bars show the 95% confidence interval. (a) shows the scores in the heavily skewed full distribution. When the highest 1% of scores are removed as in (b), the score distribution becomes much less skewed resulting in tighter error bounds, which further result in significant increases for up-treated posts and significant decreases for down-treated posts when compared to the control group. Paper under review

Hivemind Experiment Final scores for artificially, randomly up-treated posts, down-treated posts, and scores for untreated posts in the control group separated into their respective treatment delay intervals. Horizontal lines show the overall mean of each treatment group. The top 1% of scores were removed to un-skew the score distribution. These results show that treatment delay had little effect on the mean final score. Paper under review

Hivemind Experiment The middle 9 deciles of up-treated, down-treated and control group posts are shown according to their interval times. These results show that most posts receive a median score of 2 or less, and that the treatment has the most effect in the higher deciles of the score distribution. Paper under review

Hivemind Experiment The probability of a post receiving a corresponding score by treatment type. The inset graph shows the complete probability distribution function. The outer graph shows the probability of a post receiving scores between 500 and 2000 an approximation for trending or frontpage posts. Up-treated posts are 24% more likely to reach 2000 votes than the control group. Paper under review

Hivemind Experiment Mean scores of down-treated, control group and up-treated posts shown with 95% confidence intervals on the top 8 most active subreddits. Paper under review

Hivemind Experiment Update on Vote gathering Reddit won’t allow it, some work already done. I (mistakenly) spoke with Reddit cofounder and interim-CEO, they changed the mechanisms. No experiment like this is possible any longer. Paper under review

Patterns of Natural Human Navigation Model human navigation processes through knowledge networks Use these models to create knowledge repositories better suited for human browsing Paper under review

Patterns of Natural Human Navigation Paper under review Scatterplots comparing path lengths for each path navigation model. Only 500 random points are plotted in each subfigure for efficient document rendering. Slope of linear regression line indicates correlation strength.

Patterns of Natural Human Navigation Paper under review CatPath distance-to-go as a function of (a) PPR, and (b) human path length- to-go with standard error (red) and 95% condence intervals (black)

Patterns of Natural Human Navigation Paper under review Distribution of path lengths for various models. PPR does not result in a path length; rather the PPR score between the source and the destination nodes discretized into 30 equal size bins for comparison-sake.

Publications, Awards, Patents, or Transitions Attributed to the Grant Baoxu Shi and Tim Weninger Mining Interesting Meta-Paths from Complex Heterogeneous Information Networks. International Conference on Data Mining (ICDM) Designing Market of Data, Shenzhen, China, December 14-17, Tim Weninger. An Exploration of Submissions and Discussions in Social News - Mining Collective Intelligence of Reddit. Social Network Analysis and Mining (SNAM), 4:173 (2014). Invited Speaker - Greater Chicago Area System Research Workshop at UIC Forum in Chicago. Apr 2015 Keynote - International Symposium on Occupational Exposure to deliver keynote on Data Science. Jan 2015 Invited Fellow - United States National Academies of Science as Fellow of the National Academies Keck Futures Initiative. Nov Invited Speaker - MetaKnowledge Meeting in Asilomar, CA August Speaker - KDD Conference in New York City August Speaker -WebSci Conference in Bloomington, IN. July 2014

Project Summary Research Objectives: Identify and measure sources of influence corruption and their effect on global outcomes Create models of trust and influence propagation in social media and networks. Key Findings: Vote influence is real and is happening, but effects are not as bad as it could be. Trust and influence may be accurately modeled in networks bidirectionally. Technical Approach: In vivo studies of social media Statistical and Bayesian models of human behavior. Benefits to the wider academic or DoD community: Preliminary measurements of a new type of influence attack. Bidirectional network influence model. Project Start Date: Jan 1, 2015 Project End Date: Dec 31, 2017

Thank you