Machine Learning: best practices and vulnerabilities. Sebastiano Galazzo, Microsoft MVP A.I. Category
Sebastiano Galazzo Microsoft MVP @galazzoseba sebastiano.galazzo@gmail.com https://it.linkedin.com/in/sebastianogalazzo
Best practices
The perceptron
In machine learning, the perceptron is a binary classifier: a function that decides whether or not an input, represented by a vector of numbers, belongs to some specific class.

$f(x) = \chi(\langle w, x \rangle + b)$

where $w$ is a vector of real-valued weights, $\langle \cdot , \cdot \rangle$ is the scalar product, $b$ is the bias (a constant not tied to any input value), and $\chi(y)$ is the output function.
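As a minimal sketch of that definition (the weights, bias and input are made-up values), the decision rule fits in a few lines of Python:

```python
import numpy as np

def perceptron(x, w, b):
    """Binary perceptron: chi is a simple step function over <w, x> + b."""
    activation = np.dot(w, x) + b      # scalar product plus bias
    return 1 if activation >= 0 else 0

# Illustrative values only
x = np.array([0.8, 0.3])
w = np.array([1.0, -2.0])
b = 0.5
print(perceptron(x, w, b))             # -> 1, because 0.8 - 0.6 + 0.5 = 0.7 >= 0
```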
Main evolutions
Easy way: Logistic Regression, Support Vector Machine
Pro: easy and fast to use
Cons: low accuracy (compared to neural networks)
Hard way: Neural Networks
Pro: if you reach convergence you get very high accuracy (state of the art)
Cons: very difficult to model, a lot of experience is required
Easy way
Pseudo equation: $x\alpha + y\beta + c\delta + \ldots + z\omega \rightarrow (0, 1)$
#logisticregression #svm
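A hedged sketch of the easy way with scikit-learn; the features, labels and the new customer are illustrative, not taken from the talk:

```python
from sklearn.linear_model import LogisticRegression

X = [[30, 38000], [39, 42000], [24, 39000], [51, 71000]]   # e.g. age, income
y = [0, 1, 0, 1]                                           # hypothetical labels: 0 = Democrat, 1 = Republican
clf = LogisticRegression(max_iter=1000).fit(X, y)

print(clf.predict([[35, 40000]]))        # predicted class for a new customer
print(clf.predict_proba([[35, 40000]]))  # output squashed into (0, 1), as in the pseudo equation
```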
Hard way #neuralnetwork
Advanced modelling of Neural Networks
Use case: predict a customer's willingness to vote for a political party

Age | Gender            | Income | City          | Political party
30  | Male              | 38,000 | New York      | Democrat
39  | Female            | 42,000 | Page          | Republican
24  | Other             | 39,000 | San Francisco |
51  | Prefer not to say | 71,000 | Seattle       |
Advanced modelling of Neural Networks
A naive one-hot encoding of the same table turns every column into a group of binary inputs:
Age buckets: [0-17] [18-24] [25-35] [36-45] [46-60] [>60]
Gender: [male] [female] ...
City: [urban] [rural] [suburban] ...
Political party: [Democrat] [Republican]
> 20 parameters
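A minimal sketch of that one-hot blow-up with pandas (the bucketed columns and values are illustrative):

```python
import pandas as pd

df = pd.DataFrame({
    "age_bucket": ["25-35", "36-45", "18-24", "46-60"],
    "gender": ["Male", "Female", "Other", "Prefer not to say"],
    "city": ["New York", "Page", "San Francisco", "Seattle"],
})
encoded = pd.get_dummies(df)   # one binary column per distinct value
print(encoded.shape[1])        # already 12 columns, from only 3 source columns
```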
Advanced modelling of Neural Networks
A more compact encoding of the same table:
Age: divided by 100 (range [0, 100] mapped to [0, 1])
Gender: divided by 4 (0 = Male, 0.25 = Female, 0.5 = Other, 0.75 = Prefer not to say, 1 = Unknown)
Income: buckets [<20,000] [21,000-30,000] [31,000-40,000] [41,000-60,000] ...
Example encoded values from the slide: 0.42, 1, 0.5, [0.25]
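A small sketch of that compact encoding; the mapping values come from the slide, the function name is mine:

```python
# Gender codes from the slide: index / 4
GENDER_CODE = {"Male": 0.0, "Female": 0.25, "Other": 0.5,
               "Prefer not to say": 0.75, "Unknown": 1.0}

def encode_naive(age, gender):
    """Age squeezed from [0, 100] into [0, 1]; gender mapped to a fixed code."""
    return [age / 100.0, GENDER_CODE.get(gender, 1.0)]

print(encode_naive(42, "Female"))   # -> [0.42, 0.25]
```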
Advanced modelling of Neural Networks
Method: 1-of-(C-1) effects-coding. Standard deviation:

$\sigma = \sqrt{\frac{1}{N}\sum_{i=1}^{N}(x_i - \mu)^2}$

where $\mu$ is the average of all values.
Advanced modelling of Neural Networks
age mean = (30 + 36 + 52 + 42) / 4 = 40.0

$\sigma = \sqrt{\frac{(30-40)^2 + (36-40)^2 + (52-40)^2 + (42-40)^2}{4}} = 8.12$
Advanced modelling of Neural Networks

$V' = \frac{V - \text{mean}}{\text{std dev}}$

$V'$ will be used as input in place of the original value. Given that the age average is 40.0, the standard deviation is 8.12, and our current value is 30.0:

$(30 - 40) / 8.12 = -1.23$
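The same standardization as a short NumPy sketch (ages taken from the worked example above):

```python
import numpy as np

ages = np.array([30.0, 36.0, 52.0, 42.0])
mean, std = ages.mean(), ages.std()    # 40.0 and ~8.12 (population standard deviation)
v_prime = (30.0 - mean) / std          # replaces the raw age 30 as network input
print(round(v_prime, 2))               # -> -1.23
```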
Advanced modelling of Neural Networks
One of the parameters: Italian cities (about 8,000)
Milano, Torino, Roma, ..., Catania
Binary compression: $2^{13} = 8192$

City    | Value
Milano  | 0,0,0,0,0,0,0,0,0,0,0,0,0,0
Torino  | 0,0,0,0,1,1,0,0,0,0,0,1,0,0
Catania | 0,1,0,0,1,0,0,0,0,1,0,1,1,0

With 13 nodes we can map 8192 values (having the same meaning/context).
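A possible sketch of that binary compression; the city ids are hypothetical, any stable index over the ~8,000 names works:

```python
def city_to_bits(city, city_index, n_bits=13):
    """Map a city to its 13-bit code: 2**13 = 8192 possible values."""
    idx = city_index[city]
    return [(idx >> i) & 1 for i in reversed(range(n_bits))]

cities = ["Milano", "Torino", "Roma", "Catania"]          # ... up to ~8,000 entries
city_index = {name: i for i, name in enumerate(cities)}
print(city_to_bits("Torino", city_index))                 # -> [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1]
```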
Advanced modelling of Neural Networks
Encoded example row: -1.23, 1, 3.4, [0.25], [0.3]. Only 5 parameters!
The model has a 1:1 mapping ratio between concepts and the number of neurons. With so few parameters, the problem could even be handled without a neural network, by an IF/THEN sequence in the code.
Advanced modelling of Neural Networks
Data must be manipulated and made understandable to the machine, not to humans!
Vulnerabilities
Vulnerabilities Let's imagine that we run an auction website like eBay. On our website, we want to prevent people from selling prohibited items. Enforcing these kinds of rules is hard if you have millions of users. We could hire hundreds of people to review every auction listing by hand, but that would be expensive.
Vulnerabilities Instead, we can use deep learning to automatically check auction photos for prohibited items and flag the ones that violate the rules. This is a typical image classification problem.
Vulnerabilities – Image Classification We repeat this thousands of times with thousands of photos until the model reliably produces the correct results with an acceptable accuracy.
Vulnerabilities - Convolutional neural networks Convolutional neural networks are powerful models that consider the entire image when classifying it. They can recognize complex shapes and patterns no matter where they appear in the image. In many image recognition tasks, they can equal or even beat human performance.
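For reference, a hedged sketch of what such a convolutional classifier could look like in Keras (layer sizes, input shape and the two-class output are illustrative):

```python
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Conv2D(32, (3, 3), activation="relu", input_shape=(128, 128, 3)),  # learn local patterns
    layers.MaxPooling2D((2, 2)),                                              # tolerate shifts in position
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(2, activation="softmax"),    # "allowed" vs "prohibited"
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
```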
Vulnerabilities - Convolutional neural networks With a fancy model like that, changing a few pixels in the image to be darker or lighter shouldn't have a big effect on the final prediction, right? Sure, it might change the final likelihood slightly, but it shouldn't flip an image from "prohibited" to "allowed". Those are the expectations.
Vulnerabilities - Convolutional neural networks It was discovered that this isn’t always true
Vulnerabilities - Convolutional neural networks If you know exactly which pixels to change and exactly how much to change them, you can intentionally force the neural network to predict the wrong output for a given picture without changing the appearance of the picture very much. That means we can intentionally craft a picture that is clearly a prohibited item but which completely fools our neural network
Vulnerabilities - Convolutional neural networks Why is this?
Vulnerabilities - Convolutional neural networks A machine learning classifier works by finding a dividing line between the things it’s trying to tell apart. Here’s how that looks on a graph for a simple two-dimensional classifier that’s learned to separate green points (acceptable) from red points (prohibited) Right now, the classifier works with 100% accuracy. It’s found a line that perfectly separates all the green points from the red points.
Vulnerabilities - Convolutional neural networks But what if we want to trick it into mis-classifying one of the red points as a green point? What's the minimum amount we could move a red point to push it into green territory? If we add a small amount to the Y value of a red point right beside the boundary, we can just barely push it over into green territory.
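A toy sketch of that minimum nudge on a two-dimensional classifier (the points and the step size are invented for illustration):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

green = np.array([[1.0, 3.0], [2.0, 3.5], [1.5, 4.0]])   # acceptable (high Y)
red   = np.array([[1.0, 1.0], [2.0, 0.5], [1.5, 1.5]])   # prohibited (low Y)
X, y = np.vstack([green, red]), np.array([0, 0, 0, 1, 1, 1])
clf = LogisticRegression().fit(X, y)

point, nudge = red[2].copy(), 0.0                    # a red point close to the boundary
while clf.predict([point + [0.0, nudge]])[0] == 1:   # still classified as prohibited?
    nudge += 0.05                                    # push its Y value a little further
print(f"flips to 'acceptable' after adding {nudge:.2f} to Y")
```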
Vulnerabilities - Convolutional neural networks In image classification with deep neural networks, each "point" we are classifying is an entire image made up of thousands of pixels. That gives us thousands of possible values that we can tweak to push the point over the decision line. If we make sure that we tweak the pixels in the image in a way that isn't too obvious to a human, we can fool the classifier without making the image look manipulated.
Vulnerabilities - Convolutional neural networks
(Image: a photo of people + a crafted perturbation = classified as "Squirrel")
Perturbation of the math model (sequence of figures)
Vulnerabilities – The steps
1. Feed in the photo that we want to hack.
2. Check the neural network's prediction and see how far off the image is from the answer we want to get for this photo.
3. Tweak our photo using back-propagation to make the final prediction slightly closer to the answer we want.
4. Repeat steps 1-3 a few thousand times with the same photo until the network gives us the answer we want.
Vulnerabilities – Snippet of a Python script using Keras
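The original snippet was shown as an image; what follows is a hedged reconstruction of the same loop in Keras / TensorFlow 2. The pre-trained InceptionV3 model, the target class index, the step size and the perturbation budget are illustrative assumptions, and `original` is a placeholder for the real auction photo.

```python
import tensorflow as tf
from tensorflow.keras.applications import inception_v3

model = inception_v3.InceptionV3()                 # any pre-trained image classifier works
target_class = 859                                 # illustrative: the class we want to force

# Placeholder for the auction photo, already preprocessed to InceptionV3's input
# (299x299 RGB, pixel values scaled to [-1, 1]).
original = tf.random.uniform((1, 299, 299, 3), minval=-1.0, maxval=1.0)
hacked = tf.Variable(original)

for _ in range(1000):                              # step 4: repeat a few thousand times
    with tf.GradientTape() as tape:
        preds = model(hacked)                      # step 2: check the prediction
        target_likelihood = preds[0, target_class]
    grad = tape.gradient(target_likelihood, hacked)    # step 3: back-propagate to the pixels
    hacked.assign_add(0.01 * tf.sign(grad))            # nudge pixels toward the target class
    hacked.assign(tf.clip_by_value(hacked,             # keep the change invisible to a human
                                   original - 0.05, original + 0.05))
    if tf.argmax(preds[0]) == target_class:        # stop once the network gives the answer we want
        break
```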
How can we protect ourselves against these attacks? Simply create lots of hacked images and include them in your training data set going forward; that seems to make your neural network more resistant to these attacks. This is called Adversarial Training and is probably the most reasonable defense to consider adopting right now.
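A hedged end-to-end sketch of Adversarial Training on toy data; the tiny network, the random stand-in dataset and the single-step gradient attack are all illustrative simplifications:

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, models

# Toy stand-ins for the real training set (random data, illustrative only).
x_train = np.random.rand(64, 32, 32, 3).astype("float32")
y_train = np.random.randint(0, 2, size=64)

model = models.Sequential([
    layers.Conv2D(8, 3, activation="relu", input_shape=(32, 32, 3)),
    layers.Flatten(),
    layers.Dense(2, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
model.fit(x_train, y_train, epochs=1, verbose=0)          # normal training first

# Generate hacked copies of the training images (single-step gradient attack as a stand-in).
loss_fn = tf.keras.losses.SparseCategoricalCrossentropy()
images = tf.constant(x_train)
with tf.GradientTape() as tape:
    tape.watch(images)
    loss = loss_fn(y_train, model(images))
grad = tape.gradient(loss, images)
x_adv = np.clip(x_train + 0.03 * np.sign(grad.numpy()), 0.0, 1.0)

# Adversarial Training: keep training on the clean + hacked mix with the true labels.
x_mixed = np.concatenate([x_train, x_adv])
y_mixed = np.concatenate([y_train, y_train])
model.fit(x_mixed, y_mixed, epochs=1, verbose=0)
```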
How can we protect ourselves against these attacks? Pretty much every other idea researchers have tried so far has failed to be helpful in preventing these attacks.
Thanks! Sebastiano Galazzo Microsoft MVP @galazzoseba sebastiano.galazzo@gmail.com https://it.linkedin.com/in/sebastianogalazzo