Artificial Intelligence Lecture No. 28

Slides:

Advertisements

Similar presentations

Learning in Neural and Belief Networks - Feed Forward Neural Network 2001 년 3 월 28 일 안순길.

Advertisements

G53MLE | Machine Learning | Dr Guoping Qiu

NEURAL NETWORKS Perceptron

Neural Network I Week 7 1. Team Homework Assignment #9 Read pp. 327 – 334 and the Week 7 slide. Design a neural network for XOR (Exclusive OR) Explore.

Artificial Neural Network

Data Mining Classification: Alternative Techniques

Data Mining Classification: Alternative Techniques

PERCEPTRON. Chapter 3: The Basic Neuron  The structure of the brain can be viewed as a highly interconnected network of relatively simple processing.

CES 514 – Data Mining Lecture 8 classification (contd…)

Artificial Neural Networks Artificial Neural Networks are (among other things) another technique for supervised learning k-Nearest Neighbor Decision Tree.

Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.

Machine Learning. Learning agent Any other agent.

Artificial Intelligence Lecture No. 28 Dr. Asad Ali Safi Assistant Professor, Department of Computer Science, COMSATS Institute of Information Technology.

Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.

Artificial Neural Network Yalong Li Some slides are from _24_2011_ann.pdf.

Artificial Neural Networks. Applied Problems: Image, Sound, and Pattern recognition Decision making  Knowledge discovery  Context-Dependent Analysis.

1 Machine Learning The Perceptron. 2 Heuristic Search Knowledge Based Systems (KBS) Genetic Algorithms (GAs)

NEURAL NETWORKS FOR DATA MINING

Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.

LINEAR CLASSIFICATION. Biological inspirations  Some numbers…  The human brain contains about 10 billion nerve cells ( neurons )  Each neuron is connected.

Artificial Intelligence Lecture No. 29 Dr. Asad Ali Safi Assistant Professor, Department of Computer Science, COMSATS Institute of Information Technology.

1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.

Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.6: Linear Models Rodney Nielsen Many of.

Introduction to Artificial Intelligence (G51IAI) Dr Rong Qu Neural Networks.

Non-Bayes classifiers. Linear discriminants, neural networks.

Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.

Dr.Abeer Mahmoud ARTIFICIAL INTELLIGENCE (CS 461D) Dr. Abeer Mahmoud Computer science Department Princess Nora University Faculty of Computer & Information.

1 Perceptron as one Type of Linear Discriminants IntroductionIntroduction Design of Primitive UnitsDesign of Primitive Units PerceptronsPerceptrons.

Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.

1 Azhari, Dr Computer Science UGM. Human brain is a densely interconnected network of approximately neurons, each connected to, on average, 10 4.

Learning with Neural Networks Artificial Intelligence CMSC February 19, 2002.

Today’s Lecture Neural networks Training

Neural networks and support vector machines

CS 388: Natural Language Processing: Neural Networks

Artificial Neural Networks

Fall 2004 Perceptron CS478 - Machine Learning.

Other Classification Models: Neural Network

Learning with Perceptrons and Neural Networks

Advanced information retreival

Artificial Intelligence (CS 370D)

Artificial neural networks:

Machine Learning Neural Networks.

Other Classification Models: Neural Network

Real Neurons Cell structures Cell body Dendrites Axon

CS344: Introduction to Artificial Intelligence (associated lab: CS386)

Classification with Perceptrons Reading:

with Daniel L. Silver, Ph.D. Christian Frey, BBA April 11-12, 2017

Artificial Neural Networks

Machine Learning. Support Vector Machines A Support Vector Machine (SVM) can be imagined as a surface that creates a boundary between points of data.

Simple learning in connectionist networks

Machine Learning Today: Reading: Maria Florina Balcan

Perceptrons for Dummies

Chapter 3. Artificial Neural Networks - Introduction -

OVERVIEW OF BIOLOGICAL NEURONS

CSE (c) S. Tanimoto, 2004 Neural Networks

Perceptron as one Type of Linear Discriminants

Machine Learning. Support Vector Machines A Support Vector Machine (SVM) can be imagined as a surface that creates a boundary between points of data.

Machine Learning. Support Vector Machines A Support Vector Machine (SVM) can be imagined as a surface that creates a boundary between points of data.

CSE (c) S. Tanimoto, 2001 Neural Networks

CSE (c) S. Tanimoto, 2002 Neural Networks

Artificial neurons Nisheeth 10th January 2019.

Artificial Intelligence 12. Two Layer ANNs

CS 621 Artificial Intelligence Lecture /10/05 Prof

Hubs and Authorities & Learning: Perceptrons

Neuro-Computing Lecture 2 Single-Layer Perceptrons

ARTIFICIAL NEURAL networks.

A task of induction to find patterns

CSE (c) S. Tanimoto, 2007 Neural Nets

Introduction to Neural Network

A task of induction to find patterns

Presentation transcript:

Artificial Intelligence Lecture No. 28

Summary of Previous Lecture Machine learning Machine learning / Data mining Algorithm types

Today’s Lecture Supervised Artificial Neural Networks Perceptrons

Supervised learning In Supervised learning a task is to give data to a function from labeled training data. The training data consist of a set of training examples. In supervised learning, each example is a pair consisting of an input object and a desired output value. A supervised learning algorithm analyzes the training data and produces an inferred function, which can be used for mapping new examples. An optimal scenario will allow for the algorithm to correctly determine the class labels for unseen instances. This requires the learning algorithm to generalize from the training data to unseen situations.

Factors to consider Factors to consider when choosing and applying a learning algorithm are: Heterogeneity of the data Input features be numerical and scaled to similar ranges Support Vector Machines, linear regression, logistic regression, neural networks, and nearest neighbor methods, Redundancy in the data Input features contain redundant information some learning algorithms (e.g., linear regression, logistic regression, and distance based methods) will perform poorly because of numerical instabilities.

Factors to consider… Presence of interactions and non-linear If each of the features makes an independent contribution to the output, then algorithms based on linear functions (e.g., linear regression, logistic regression, Support Vector Machines, naive Bayes) and distance functions (e.g., nearest neighbor methods, support vector machines with Gaussian kernels) generally perform well. However, if there are complex interactions among features, then algorithms such as decision trees and neural networks work better, because they are specifically designed to discover these interactions.

Artificial Neural Networks Artificial neural network (ANN) is a machine learning approach that models human brain and consists of a number of artificial neurons. Neuron in ANNs tend to have fewer connections than biological neurons.

Neural Networks A large number of very simple neuron like processing elements A large number of weighted connections between the elements Highly parallel, distributed control An emphasis on learning internal representations automatically

Why Neural Nets? Solving problems under the constraints similar to those of the brain may lead to solutions to AI problems that might otherwise be overlooked. Individual neurons operate relatively slowly, but make up for that with massive parallelism.

The Parts of a Neuron

How it Works Each neuron has branching from it a number of small fibers called dendrites and a single long fiber, the axon.

How it Works The axon eventually splits and ends in a number of synapses which connect the axon to the dendrites of other neurons.

How it Works Communication between neurons occurs along these paths. When the electric potential in a neuron rises above a threshold, the neuron activates.

How it Works The neuron sends the electrical impulse down the axon to the synapses.

How it Works A synapse can either add to the electrical potential or subtract from the electrical potential.

How it Works The pulse then enters the connected neuron’s dendrites, and the process begins again.

Portion of a network: two interconnected cells. Signals can be transmitted unchanged or they can be altered by synapses. A synapse is able to increase or decrease the strength of the connection from the neuron to neuron. This is where information is stored. The information processing abilities of biological neural systems must follow from highly parallel processes operating on representations that are distributed over many neurons. One motivation for ANN is to capture this kind of highly parallel computation based on distributed representations.

Warren and Walter, 1943 Modern era of neural networks starts in the 1940’s, when Warren and Walter (a mathematician) explored the computational capabilities of networks made of very simple neurons A Warren and Walter network fires if the sum of its excited inputs exceeds its threshold, as long as it does not receive an out of scope Input Using a network of such neurons, they showed that it was possible to construct any logical function

Neural network representation An ANN is composed of processing elements called perceptrons, organized in different ways to form the network’s structure. Processing Elements An ANN consists of perceptrons. Each of the perceptrons receives inputs, processes inputs and delivers a single output. The input can be raw input data or the output of other perceptrons. The output can be the final result (e.g. 1 means yes, 0 means no) or it can be inputs to other perceptrons.

Perceptrons A perceptron takes a vector of real-valued inputs, calculates a linear combination of these inputs, then outputs a 1 if the result is greater than some threshold –1 otherwise. Given real-valued inputs x1 through xn, the output o(x1, …, xn) computed by the perceptron is o(x1, …, xn) = 1 if w0 + w1x1 + … + wnxn > 0 -1 otherwise where wi is a real-valued constant, or weight. Notice the quantify (-w0) is a threshold that the weighted combination of inputs w1x1 + … + wnxn must surpass in order for perceptron to output a 1.

To simplify notation, we imagine an additional constant input called Bias value x0 = 1, allowing us to write the above inequality as n i=0 wixi >0 Learning a perceptron involves choosing values for the weights w0, w1,…, wn. b (bias)

Representation Power of Perceptrons We can view the perceptron as representing a hyperplane decision surface in the n-dimensional space of instances (i.e. points). The perceptron outputs a 1 for instances lying on one side of the hyperplane and outputs a –1 for instances lying on the other side, as in , Some sets of positive and negative examples cannot be separated by any hyperplane. Those that can be separated are called linearly separated set of examples.

{ Perceptron Training i=0 Linear threshold is used. W - weight value 1 if  wi xi >t Output= -1 otherwise { i=0 Linear threshold is used. W - weight value t - threshold value

{ Simple network 1 if  wi xi >t output= i=0 -1 otherwise W = 1.5 X Y X W = 1.5 W = 1

Training Perceptrons What are the weight values? y x W = ? For AND A B Output 0 0 0 0 1 0 1 0 0 1 1 1 What are the weight values? Initialize with random weight values

Training Perceptrons Decision hyperplane : w0 + w1 x1 + w2 x2 = 0 For AND X Y Output 0 0 0 0 1 0 1 0 0 1 1 1 Decision hyperplane : w0 + w1 x1 + w2 x2 = 0 -0.8 + 0.5 x1 + 0.5 x2 = 0

Training Perceptrons Decision hyperplane : w0 + w1 x1 + w2 x2 = 0 For OR X Y Output 0 0 0 0 1 1 1 0 1 1 1 1 1 Decision hyperplane : w0 + w1 x1 + w2 x2 = 0 -0.3 + 0.5 x1 + 0.5 x2 = 0

Summery of Today’s Lecture Supervised Artificial Neural Networks Perceptrons