Neural Networks Pt 2
Mihir Patel and Nikhil Sardana
Synopsis
Data Techniques
- Expanding datasets
- Weight initialization: draw weights from a narrow Gaussian
- Batching
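A minimal NumPy sketch of two of these techniques, initializing weights from a narrow Gaussian and splitting the training set into mini-batches; the function names and the 1/sqrt(n_in) standard deviation are illustrative choices, not taken from the slides.

import numpy as np

def init_weights(n_in, n_out):
    # Narrow Gaussian: a small standard deviation keeps the initial
    # weighted sums near zero, away from saturated activation regions.
    return np.random.normal(0.0, 1.0 / np.sqrt(n_in), size=(n_in, n_out))

def make_batches(X, y, batch_size=32):
    # Batching: shuffle once, then yield fixed-size mini-batches so each
    # gradient step averages over a small group of examples.
    idx = np.random.permutation(len(X))
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]
        yield X[batch], y[batch]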
Learning Rate
- Step size for the weight updates made during backpropagation
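To make the idea concrete, a one-line sketch of a gradient-descent update, where the learning rate lr (an assumed name) scales the gradient that backpropagation computes; too large a value overshoots minima, too small a value makes learning slow.

def sgd_step(weights, gradient, lr=0.01):
    # Move the weights a small step against the gradient.
    return weights - lr * gradient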
Activation Functions: tanh
y = (e^x - e^{-x}) / (e^x + e^{-x})
Benefits:
- Greater derivative than the sigmoid = faster learning
- Output range of (-1, 1) instead of (0, 1) prevents stagnation
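The "greater derivative" claim can be checked directly: d/dx tanh(x) = 1 - tanh(x)^2, which peaks at 1 (at x = 0), while the sigmoid derivative peaks at 0.25, so gradients flowing back through tanh units can be up to four times larger.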
Cost Function: Cross-Entropy
Needs for a cost function:
- Approaches 0 when the output is correct
- Always has a positive sign
Benefit of cross-entropy:
- Avoids saturation: the gradient does not go to 0 when the output is incorrect, so saturated neurons keep learning
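For reference, the standard binary cross-entropy cost over n training examples, with target y and network output a, is C = -(1/n) Σ [ y ln(a) + (1 - y) ln(1 - a) ]. Each term is non-negative and approaches 0 as a approaches y, which meets the needs above, and unlike the quadratic cost its gradient does not vanish when a saturated neuron is badly wrong.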
Overfitting
- Over-matching the training dataset, so the network fails to generalize to new data
Solutions to Overfitting
- Weight decay: punish larger weights
- Dropout: randomly disable neurons during training
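A minimal NumPy sketch of both ideas, assuming an L2 penalty for weight decay and an inverted-dropout mask; the parameter names lambda_ and keep_prob are illustrative.

import numpy as np

def step_with_weight_decay(weights, gradient, lr=0.01, lambda_=1e-4):
    # Weight decay: the L2 penalty adds lambda_ * weights to the gradient,
    # shrinking large weights toward zero on every update.
    return weights - lr * (gradient + lambda_ * weights)

def dropout(activations, keep_prob=0.5):
    # Dropout: zero each activation with probability 1 - keep_prob during
    # training, then rescale so the expected activation is unchanged.
    mask = np.random.rand(*activations.shape) < keep_prob
    return activations * mask / keep_prob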