Download presentation
Presentation is loading. Please wait.
Published byAdam Rich Modified over 9 years ago
1
By Sharath Kumar Aitha. Instructor: Dr. Dongchul Kim.
2
Introduction. Logistic Regression. Formula Conditional Density Function. Maximum log like hood Pseudo code Classification Output Application Conclusion
3
To analyze the historical data of online shop customer behaviors by logistic predictive modeling and predicts whether the customers purchase a tablet pc online shop. Analyzes data by datamining and establishes predictive modeling based on logistic regression.
4
This is the most commonly used technology used for developing predictive modeling. Its function is to look for a variety of equations which reflects the customer behavior patterns according to the observations of X and Y. There are two types of regressions they are: 1.Ordinary least square Regression(OLS) 2.Logistic Regression.
5
If the variable Y is continuous we usually use OLS For example if variable Y is the sales volume of tablet PCs and the distribution of these observations is more beautiful, we can use OLS. But the requirements of OLS of Y are very harsh and difficulty, so we use logistic regression.
6
Logistic regression is a statistical method for analyzing a dataset in which there are one or more independent variables that determine an outcome. This is a unsupervised function where we need to provide the input values.
7
If we have feature data t i ∈ℜ and label data y i ∈ {0,1},i = 1,2,···,n. corresponding to two classes. we can use logistic regression to solve this by doing regression between y and t on logistic function H(t)=1/1+e -(at-b) T is the input. E is the exponential A is the coefficient to adjust the slope. B is the parameter.
8
Def: Suppose X and Y are continuous random variables with joint probability density function f(x,y) and marginal probability density functions f X (x) and f Y (y), respectively. Then, the conditional probability density function of Y given X = x is defined as: provided f X (x) > 0. Formula: conditional density function p(y/t): P(y/t)=h(t) y (1-h(t)) (1-y) Which can also be written as P(y/t)=(1/1+e -(at-b) ) y (1/1+e (at-b) ) (1-y)
9
This is a statistical method used for estimating the co- efficients of a model. The likelihood function measures the probability of observing the particular set of dependent variable values that occurs in the sample.
10
Algorithm 1 One dimension logistic regression Input: t; y Output: a; b 1: Initialize a and b 2: repeat 3: a = a + τ ∂ℓ/∂a 4: b = b + τ ∂ℓ/∂b 5: until convergence of a and b
11
∂ℓ/∂a ∂ℓ/∂b
12
For a testing data given feature t or x, by Eq.(1), we get h, it is the probability of y=1. so the discriminant function is:
14
Design and implementation:
18
Logistic Regression modeling can be used in predicting the possibility of a customer to buy a commodity. This is also helpful for the online shop owner to make the decision.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.