逻辑回归

Logistic Regression

1 分类 Classification

首先我们来看看使用线性回归来解决分类会出现的问题。下图中，我们加入了一个训练集，产生的新的假设函数使得我们进行分类出现了错误；而且线性回归计算的结果往往会远小于0或者远大于1，这对于0,1分类变得很奇怪。可见线性回归并不适用与分类。下面介绍的逻辑回归的结果总是在[0,1]，适用于分类，其实逻辑回归是一种分类算法。

ML 逻辑回归 Logistic Regression

2 假设函数Hypothesis Representation

逻辑回归假设函数为：

ML 逻辑回归 Logistic Regression

其中 ML 逻辑回归 Logistic Regression 是参数向量，特征向量。该函数叫做Logistic function（也叫做Sigmoid function）。函数图像如下，范围(0,1)

ML 逻辑回归 Logistic Regression

对于分类问题，它可以表示给定特征 ML 逻辑回归 Logistic Regression ，参数属于类别1的概率，那么属于另一类的概率自然也就是。

ML 逻辑回归 Logistic Regression

3 决策边界Decision Boundary

根据逻辑函数的特性可以看出，当 ML 逻辑回归 Logistic Regression ，当

ML 逻辑回归 Logistic Regression

我们把 ML 逻辑回归 Logistic Regression 就称为决策边界（Decision Boundary）。

ML 逻辑回归 Logistic Regression

我们可以通过多项式组合来制定更加复杂的决策边界。

ML 逻辑回归 Logistic Regression

4 代价函数 Cost Function

代价函数 ML 逻辑回归 Logistic Regression ，在线性回归中，我们的，如果在逻辑回归中也使用这个函数，那么代价函数会是一个非凸函数，无法使用梯度下降去求解参数，所以我们要寻找一些函数使得代价函数为凸函数。

ML 逻辑回归 Logistic Regression

我们来看一下这个代价函数：

ML 逻辑回归 Logistic Regression

我们可以将它合并为一个公式：

ML 逻辑回归 Logistic Regression

5 梯度下降 Gradient Desecent

ML 逻辑回归 Logistic Regression

这里我们把代价函数写成这种形式是由最大似然估计得到了，当然也还有其他的形式。

梯度下降算法：

ML 逻辑回归 Logistic Regression

可以看到这里推导出来的公式看起来和线性回归梯度下降中推导出来的公式是一样的，但是要注意 ML 逻辑回归 Logistic Regression 已经是sigmod函数而不是线性公式了，所以他们是两码事。

6 多分类 Multi-class classification: one-vs-all

我们可以降逻辑回归用于多分类问题，假设有K类，我们可以训练出K个分类器，每个分类器，将其中一种类别作为正类，其余的都是负类来训练，然后再预测时，类别属于概率最大的那个类。

ML 逻辑回归 Logistic Regression

7 正则化regularization

7.1 过拟合 overfitting

ML 逻辑回归 Logistic Regression

7.2 代价函数cost function

在代价函数中加入正则项，注意，这里对 ML 逻辑回归 Logistic Regression 计算，而不是从开始。其中是正则项参数，如果太大，那么会趋向于0，使得，导致欠拟合。

ML 逻辑回归 Logistic Regression

7.3 正则化线性回归 Regularized linear regression

代价函数：

ML 逻辑回归 Logistic Regression

Gradeint descent

Repeat{

ML 逻辑回归 Logistic Regression

}

这里 ML 逻辑回归 Logistic Regression 是一个比1小一点点的数。

在线性回归中，我们除了梯度下降，还有正规方程的方法，正规方法加入正则项后：

ML 逻辑回归 Logistic Regression

7.4 正则化逻辑回归Regularized logistic regression

代价函数：

ML 逻辑回归 Logistic Regression

Gradient descent

Repeat{

ML 逻辑回归 Logistic Regression

}

实验代码

正则化逻辑回归

 function [J, grad] = costFunctionReg(theta, X, y, lambda)

 %COSTFUNCTIONREG Compute cost and gradient for logistic regression with regularization

 %   J = COSTFUNCTIONREG(theta, X, y, lambda) computes the cost of using

 %   theta as the parameter for regularized logistic regression and the

 %   gradient of the cost w.r.t. to the parameters. 

 % Initialize some useful values

 m = length(y); % number of training examples

 % You need to return the following variables correctly

 J = ;

 grad = zeros(size(theta));

 % ====================== YOUR CODE HERE ======================

 % Instructions: Compute the cost of a particular choice of theta.

 %               You should set J to the cost.

 %               Compute the partial derivatives and set grad to the partial

 %               derivatives of the cost w.r.t. each parameter in theta

 hx=sigmoid(X*theta);

 Jnorm=(-/m)*(y'*log(hx)+(1-y)'*log(-hx));

 theta0=theta(); %注意theta0不用正则化

 theta1=theta(:end);

 Jreg=(lambda/(*m))*sum(theta1.^);

 J=Jnorm+Jreg;

 grad0=(hx-y)'*X(:,1)./m;

 grad1=((hx-y)'*X(:,2:end)./m)'+(lambda/m).*theta1;

 grad=[grad0;grad1];

 % =============================================================

 end

sigmoid函数

 function g = sigmoid(z)

 %SIGMOID Compute sigmoid functoon

 %   J = SIGMOID(z) computes the sigmoid of z.

 % You need to return the following variables correctly

 g = zeros(size(z));

 % ====================== YOUR CODE HERE ======================

 % Instructions: Compute the sigmoid of each value of z (z can be a matrix,

 %               vector or scalar).

 g=./(+exp((-).*z));

 % =============================================================

 end

特征

 function out = mapFeature(X1, X2)

 % MAPFEATURE Feature mapping function to polynomial features

 %

 %   MAPFEATURE(X1, X2) maps the two input features

 %   to quadratic features used in the regularization exercise.

 %

 %   Returns a new feature array with more features, comprising of

 %   X1, X2, X1.^, X2.^, X1*X2, X1*X2.^, etc..

 %

 %   Inputs X1, X2 must be the same size

 %

 degree = ;%6次函数

 out = ones(size(X1(:,)));

 for i = :degree

     for j = :i

         out(:, end+) = (X1.^(i-j)).*(X2.^j);

     end

 end

 end

主函数

 %% Machine Learning Online Class - Exercise : Logistic Regression

 %

 %  Instructions

 %  ------------

 %

 %  This file contains code that helps you get started on the second part

 %  of the exercise which covers regularization with logistic regression.

 %

 %  You will need to complete the following functions in this exericse:

 %

 %     sigmoid.m

 %     costFunction.m

 %     predict.m

 %     costFunctionReg.m

 %

 %  For this exercise, you will not need to change any code in this file,

 %  or any other files other than those mentioned above.

 %

 %% Initialization

 clear ; close all; clc

 %% Load Data

 %  The first two columns contains the X values and the third column

 %  contains the label (y).

 data = load('ex2data2.txt');

 X = data(:, [, ]); y = data(:, );

 plotData(X, y);

 % Put some labels

 hold on;

 % Labels and Legend

 xlabel('Microchip Test 1')

 ylabel('Microchip Test 2')

 % Specified in plot order

 legend('y = 1', 'y = 0')

 hold off;

 %% =========== Part : Regularized Logistic Regression ============

 %  In this part, you are given a dataset with data points that are not

 %  linearly separable. However, you would still like to use logistic

 %  regression to classify the data points.

 %

 %  To do so, you introduce more features to use -- in particular, you add

 %  polynomial features to our data matrix (similar to polynomial

 %  regression).

 %

 % Add Polynomial Features

 % Note that mapFeature also adds a column of ones for us, so the intercept

 % term is handled

 X = mapFeature(X(:,), X(:,));

 % Initialize fitting parameters

 initial_theta = zeros(size(X, ), );

 % Set regularization parameter lambda to

 lambda = ;

 % Compute and display initial cost and gradient for regularized logistic

 % regression

 [cost, grad] = costFunctionReg(initial_theta, X, y, lambda);

 fprintf('Cost at initial theta (zeros): %f\n', cost);

 fprintf('\nProgram paused. Press enter to continue.\n');

 pause;

 %% ============= Part : Regularization and Accuracies =============

 %  Optional Exercise:

 %  In this part, you will get to try different values of lambda and

 %  see how regularization affects the decision coundart

 %

 %  Try the following values of lambda (, , , ).

 %

 %  How does the decision boundary change when you vary lambda? How does

 %  the training set accuracy vary?

 %

 % Initialize fitting parameters

 initial_theta = zeros(size(X, ), );

 % Set regularization parameter lambda to  (you should vary this)

 lambda = ;

 % Set Options

 options = optimset('GradObj', 'on', 'MaxIter', );

 % Optimize

 [theta, J, exit_flag] = ...

     fminunc(@(t)(costFunctionReg(t, X, y, lambda)), initial_theta, options);

 % Plot Boundary

 plotDecisionBoundary(theta, X, y);

 hold on;

 title(sprintf('lambda = %g', lambda))

 % Labels and Legend

 xlabel('Microchip Test 1')

 ylabel('Microchip Test 2')

 legend('y = 1', 'y = 0', 'Decision boundary')

 hold off;

 % Compute accuracy on our training set

 p = predict(theta, X);

 fprintf('Train Accuracy: %f\n', mean(double(p == y)) * );

实验中参数lamda的大小也十分重要，不同的lamda可能会过拟合或者欠拟合。

ML 逻辑回归 Logistic Regression的更多相关文章

Coursera公开课笔记&colon; 斯坦福大学机器学习第六课&OpenCurlyDoubleQuote;逻辑回归(Logistic Regression)” 清晰讲解logistic-good&excl;&excl;&excl;&excl;&excl;&excl;
原文:http://52opencourse.com/125/coursera%E5%85%AC%E5%BC%80%E8%AF%BE%E7%AC%94%E8%AE%B0-%E6%96%AF%E5%9D ...
机器学习总结之逻辑回归Logistic Regression
机器学习总结之逻辑回归Logistic Regression 逻辑回归logistic regression,虽然名字是回归,但是实际上它是处理分类问题的算法.简单的说回归问题和分类问题如下: 回归问 ...
机器学习（四）--------逻辑回归(Logistic Regression)
逻辑回归(Logistic Regression) 线性回归用来预测,逻辑回归用来分类. 线性回归是拟合函数,逻辑回归是预测函数逻辑回归就是分类. 分类问题用线性方程是不行的线性方程拟合的是连 ...
机器学习入门11 - 逻辑回归 (Logistic Regression)
原文链接:https://developers.google.com/machine-learning/crash-course/logistic-regression/ 逻辑回归会生成一个介于 0 ...
机器学习方法（五）：逻辑回归Logistic Regression，Softmax Regression
欢迎转载,转载请注明:本文出自Bin的专栏blog.csdn.net/xbinworld. 技术交流QQ群:433250724,欢迎对算法.技术.应用感兴趣的同学加入. 前面介绍过线性回归的基本知识, ...
机器学习 (三) 逻辑回归 Logistic Regression
文章内容均来自斯坦福大学的Andrew Ng教授讲解的Machine Learning课程,本文是针对该课程的个人学习笔记,如有疏漏,请以原课程所讲述内容为准.感谢博主Rachel Zhang 的个人 ...
逻辑回归(Logistic Regression)详解,公式推导及代码实现
逻辑回归(Logistic Regression) 什么是逻辑回归: 逻辑回归(Logistic Regression)是一种基于概率的模式识别算法,虽然名字中带"回归",但实际上 ...
逻辑回归 Logistic Regression
逻辑回归(Logistic Regression)是广义线性回归的一种.逻辑回归是用来做分类任务的常用算法.分类任务的目标是找一个函数,把观测值匹配到相关的类和标签上.比如一个人有没有病,又因为噪声的 ...
【机器学习】Octave 实现逻辑回归 Logistic Regression
ex2data1.txt ex2data2.txt 本次算法的背景是,假如你是一个大学的管理者,你需要根据学生之前的成绩(两门科目)来预测该学生是否能进入该大学. 根据题意,我们不难分辨出这是一种二分 ...

随机推荐

GitHub for Windows 2&period;0使用教程
Git是目前最先进的分布式版本控制系统,作为一个程序员,我们需要掌握其用法. 一:下载GitHub for Windows 2.0 二:安装GitHub 下载之后点击进行安装过程,安装之后桌面上会有 ...
xenomai安装
一.Linux内核打实时补丁 1.将下载的Linux和xenomai安装包放在/usr/src目录下,并解压文件包,命令如下 tar xjf Linux-3.8.13.tar.bz2 tar x ...
reset内容
/*reset */div,p,a,span,body,dl,dt,dd,header,footer,img,section,time,h2,em,article,h3,h4,ul,li,labe ...
Web---JSP-EL表达式
EL表达式简介 EL 全名为Expression Language. EL主要作用: 获取数据: EL表达式主要用于替换JSP页面中的脚本表达式,以从各种类型的java对象中获取数据. 执行运算: 利 ...
【MyBatis源码分析】select源码分析及小结
示例代码之前的文章说过,对于MyBatis来说insert.update.delete是一组的,因为对于MyBatis来说它们都是update:select是一组的,因为对于MyBatis来说它就是 ...
基于Kafka Connect框架DataPipeline在实时数据集成上做了哪些提升？
在不断满足当前企业客户数据集成需求的同时,DataPipeline也基于Kafka Connect 框架做了很多非常重要的提升. 1. 系统架构层面. DataPipeline引入DataPipeli ...
&num;022 Python 实验课
拍7游戏描述 “拍7游戏”规则是:一堆人围成一圈,开始时,任意指定一人说出数字“1”后,一圈人按顺时针方向,每人按整数由小到大的顺序一人一个地报出后续数字“2”.“3”......,当遇到为“7”的 ...
ASP&period;NET Core2&period;2 多用户验证和授权
asp.net core2.2 用户验证和授权有很详细和特贴心的介绍,我感兴趣的主要是这两篇: cookie身份验证基于角色的授权我的项目有两类用户: 微信公众号用户,用户名为公众号的openi ...
checkpoint-BLCR部署和测试(源码)
1. 概述2. 部署过程2.1 源码下载2.2 解压安装2.3 添加库环境2.4 插入内核模块3. 测试3.1 创建测试程序3.2 功能测试4. 参考博客 1. 概述 checkpoint 2. 部署 ...
pgm11
这里简要的讨论 hybrid network 与一些时序数据的分析. hybrid network 指网络中存在离散随机变量与连续随机变量,这种情况下一般非常麻烦,这主要是因为连续型随机变量需要使用某 ...