参数设定：

reduction可取的值： 'none' | 'mean' | 'sum'.

'none': no reduction will be applied

'mean': the sum of the output will be divided by the number of elements in the output,求的是平均值，即各个差求和之后除以总数。

'sum': the output will be summed. Note: size_average and reduce are in the process of being deprecated, and in the meantime, specifying either of those two args will override reduction.只求和，不除总数。

Default: 'mean'

1.1.3 代码实现

设置reduction的值为默认值和sum，观察区别。

import torch
from torch.nn import L1Loss

inputs = torch.tensor([1,2,3],dtype=torch.float32)
targets = torch.tensor([1,2,5],dtype=torch.float32)

inputs = torch.reshape(inputs,(1,1,1,3))
targets = torch.reshape(targets,(1,1,1,3))

loss1 = L1Loss()
result1 = loss1(inputs,targets)
print(result1)

loss2 = L1Loss(reduction="sum")
result2 = loss2(inputs,targets)
print(result2)

当取值为默认值mean时，求的是平均值，sum=（1-1+2-2+5-3）=2, n=3, result = sum/n=0.6667
当取值为sum时，求的是和，即result=2

pytorch初学笔记（十四）：损失函数

1.2 MSE损失函数（平方和）

1.2.1 简介

均方误差（Mean Square Error,MSE）是回归损失函数中最常用的误差，它是预测值f(x)与目标值y之间差值平方和的均值，其公式如下所示：
pytorch初学笔记（十四）：损失函数

1.2.2 参数介绍

MSELoss — PyTorch 1.13 documentation

pytorch初学笔记（十四）：损失函数

与上面的L1损失函数一样，我们可以改变reduction的值来进行对应数值的输出。

1.2.3 代码实现

import torch
from torch.nn import L1Loss, MSELoss

inputs = torch.tensor([1,2,3],dtype=torch.float32)
targets = torch.tensor([1,2,5],dtype=torch.float32)

inputs = torch.reshape(inputs,(1,1,1,3))
targets = torch.reshape(targets,(1,1,1,3))

loss_mse1 = MSELoss()
result1 = loss_mse1(inputs,targets)
print(result1)

loss_mse2 = MSELoss(reduction="sum")
result2 = loss_mse2(inputs,targets)
print(result2)

可以看到reduction设置不同的值对应的输出也不同。

pytorch初学笔记（十四）：损失函数

1.3 损失函数的作用

计算实际输出和目标之间的差距
为更新输出（反向传播）提供一定的依据

二、在神经网络中使用loss function

2.1 使用交叉熵损失函数

使用上次定义的神经网络和CIFAR10数据集进行图像分类，分类问题使用交叉熵损失函数。

import torch.nn
from torch import nn
import torchvision.datasets
from torch.nn import Conv2d, MaxPool2d, Flatten, Linear, Sequential
from torch.utils.data import DataLoader
from torch.utils.tensorboard import SummaryWriter

dataset = torchvision.datasets.CIFAR10(root="./CIFAR10",train=False,transform=torchvision.transforms.ToTensor(),download=True)
dataloader = DataLoader(dataset,batch_size=1)

class Maweiyi(torch.nn.Module):
    def __init__(self):
        super(Maweiyi, self).__init__()
        self.model1 = Sequential(
            Conv2d(in_channels=3, out_channels=32, kernel_size=5, padding=2),
            MaxPool2d(kernel_size=2),
            Conv2d(in_channels=32, out_channels=32, kernel_size=5, padding=2),
            MaxPool2d(kernel_size=2),
            Conv2d(in_channels=32, out_channels=64, kernel_size=5, padding=2),
            MaxPool2d(kernel_size=2),
            Flatten(),
            Linear(in_features=1024, out_features=64),
            Linear(in_features=64, out_features=10)
        )

    def forward(self, x):
         x = self.model1(x)
         return x

maweiyi = Maweiyi()
# 使用交叉熵损失函数
loss_cross = nn.CrossEntropyLoss()

for data in dataloader:
    imgs,labels = data
    outputs = maweiyi(imgs)
    results = loss_cross(outputs,labels)
    print(results)

可以看到使用loss function计算出了在神经网路中预测的output和真实值labels之间的差距大小。

pytorch初学笔记（十四）：损失函数

2.2 反向传播

results_loss.backward()

pytorch初学笔记（十四）：损失函数

秒客网

pytorch初学笔记（十四）：损失函数

一、损失函数

1.1 L1损失函数

1.1.1 简介

1.1.2 参数设定

1.1.3 代码实现

1.2 MSE损失函数（平方和）

1.2.1 简介

1.2.2 参数介绍

1.2.3 代码实现

1.3 损失函数的作用

二、在神经网络中使用loss function

2.1 使用交叉熵损失函数

2.2 反向传播

相关文章