Python中if __name__=="__main__" 语句在调用多进程Process过程中的作用分析

2018年2月27日于创B515

引言
　　最近准备学习一下如何使用Python中的多进程。在翻看相关书籍、网上资料时发现所有代码都含有if __name__=="__main__"，在实验的过程中发现如果在运行代码过程中，没有这句话Python解释器就会报错。虽然Python对于multiprocessing的文档第17.2.1.1节中【1】提到必须如此使用，但是我觉得并没有根本上解释清楚。因此我决定从源码来解释我的疑惑。

# 代码0.1错误代码
import multiprocessing as mp

import os

def do():

    print("pid is : %s ..." % os.getpid())

print("parent id is : %s ..." % os.getpid())

p = mp.Process(target=do, args=())

p.start()

# 代码0.2正确代码

import multiprocessing as mp

import os

def do():

    print("pid is : %s ..." % os.getpid())

if __name__ == '__main__':

    print("parent id is : %s ..." % os.getpid())

    p = mp.Process(target=do, args=())

    p.start()

问题描述
问题
在运行代码-0.1时，会出现RuntimeError，错误提示如下。但是运行代码0.2时就不会，一切顺利。

    An attempt has been made to start a new process before the current process has finished its bootstrapping phase.

    This probably means that you are not using fork to start your child processes and you have forgotten to use the 
proper idiom in the main module:

　　　　 　　if __name__ == '__main__':

　　　　　　　　freeze_support()

　　　　　　　　...

　　The "freeze\_support()" line can be omitted if the program is not going to be frozen to produce an executable.

问题产生的环境

环境配置
运行环境:	Win10
IDE	Sublime Text3

简单解释
　　由于Python运行过程中，新创建进程后，进程会导入正在运行的文件，即在运行代码0.1的时候，代码在运行到mp.Process时，新的进程会重新读入改代码，对于没有if __name__=="__main__"保护的代码，新进程都认为是要再次运行的代码，这是子进程又一次运行mp.Process，但是在multiprocessing.Process的源码中是对子进程再次产生子进程是做了限制的，是不允许的，于是出现如上的错误提示。

详细解释
先谈一谈if__name__=="__main__"
　　在Python有关__main__的文档中【2】说明“__main__”是代码执行时的最高的命名空间（the name of the scope in which top-level code executes），当代码被当做脚本读入的时候，命名空间会被命名为“__main__”，对于在脚本运行过程中读入的代码命名空间都不会被命名为“__main__”。这也就是说创建的子进程是不会读取__name__=="__main__"保护下的代码。

再谈一谈multiprocessing（win32下的源码分析）
multiprocessing根据平台不同会执行不同的代码：在类UNIX系统下由于操作系统本身支持fork()语句，win32系统由于本身不支持fork()，因此在两种系统下multiprocessing会运行不同的代码，如图1 UNIX平台、图2 win32平台(包含在context.py文件中，Process的定义也是在context.py文件中)。