Docker 安装tensorflow

时间:2022-04-14 14:16:23

安装DOCKER

1. https://docs.docker.com/engine/installation/linux/docker-ce/ubuntu/

nstall from a package

If you cannot use Docker’s repository to install Docker CE, you can download the .deb file for your release and install it manually. You will need to download a new file each time you want to upgrade Docker CE.

  1. Go to https://download.docker.com/linux/ubuntu/dists/, choose your Ubuntu version, browse to pool/stable/ and choose amd64,armhf, or s390x. Download the .deb file for the Docker version you want to install.

    Note: To install an edge package, change the word stable in the URL to edgeLearn about stable and edge channels.

  2. Install Docker CE, changing the path below to the path where you downloaded the Docker package.

    $ sudo dpkg -i /path/to/package.deb

    The Docker daemon starts automatically.

  3. Verify that Docker CE is installed correctly by running the hello-world image.

    $ sudo docker run hello-world

    This command downloads a test image and runs it in a container. When the container runs, it prints an informational message and exits.

Docker CE is installed and running. You need to use sudo to run Docker commands. Continue to Post-installation steps for Linux to allow non-privileged users to run Docker commands and for other optional configuration steps.

2. https://github.com/widuu/chinese_docker/blob/master/installation/ubuntu.md#Ubuntu%E5%AE%89%E8%A3%85Docker

##Ubuntu Docker可选配置

这部分主要介绍了 Docker 的可选配置项,使用这些配置能够让 Docker 在 Ubuntu 上更好的工作。

  • 创建 Docker 用户组
  • 调整内存和交换空间(swap accounting)
  • 启用防火墙的端口转发(UFW)
  • 为 Docker 配置DNS服务

###创建 Docker 用户组

docker 进程通过监听一个 Unix Socket 来替代 TCP 端口。在默认情况下,docker 的 Unix Socket属于root用户,当然其他用户可以使用sudo方式来访问。因为这个原因, docker 进程就一直是root用户运行的。

为了在使用 docker 命令的时候前边不再加sudo,我们需要创建一个叫 docker 的用户组,并且为用户组添加用户。然后在 docker 进程启动的时候,我们的 docker 群组有了 Unix Socket 的所有权,可以对 Socket 文件进行读写。

注意:docker 群组就相当于root用户。有关系统安全影响的细节,请查看 Docker 进程表面攻击细节

创建 docker 用户组并添加用户

  1. 使用具有sudo权限的用户来登录你的Ubuntu。

    在这过程中,我们假设你已经登录了Ubuntu。

  2. 创建 docker 用户组并添加用户。

     $ sudo usermod -aG docker ubuntu
  3. 注销登录并重新登录

    这里要确保你运行用户的权限。

  4. 验证 docker 用户不使用 sudo 命令开执行 Docker

     $ docker run hello-world

创建用户组docker,可以避免使用sudo 
将docker和wxl(王小雷用户名,在创建主机时默认用户名称是ubuntu)添加到一个组内

#默认是ubuntu用户
#wxl@wxl-pc:~$ sudo usermod -aG docker ubuntu
# 将wxl的用户添加到docker用户组中,如果多个用户需要用空格隔开 如 wxl wxl1 wxl2用户
wxl@wxl-pc:~$ sudo usermod -aG docker wxl

注意需要重新启动计算机或者注销用户再登入,才能生效。这样就不需要使用sudo命令了。 
那么,如何将wxl从docker用户组移除? 
sudo gpasswd -d wxl docker 
如何删除刚才创建的docker用户组? 
sudo groupdel docker 
如何创建和删除新用户,如用户newuser 
sudo adduser newuser 
sudo userdel newuser

###调整内存和交换空间(swap accounting)

当我们使用 Docker 运行一个镜像的时候,我们可能会看到如下的信息提示:

WARNING: Your kernel does not support cgroup swap limit. WARNING: Your
kernel does not support swap limit capabilities. Limitation discarded.、

为了防止以上错误信息提示的出现,我们需要在系统中启用内存和交换空间。我们需要修改系统的 GUN GRUB (GNU GRand Unified Bootloader) 来启用内存和交换空间。开启方法如下:

  1. 使用具有sudo权限的用户来登录你的Ubuntu。

  2. 编辑 /etc/default/grub 文件

  3. 设置 GRUB_CMDLINE_LINUX 的值如下:

     GRUB_CMDLINE_LINUX="cgroup_enable=memory swapaccount=1"
  4. 保存和关闭文件

  5. 更新 GRUB

     $ sudo update-grub
  6. 重启你的系统。

1.5.如何更新Docker

wxl@wxl-pc:~$ sudo apt-get upgrade docker-engine

1.6.如何卸载Docker

wxl@wxl-pc:~$ sudo apt-get purge docker-engine

2.运行一个web应用–Python Flask

2.1.docker简单命令汇总如下:

  • docker run ubuntu /bin/echo “hello world” -运行ubuntu镜像并且在命令窗口输出”hello world”
  • docker run -t -i ubuntu /bin/bash -进入ubuntu这个镜像的bash命令窗口,可以操作本镜像ubuntu的命令如ls
  • docker ps - 列出当前运行的容器
  • docker logs - 展示容器的标准的输出(比如hello world)
  • docker stop - 停止正在运行的容器
  • docker version -可以查看守护的进程,docker版本以及go版本(docker本身是用go语言写的)

总结,可以看出docker的命令一般为 
[sudo] docker [subcommand] [flags] [arguments] 
如docker run -i -t ubuntu /bin/bash

3.GPU需+额外:https://medium.com/@gooshan/for-those-who-had-trouble-in-past-months-of-getting-google-s-tensorflow-to-work-inside-a-docker-9ec7a4df945b

https://github.com/NVIDIA/nvidia-docker

4. 安装nvidia-docker

http://blog.csdn.net/u011987514/article/details/70943378

 
  1. # Install nvidia-docker and nvidia-docker-plugin
  2. wget -P /tmp https://github.com/NVIDIA/nvidia-docker/releases/download/v1.0.1/nvidia-docker_1.0.1-1_amd64.deb
  3. sudo dpkg -i /tmp/nvidia-docker*.deb && rm /tmp/nvidia-docker*.deb
官方提供的测试方法需要下载一个1G左右的镜像才能测试
这里只需要输入sudo nvidia_docker info测试一下即可 
默认情况下Docker会把镜像安装在根目录下/var/lib/docker,这样镜像会大量占用系统盘空间,最终导致磁盘资源不足 
解决方案是修改默认安装目录
由于我的/home磁盘资源比较多,所以都安装到/home去
  1. zcw@ubuntu:~# mkdir docker
  2. zcw@ubuntu:~# vim /etc/default/docker
添加配置信息

DOCKER_OPTS="--graph=/home/docker"

保存退出

service docker restart

发现配置并没有生效
解决方案:
  1. zcw@ubuntu:~# mkdir -p /etc/systemd/system/docker.service.d
  2. zcw@ubuntu:~# cat /etc/systemd/system/docker.service.d/Using_Environment_File.conf
如果没有该文件则自行创建,添加以下内容

[plain] view plain copy

  1. EnvironmentFile=-/etc/default/docker
  2. ExecStart=
  3. ExecStart=/usr/bin/docker daemon -H fd:// $DOCKER_OPTS
载入配置重启服务
  1. zcw@ubuntu:~# systemctl daemon-reload
  2. zcw@ubuntu:~# service docker restart
查看配置是否生效
  1. zcw@ubuntu:~# ps -ef|grep docker

You need at least a 384.xx series driver for CUDA 9. NVIDIA recommends 384.81 or later. If you're installing from deb packages, note that the package name is different for nvidia-384 versus nvidia-375 in order that the major driver version upgrade is intentional rather than automatic.

Alternatively, you can also use a CUDA 8.0 image and not upgrade your driver:

nvidia-docker run --rm nvidia/cuda:8.0-devel nvidia-smi

3. Install Docker and nvidia-docker

# Install docker
curl -sSL https://get.docker.com/ | sh 
The docker container needs access to the GPU devices. For this purpose use `nvidia-docker` which is a wrapper around the standard `docker` command.
# Install nvidia-docker and nvidia-docker-plugin
wget -P /tmp https://github.com/NVIDIA/nvidia-docker/releases/download/v1.0.0-rc.3/nvidia-docker_1.0.0.rc.3-1_amd64.deb
sudo dpkg -i /tmp/nvidia-docker*.deb && rm /tmp/nvidia-docker*.deb
# Test nvidia-smi.
nvidia-docker run --rm nvidia/cuda nvidia-smi

You might need to use `nvidia-docker` with sudo!

安装Tensorflow

http://blog.csdn.net/cq361106306/article/details/54094517

http://blog.csdn.net/freewebsys/article/details/70237003

http://blog.csdn.net/dream_an/article/details/55520205

下载tensorflow-gpu版本的源

docker pull daocloud.io/daocloud/tensorflow:0.11.0-gpu

先查下你有几个GPU设备

[root@XXX ~]# ls -la /dev | grep nvidia
crw-rw-rw-. 1 root root 195, 0 Sep 16 13:49 nvidia0
crw-rw-rw-. 1 root root 195, 255 Sep 16 13:49 nvidiactl
crw-rw-rw-. 1 root root 247, 0 Sep 16 13:54 nvidia-uvm

然后再查你的docker镜像

y@y:~$ sudo docker images
[sudo] password for y:
REPOSITORY TAG IMAGE ID CREATED SIZE
daocloud.io/daocloud/tensorflow 0.11.0-gpu dd645f420f1d 8 weeks ago 2.713 GB
daocloud.io/daocloud/tensorflow 0.10.0-devel-gpu fa886c09638d 3 months ago 5.014 GB
hello-world

然后就可以启动咯

sudo docker run -ti -v /home/:/mnt/home --privileged=true --device /dev/nvidia0:/dev/nvidia0 --device /dev/nvidiactl:/dev/nvidiactl --device /dev/nvidia-uvm:/dev/nvidia-uvm daocloud.io/daocloud/tensorflow:0.11.0-gpu /bin/bash

Docker 安装tensorflow

上面这句有点长把它写到docker.sh文件,然后

sh docker.sh

完成。 上面的意思是把本地的/home映射到docker的/mnt目录 
以及各种显卡设备也映射进去

 

主机保存镜像为新版本

sudo docker ps -l
y@y:~$ sudo docker ps -l
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
a1f2ac36a2c9 daocloud.io/daocloud/tensorflow:0.11.0-gpu "/bin/bash" 10 minutes ago Up 10 minutes 6006/tcp, 8888/tcp

把a1f2ac36a2c9这个名字记住 
然后

docker commit a1f2ac36a2c9 新名字

OK了

1. https://medium.com/@gooshan/for-those-who-had-trouble-in-past-months-of-getting-google-s-tensorflow-to-work-inside-a-docker-9ec7a4df945b

4. Run a Tensorflow GPU-enable Docker container

The container itself is started as pointed out in the official documentation as follows:

# Run container
nvidia-docker run -d --name <some name> -p 8888:8888 -p 6006:6006 gcr.io/tensorflow/tensorflow:latest-gpu
# Log in
nvidia-docker exec -it <some name> bash

e.g.:

nvidia-docker run -d --name tf1 -p 8888:8888 -p 6006:6006 gcr.io/tensorflow/tensorflow:latest-gpu
nvidia-docker exec -it tf1 bash 

Note: Port 8888 is for ipython notebooks and port 6006 is for TensorBoard.

You can test if everything is alright by running this Python script.

 https://www.tensorflow.org/install/install_linux#InstallingDocker

GPU support

Prior to installing TensorFlow with GPU support, ensure that your system meets all NVIDIA software requirements. To launch a Docker container with NVidia GPU support, enter a command of the following format:

$ nvidia-docker run -it -p hostPort:containerPort TensorFlowGPUImage

where:

  • -p hostPort:containerPort is optional. If you plan to run TensorFlow programs from the shell, omit this option. If you plan to run TensorFlow programs as Jupyter notebooks, set both hostPort and containerPort to 8888.
  • TensorFlowGPUImage specifies the Docker container. You must specify one of the following values:
    • gcr.io/tensorflow/tensorflow:latest-gpu, which is the latest TensorFlow GPU binary image.
    • gcr.io/tensorflow/tensorflow:latest-devel-gpu, which is the latest TensorFlow GPU Binary image plus source code.
    • gcr.io/tensorflow/tensorflow:version-gpu, which is the specified version (for example, 0.12.1) of the TensorFlow GPU binary image.
    • gcr.io/tensorflow/tensorflow:version-devel-gpu, which is the specified version (for example, 0.12.1) of the TensorFlow GPU binary image plus source code.

We recommend installing one of the latest versions. For example, the following command launches the latest TensorFlow GPU binary image in a Docker container from which you can run TensorFlow programs in a shell:

$ nvidia-docker run -it gcr.io/tensorflow/tensorflow:latest-gpu bash

The following command also launches the latest TensorFlow GPU binary image in a Docker container. In this Docker container, you can run TensorFlow programs in a Jupyter notebook:

$ nvidia-docker run -it -p 8888:8888 gcr.io/tensorflow/tensorflow:latest-gpu

The following command installs an older TensorFlow version (0.12.1):

$ nvidia-docker run -it -p 8888:8888 gcr.io/tensorflow/tensorflow:0.12.1-gpu

Docker will download the TensorFlow binary image the first time you launch it. For more details see the TensorFlow docker readme.

Next Steps

You should now validate your installation.

使用:Docker

http://song*g.github.io/pages/2017/02/25/an-zhuang-dockerhe-tensorflow/

4. 镜像管理

4.0. 下载镜像

docker pull image-name

4.1. 查看本地镜像

docker images

4.2. 查看运行的容器

查看active镜像:  docker ps
查看所有镜像: docker ps -a
查看最近的镜像: docker ps -l

4.3. 删除镜像 删除容器

docker rmi image-name

docker rm <container id>

4.4. 在repo里搜索镜像

docker search [image-name]

4.5. 停止镜像

docker stop container-id

4.6. 停止并删除所有容器

docker stop $(docker ps -a -q)
docker rm $(docker ps -a -q)

5.4. 退出

exit

To restart the exited container:

docker start -a -i `docker ps -q -l`
docker start start a container (requires name or ID)
-a attach to container
-i interactive mode
docker ps List containers
-q list only container IDs
-l list only last created container

5.5. How do I install new libraries in Docker?

Therre are two ways to do this:

First methodd: Modify the Dockerfile directly to install new or update your existing libraries. You will need to do a docker build after you do this. If you just want to update to a newer version of the DL framework(s), you can pass them as CLI parameter using the --build-arg tag (see for details). The framework versions are defined at the top of the Dockerfile. For example, docker build -t shmhub/dl-docker:cpu -f Dockerfile.cpu --build-arg TENSORFLOW_VERSION=1.2.0 .

Second mthodd: you can install or upgrade in the container. After it is done, exit the cocntainer and do a commit as introducte below.

5.5. 向docker image提交container change

docker commit -m "install vim wget on ubuntu" -a "author: shm" 7de2c97f7a85 shm/ubuntu_custom

这里7de2c97f7a85是image id.也就是在bash下面看到的root@后面的id: root@7de2c97f7a85. 这时候比较前后的镜像,就会发现commit以后多了一个镜像

shm@ubuntu:~$ docker images
REPOSITORY TAG IMAGE ID CREATED SIZE
tensorflow/tensorflow latest ea40dcc45724 2 weeks ago 1.03 GB
ubuntu latest f49eec89601e 5 weeks ago 129 MB shm@ubuntu:~$ docker commit -m "install vim wget on ubuntu" -a "author: shm" 7de2c97f7a85 shm/ubuntu_custom
sha256:5ed742f690e11c65db83936847c7c5659c5834f6b2c93b52d110455936e6a224 shm@ubuntu:~$ docker images
REPOSITORY TAG IMAGE ID CREATED SIZE
shm/ubuntu_custom latest 5ed742f690e1 12 seconds ago 647 MB
tensorflow/tensorflow latest ea40dcc45724 2 weeks ago 1.03 GB
ubuntu latest f49eec89601e 5 weeks ago 129 MB

5.6. 向仓库提交镜像

首先登陆

docker login -u docker-username

然后push

5.6.1. list the image and get the tag id

shm@shm-xps9550:~/projects/dl_lessons/courses-master/deeplearning1/nbs$ `docker images`
REPOSITORY TAG IMAGE ID CREATED SIZE
shmhub/dl-docker cpu 0f1e40d1bed8 12 days ago 9.13 GB
ubuntu 16.04 6a2f32de169d 13 days ago 117 MB

5.6.2. tag the giage with the registoryhost

docker tag 0f1e40d1bed8 pinseng/dl-docker

5.6.3. push the image to the repo

docker push docker-username/docker-image-name
docker push pinseng/dl-docker

四、使用docker exec进入Docker容器

除了上面几种做法之外,docker在1.3.X版本之后还提供了一个新的命令exec用于进入容器,这种方式相对更简单一些,下面我们来看一下该命令的使用:

  1. $ sudo docker exec --help

Docker 安装tensorflow

接下来我们使用该命令进入一个已经在运行的容器

  1. $ sudo docker ps
  2. $ sudo docker exec -it 775c7c9ee1e1 /bin/bash

Docker 安装tensorflow

如何获取localhost的地址? 
打开一个新的terminal,查看container的地址:

sudo docker inspect clever_bohr | grep IPAddress
  • 1

这里的clever_bohr为该正在运行的container的名字,例子如下

Docker 安装tensorflow 
在浏览器中输入:172.17.0.6:8888

http://blog.csdn.net/tina_ttl/article/details/51417358

http://[all ip addresses on your system]:8888/==========================>http://172.17.0.2:8888/

Docker 安装tensorflow

=========================================================================================================================================================================================================2017/11/8更=====

docker pull tensorflow/tensorflow:nightly-gpu-py3

Docker 安装tensorflow

======================================docker run bash版本========================================

Docker 安装tensorflow

===========================================================本地ipython3 notebook=================================================

Docker 安装tensorflow

=======================================================make   password==================================================

Docker 安装tensorflow

========================================================use docker-tf jupyter   .sh       code========================================================Docker 安装tensorflow