Dataset：COCO数据集的简介、安装、使用方法之详细攻略

COCO数据集的简介

COCO数据集是一个大型的、丰富的物体检测，分割和字幕数据集。这个数据集以scene understanding为目标，主要从复杂的日常场景中截取，图像中的目标通过精确的segmentation进行位置的标定。图像包括91类目标，328,000影像和2,500,000个label。
该数据集主要解决3个问题：目标检测，目标之间的上下文关系，目标的2维上的精确定位。COCO数据集有91类，虽然比ImageNet和SUN类别少，但是每一类的图像多，这有利于获得更多的每类中位于某种特定场景的能力，对比PASCAL VOC，其有更多类和图像。
官网地址：http://cocodataset.org/#home

1、COCO数据集的特点
COCO is a large-scale object detection, segmentation, and captioning dataset. COCO has several features:

Object segmentation
Recognition in context
Superpixel stuff segmentation
330K images (>200K labeled)
1.5 million object instances
80 object categories
91 stuff categories
5 captions per image
250,000 people with keypoints

对象分割；
在上下文中可识别；
超像素分割；
330K图像（> 200K标记）；
150万个对象实例；
80个对象类别；
91个类别；
每张图片5个字幕；
有关键点的250,000人；

2、数据集的大小和版本

大小：25 GB（压缩）
记录数量： 330K图像、80个对象类别、每幅图像有5个标签、25万个关键点。
COCO数据集分两部分发布，前部分于2014年发布，后部分于2015年，2014年版本：82,783 training, 40,504 validation, and 40,775 testing images，有270k的segmented people和886k的segmented object；2015年版本：165,482 train, 81,208 val, and 81,434 test images。
(1)、2014年版本的数据，一共有20G左右的图片和500M左右的标签文件。标签文件标记了每个segmentation的像素精确位置+bounding box的精确坐标，其精度均为小数点后两位。

3、COCO数据集的展示

Dataset：COCO数据集的简介、安装、使用方法之详细攻略

COCO数据集的安装

数据集下载地址：

1、2014年

http://msvocds.blob.core.windows.net/coco2014/train2014.zip

2、2017年

http://images.cocodataset.org/zips/train2017.zip
http://images.cocodataset.org/annotations/annotations_trainval2017.zip

http://images.cocodataset.org/zips/val2017.zip
http://images.cocodataset.org/annotations/stuff_annotations_trainval2017.zip

http://images.cocodataset.org/zips/test2017.zip
http://images.cocodataset.org/annotations/image_info_test2017.zip

COCO数据集的使用方法

(1)、Download Images and Annotations from [MSCOCO]
(2)、Get the coco code
(3)、Build the coco code
(4)、Split the annotation to many files per image and get the image size info
(5)、 Create the LMDB file

秒客网

Dataset：COCO数据集的简介、安装、使用方法之详细攻略

COCO数据集的简介

COCO数据集的安装

COCO数据集的使用方法

相关文章