COCO dataset: a large-scale color-photo image dataset for segmentation and other tasks (AI / Machine Learning Dataset Dictionary) - @IT


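This article describes the COCO dataset. It provides image data for about 330,000 color photos (more than 200,000 of them labeled) together with annotations (that is, the labels). Both can be downloaded free of charge and used for object detection and segmentation, keypoint detection and pose estimation, caption generation, and similar tasks.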

Published: September 8, 2021, 05:00
[Masahiko Isshiki, Digital Advantage]

Dataset overview

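Microsoft COCO (Common Objects in Context) is a large-scale image dataset of about 330,000 color photos (Figure 1). More than about 200,000 of them are labeled; the remaining roughly 120,000 are unlabeled.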

Figure 1: The official COCO website

Uses

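The main uses (tasks) are:

  • Object detection / segmentation: bounding boxes, 80 object categories, and segmentation masks for 1.5 million object instances
  • Keypoint detection / pose estimation: 250,000 person instances annotated with 17 keypoints (nose, eyes, ears, shoulders, elbows, wrists, hips, knees, ankles)
  • Caption generation: five captions (natural-language descriptions) per image

These tasks are reflected in the competitions held with COCO; the competitions held in 2020 are one reference point.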

COCO hosts competitions on several kinds of segmentation task. In 2020, two were held: Instance Segmentation and Panoptic Segmentation. In 2019, a task called "Stuff Segmentation" was also held.

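"Stuff" here means uncountable things without a clear shape, such as glass and walls. It is distinguished from objects (Object, Thing), which are countable things with a physical shape, such as people, cars, and cats. Stuff Segmentation is semantic segmentation applied to stuff, and is distinct from Instance Segmentation, which segments objects. Panoptic Segmentation, mentioned above, is scene segmentation that covers both stuff and objects.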
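The relationship between the three segmentation tasks can be sketched in a few lines. The example below is only an illustration: the category table is a hand-made toy (the ids are made up), and the `isthing` flag is assumed to follow the convention used in COCO's panoptic category files, where 1 marks countable objects and 0 marks stuff.

```python
# Toy category table (ids are made up for illustration).
# Assumption: "isthing" = 1 means a countable object, 0 means stuff.
CATEGORIES = [
    {"id": 1, "name": "person", "isthing": 1},
    {"id": 2, "name": "car", "isthing": 1},
    {"id": 3, "name": "cat", "isthing": 1},
    {"id": 4, "name": "wall", "isthing": 0},
    {"id": 5, "name": "glass", "isthing": 0},
]

# Which "isthing" values each task covers.
TASK_COVERAGE = {
    "instance": {1},      # objects only
    "stuff": {0},         # stuff only
    "panoptic": {0, 1},   # both objects and stuff
}


def categories_for_task(categories, task):
    """Return the names of the categories that a segmentation task covers."""
    if task not in TASK_COVERAGE:
        raise ValueError(f"unknown task: {task}")
    wanted = TASK_COVERAGE[task]
    return [c["name"] for c in categories if c["isthing"] in wanted]


for task in ("instance", "stuff", "panoptic"):
    print(task, categories_for_task(CATEGORIES, task))
```

Panoptic Segmentation is the only one of the three whose category list includes every entry in the table.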

Structure of the distributed data

@f[^ZbǵA摜f[^ƁAɑ΂鋳txłAme[VɕAɂ炪PiTrainj^؁iValj^eXgiTestjɂ炩ߕĂBCOCO͖N̂悤ɃAbvf[gĂA̓Iɂ͈ȉ̂悤ɍXVĂB

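  • 2014: Training/validation images and annotations; test images (and image info)
  • 2015: Test images (and image info)
  • 2017: Training/validation images and annotations; training/validation images and annotations for Stuff Segmentation; training/validation images and annotations for Panoptic Segmentation; test images (and image info); optionally, unlabeled images (and image info) that can be used for semi-supervised learning and similar purposes
  • 2018: Complete "annotations for Stuff Segmentation" and "annotations for Panoptic Segmentation" for all 2017 images
  • 2019: No changes
  • 2020: No changes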

For details, see the COCO download page.
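To make the split between image data and annotations concrete, here is a minimal sketch of reading a COCO-style annotation file. The JSON below is a tiny hand-written stand-in with made-up values, not real COCO data. It assumes the usual layout of the instance annotation files: top-level `images`, `categories`, and `annotations` lists, with each `bbox` stored as `[x, y, width, height]` in pixels.

```python
import json
from collections import defaultdict

# Hand-written stand-in for an instances_*.json file (all values are made up).
TOY_JSON = """
{
  "images": [
    {"id": 1, "file_name": "000000000001.jpg", "width": 640, "height": 480}
  ],
  "categories": [{"id": 18, "name": "dog"}],
  "annotations": [
    {"id": 10, "image_id": 1, "category_id": 18,
     "bbox": [100.0, 50.0, 200.0, 150.0], "area": 30000.0, "iscrowd": 0}
  ]
}
"""


def index_annotations(coco):
    """Group annotation records by the id of the image they belong to."""
    by_image = defaultdict(list)
    for ann in coco["annotations"]:
        by_image[ann["image_id"]].append(ann)
    return by_image


def xywh_to_xyxy(bbox):
    """Convert [x, y, width, height] to [x_min, y_min, x_max, y_max]."""
    x, y, w, h = bbox
    return [x, y, x + w, y + h]


coco = json.loads(TOY_JSON)
names = {c["id"]: c["name"] for c in coco["categories"]}
by_image = index_annotations(coco)

for ann in by_image[1]:
    print(names[ann["category_id"]], xywh_to_xyxy(ann["bbox"]))
# prints: dog [100.0, 50.0, 300.0, 200.0]
```

For real files, replace `json.loads(TOY_JSON)` with `json.load()` on the downloaded annotation file.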

Information for citation

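According to COCO's terms of use, anyone using the COCO images must follow Flickr's terms of use. Each image file is assigned one of the Creative Commons licenses, and that information is included in the annotations.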
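The per-image license can be looked up from the annotation data. The sketch below uses a hand-made toy dictionary (the license names and URLs are placeholders). It assumes the usual COCO layout, in which a top-level `licenses` list holds `id`, `name`, and `url`, and each image record refers to a license through its `license` field.

```python
# Toy annotation data (license names and URLs are placeholders).
toy_coco = {
    "licenses": [
        {"id": 1, "name": "Example License A", "url": "https://example.com/a"},
        {"id": 2, "name": "Example License B", "url": "https://example.com/b"},
    ],
    "images": [
        {"id": 100, "file_name": "first.jpg", "license": 2},
        {"id": 101, "file_name": "second.jpg", "license": 1},
    ],
}


def license_for_image(coco, image_id):
    """Return the license record assigned to the image with the given id."""
    licenses = {lic["id"]: lic for lic in coco["licenses"]}
    for image in coco["images"]:
        if image["id"] == image_id:
            return licenses[image["license"]]
    raise KeyError(f"no image with id {image_id}")


lic = license_for_image(toy_coco, 100)
print(lic["name"], lic["url"])
# prints: Example License B https://example.com/b
```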

The paper to cite when referring to research on this dataset is summarized below.

  • Authors: Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Dollár
  • Title: Microsoft COCO: Common Objects in Context
  • Published: May 01, 2014
  • Paper: arXiv:1405.0312 [cs.CV]
  • URL: https://cocodataset.org

How to use

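To use COCO in practice, we recommend the functionality provided by libraries such as TensorFlow and PyTorch. The typical code for using COCO with each library is shown briefly below (the code is not explained in detail).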

TensorFlow Datasets

# !pip install tensorflow-datasets  # Install the TensorFlow Datasets library

import tensorflow_datasets as tfds

# Training split of COCO 2017
coco2017_train = tfds.load(name="coco/2017", split="train")
# Training split of the COCO 2014 captions dataset
coco2014_captions_train = tfds.load(name="coco_captions/2014", split="train")

Listing 1: Basic code for using COCO with TensorFlow Datasets

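The datasets available to TensorFlow are collected in TensorFlow Datasets and can be loaded with the load() function of the tensorflow_datasets module (imported as tfds). Two kinds of COCO dataset are provided, coco and coco_captions, both of which appear in Listing 1.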
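One detail to watch when working with the loaded examples is the bounding-box convention. The helper below is a sketch, not part of TensorFlow Datasets. It assumes the library's usual bounding-box feature, which stores boxes as `[ymin, xmin, ymax, xmax]` normalized to the range 0 to 1. Check the dataset's feature description before relying on this. The function name `tfds_bbox_to_pixels` is hypothetical.

```python
def tfds_bbox_to_pixels(bbox, height, width):
    """Convert a normalized [ymin, xmin, ymax, xmax] box to pixel
    coordinates in (x_min, y_min, x_max, y_max) order.

    Assumption: the box uses the normalized ymin/xmin/ymax/xmax layout.
    """
    ymin, xmin, ymax, xmax = bbox
    return (xmin * width, ymin * height, xmax * width, ymax * height)


# A made-up box on a 600 x 400 (width x height) image.
print(tfds_bbox_to_pixels((0.25, 0.5, 0.75, 1.0), height=400, width=600))
# prints: (300.0, 100.0, 600.0, 300.0)
```

This differs from the `[x, y, width, height]` pixel format used in the original COCO annotation files, so boxes from the two sources should not be mixed without conversion.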

PyTorch

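PyTorch does not download COCO automatically. Beforehand, download both the image data and the annotations you need by the method described under "How to download" below, and place them in folders of your choice (for example, the image data in "./images/train2017/" and the annotations in "./annotations/").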

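# !pip install torch torchvision pycocotools  # Install PyTorch; the COCO dataset classes also need pycocotools

import torch
import torchvision

# Object detection / segmentation annotations
coco_det_data = torchvision.datasets.CocoDetection(
    './images/train2017', annFile='./annotations/instances_train2017.json',
    transform=torchvision.transforms.ToTensor())

# Caption annotations
coco_cap_data = torchvision.datasets.CocoCaptions(
    './images/train2017', annFile='./annotations/captions_train2017.json',
    transform=torchvision.transforms.ToTensor())

# Images differ in size and targets vary in length, so keep each batch as
# tuples instead of letting the default collate function stack them.
def collate_fn(batch):
    return tuple(zip(*batch))

data_loader_det = torch.utils.data.DataLoader(
    coco_det_data, batch_size=4, shuffle=True, collate_fn=collate_fn)
data_loader_cap = torch.utils.data.DataLoader(
    coco_cap_data, batch_size=4, shuffle=True, collate_fn=collate_fn)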

Listing 4: Basic code for using COCO with PyTorch

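Use the constructor (strictly speaking, the __init__ function) of one of the COCO classes in the torchvision.datasets namespace, CocoDetection or CocoCaptions in Listing 4, to create a dataset object that reads the data you placed in advance. Then use the constructor of the torch.utils.data.DataLoader class to create a data loader object that loads the data.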
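Because the files are not downloaded automatically, a missing folder or annotation file only shows up when the dataset object is created. A small pre-flight check can catch that earlier. The function below is a hypothetical helper, not part of torchvision. Its default paths are the example locations used in this article and should be adjusted to your own layout.

```python
from pathlib import Path


def missing_coco_paths(
    image_dir="./images/train2017",
    ann_dir="./annotations",
    ann_files=("instances_train2017.json", "captions_train2017.json"),
):
    """Return the expected folders and files that do not exist yet."""
    expected = [Path(image_dir)]
    expected += [Path(ann_dir) / name for name in ann_files]
    return [path for path in expected if not path.exists()]


missing = missing_coco_paths()
if missing:
    print("Missing:", ", ".join(str(path) for path in missing))
else:
    print("All expected COCO paths are present.")
```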

How to download

COCO can be downloaded from the official page. For the specific steps, follow the instructions given there.

Figure 2: The COCO dataset download page


Copyright© Digital Advantage Corp. All Rights Reserved.
