@Chandraveersingh1717
Summary

Split coco_detection_dataset() into two specialized datasets for object detection and instance segmentation tasks.

Problem

  • The current implementation keeps a 500MB+ annotation object in memory for the entire dataset lifetime
  • The documentation is confusing, mixing detection and segmentation use cases
  • The two tasks never run together (they use different model architectures)
  • Cache organization is poor, with large files that are hard to identify

Solution

coco_detection_dataset() - Object Detection Only

  • Returns: boxes, labels, area, iscrowd
  • Memory: ~250MB (50% reduction)
  • Use: Faster R-CNN, YOLO, SSD

coco_segmentation_dataset() - Instance Segmentation (NEW)

  • Returns: boxes, labels, area, iscrowd, segmentation, masks
  • Memory: ~250MB
  • Use: Mask R-CNN, DeepLab

Cache Organization: Files now stored in /coco subdirectory for better identification
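To illustrate why the split saves memory (this is a hypothetical Python sketch, not the package's actual code): segmentation polygons dominate the size of COCO-style annotation records, so a detection-only dataset can simply drop that field.

```python
def strip_segmentation(annotations):
    """Return detection-only copies of COCO-style annotation dicts,
    dropping the large 'segmentation' field. Hypothetical helper for
    illustration; field names follow the COCO annotation format."""
    keep = ("bbox", "category_id", "area", "iscrowd")
    return [{k: ann[k] for k in keep if k in ann} for ann in annotations]

anns = [
    {"bbox": [10, 20, 30, 40], "category_id": 1, "area": 1200,
     "iscrowd": 0, "segmentation": [[10, 20, 40, 20, 40, 60, 10, 60]]},
]
detection_only = strip_segmentation(anns)
assert "segmentation" not in detection_only[0]
assert detection_only[0]["bbox"] == [10, 20, 30, 40]
```

The segmentation dataset keeps the full records, which is why both variants end up at roughly half the original footprint each.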

Breaking Change

Segmentation users must migrate:

```r
# Before
coco_detection_dataset(..., target_transform = target_transform_coco_masks)

# After
coco_segmentation_dataset(..., target_transform = target_transform_coco_masks)
```
