Coco dataset , fix for grayscales images, convert them to RGB by ExtReMLapin · Pull Request #45 · tlpss/keypoint-detection

ExtReMLapin · 2025-05-15T08:26:47Z

No description provided.

Copilot

Pull Request Overview

This PR fixes the handling of grayscale images in the COCO dataset by converting them to 3-channel RGB before further processing.

Added conversion of grayscale images to RGB by duplicating the single channel.
Ensured that images with an alpha channel are reduced to RGB.

tlpss · 2025-05-15T13:00:38Z

@ExtReMLapin thanks for your contribution, looks like a useful addition! I'll merge it soon.

ExtReMLapin · 2025-05-15T14:09:41Z

🖖🏻

A pleasure.
I also have more changes in staging for annother PR which adds max_image_size param
I'm training it on forensic images that have different resolutions and often high ones which causes :

OOM during training (because of big resolution)
Error during validation because of torch.stack trying to stack up different sizes.

tlpss · 2025-05-15T14:43:33Z

Hi @ExtReMLapin

Sounds like an interesting project.

I consider these steps part of the preprocessing to reduce the burden on the ML codebase (can't support everything in the training loops) and to increase data loading speeds (loading a huge image from disk and then resizing it can bottleneck the GPU because it has to wait on the CPU, which is not desirable).

I will probably not accept a PR that does image resizing in the dataloader (as a separation of concerns).

You should consider resizing the images upfront into a separate dataset and only then training a detector on them.

I have some code for this here if you are interested.

tlpss · 2025-05-16T15:28:25Z

@ExtReMLapin can you take a look at the CI failures? apparently one of the tests was broken by an update in torch ,but the fix should be straightforward.

Btw, I'm on a conference next week so will take some time for me to get back to you! But I do appreciate the PRs 🙂

ExtReMLapin · 2025-05-16T16:05:28Z

No worry with the delay.

To be frank i've been working on this forensic minutiae detector for two years and you have no idea how sometimes it's a pain in the ass to :

set up the whole repository env
transform your dataset
discover their undocumented training examples are not working

here it's just working with wandb integration, few issues with DDP but it's fine tbf

Coco dataset , fix for grayscales images, convert them to RGB

27def0d

tlpss requested a review from Copilot May 15, 2025 12:59

Copilot AI reviewed May 15, 2025

View reviewed changes

Comment thread keypoint_detection/data/coco_dataset.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Coco dataset , fix for grayscales images, convert them to RGB#45

Coco dataset , fix for grayscales images, convert them to RGB#45
ExtReMLapin wants to merge 1 commit intotlpss:mainfrom
ExtReMLapin:patch-1

ExtReMLapin commented May 15, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

tlpss commented May 15, 2025

Uh oh!

ExtReMLapin commented May 15, 2025 •

edited

Loading

Uh oh!

tlpss commented May 15, 2025

Uh oh!

tlpss commented May 16, 2025

Uh oh!

ExtReMLapin commented May 16, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ExtReMLapin commented May 15, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

tlpss commented May 15, 2025

Uh oh!

ExtReMLapin commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tlpss commented May 15, 2025

Uh oh!

tlpss commented May 16, 2025

Uh oh!

ExtReMLapin commented May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ExtReMLapin commented May 15, 2025 •

edited

Loading

ExtReMLapin commented May 16, 2025 •

edited

Loading