Comments (10)

griffintin commented on July 4, 2024

@rod409 I found similar mismatches, and my instance segmentation training results were very bad.
In addition to the problem you pointed out, I found that some instance labels are wrong.

Images:
val/7d128593-0ccfea4c.jpg: background was labeled as truck
7d128593-0ccfea4c

val/a640b39e-f556d329.jpg: background grass/curb was labeled as train
Figure 1

val/a98e2bc9-9e2a0b07.jpg: poles were labeled as car, and some cars were labeled as bus

Figure 2

(Please ignore the colors; I forgot to change the RGB order.)
@fyu
I hope you have also noticed these wrong instance labels.

from bdd100k.

XiaLiPKU commented on July 4, 2024

I found a few images and labels that do not match between the images/10k images for semantic segmentation and the corresponding bitmasks downloaded in labels/sem_seg/masks. The affected files are listed below.

In images path but not in labels:

  • train/3d581db5-2564fb7e.jpg
  • train/52e3fd10-c205dec2.jpg
  • train/781756b0-61e0a182.jpg
  • train/78ac84ba-07bd30c2.jpg
  • val/80a9e37d-e4548ac1.jpg
  • val/9342e334-33d167eb.jpg

In labels path but not in images:

  • train/fee92217-63b3f87f.png
  • train/ff1e4d6d-f4d85cfd.png
  • train/ff3d3536-04986e25.png
  • train/ff3da814-c3463a43.png
  • val/ff55861e-a06b953c.png
  • val/ff7b98c7-3cb964ac.png

I am able to train a model successfully after removing these images but wanted to bring this to your attention.
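
A comparison like the one above can be scripted. The following is a minimal sketch, assuming the images and masks are laid out as `<root>/<split>/<name>.jpg` and `<root>/<split>/<name>.png`; the function names `stems` and `find_mismatches` are hypothetical and not part of the bdd100k toolkit.

```python
from __future__ import annotations

from pathlib import Path


def stems(root: Path, suffix: str) -> set[str]:
    """Collect 'split/name' keys (extension dropped) for files under root."""
    return {
        p.relative_to(root).with_suffix("").as_posix()
        for p in root.rglob(f"*{suffix}")
    }


def find_mismatches(image_root: Path, mask_root: Path) -> tuple[list[str], list[str]]:
    """Return (images without a mask, masks without an image), both sorted."""
    images = stems(image_root, ".jpg")
    masks = stems(mask_root, ".png")
    return sorted(images - masks), sorted(masks - images)
```

For example, `find_mismatches(Path("images/10k"), Path("labels/sem_seg/masks"))` would return the two lists, assuming those are the local directory names. Matching on the path without the extension means a file only counts as paired when both the split and the name agree.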

Thanks.

Thanks for letting us know!
We have also noticed this problem and plan to publish a new version of the image zip in the next few days.

griffintin commented on July 4, 2024

@XiaLiPKU

Could you please take a look at the validation images above with wrong instance segmentation labels?
I would also like to mention that I found similar wrong labels in the training set.
Unfortunately, I am not able to check all 7000 images to give you details.

XiaLiPKU commented on July 4, 2024

Thank you all for pointing this out.
Currently, we have no way to deal with these errors other than finding them and having the labelers re-label them.
This issue is important to us, so we will try to resolve it as quickly as we can.

XiaLiPKU commented on July 4, 2024

As a quick fix for the image/label mismatch, you can use the patch here: patch.tar.gz. I hope it helps.

We are also updating the download files; you should see them today or tomorrow.

pcicales commented on July 4, 2024

Hello, were the download files updated? I downloaded last week.

XiaLiPKU commented on July 4, 2024

Sorry to keep you waiting.
We have published a new version of the image download files.
You can now download them from the website: https://bdd-data.berkeley.edu/
The 100k and 10k images are now split into two separate files.
For more information, see: https://doc.bdd100k.com/download.html#id1

pcicales commented on July 4, 2024

@XiaLiPKU Have the labels been updated as well? For example, I am getting poor accuracy with a Cascade R-CNN on the sem_seg_20 task, perhaps because of the same issue mentioned by @griffintin. In the meantime, could we have a list of the defective annotations so that we can exclude them? This seems like a serious issue, @fyu. Can you please confirm whether these errors exist?

I am also running the ins_seg task, and my code currently shows that only 1696 images have associated annotations. Performance so far (using a 3-stage Cascade R-CNN) is also low. I'm still looking into this and will post back here over the weekend.
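
Until an official list of defective annotations exists, excluding known-bad samples could look like the sketch below. It is only an illustration: the set contains just the three validation images reported earlier in this thread, and the helper name `drop_reported` is hypothetical.

```python
from pathlib import Path

# Validation images reported in this thread as having wrong instance labels.
# This is NOT an official or complete list of defective annotations.
REPORTED_BAD = {
    "7d128593-0ccfea4c",
    "a640b39e-f556d329",
    "a98e2bc9-9e2a0b07",
}


def drop_reported(paths, bad_stems=REPORTED_BAD):
    """Keep only the paths whose file name (without extension) is not in bad_stems."""
    return [p for p in paths if Path(p).stem not in bad_stems]
```

Filtering by file stem lets the same set be applied to both the `.jpg` images and their `.png` masks.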

guifentian commented on July 4, 2024

@XiaLiPKU @fyu

I am trying to use the latest 10k images/labels and the newest code for instance segmentation.
The labels are still wrong, as reported here: #103 (comment)

What should we users do with the current, partially wrong labels? Any advice?
How can we determine the cause of the low AP: the network or the labels?
Or could you provide your results using the current labels on typical models such as Panoptic-FPN, DeepLab, etc.?

thomasehuang commented on July 4, 2024

These issues should already be addressed. Let us know if you still see problems.
