Comments (13)

benjaminrwilson avatar benjaminrwilson commented on June 15, 2024

Hi @tom-bu,

Thanks for your interest!

  1. Have you tried increasing the open file limit? Additionally, does changing the numworkers parameter resolve the issue?

  2. I think the second command you listed is missing --no-sign-request. Does this resolve the issue: SHARD_DIR={DESIRED PATH FOR TAR.GZ files} s5cmd --no-sign-request cp s3://argoai-argoverse/av2/tars/tbv/*.tar.gz ${SHARD_DIR}?
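As a rough illustration of the first suggestion, the open file limit can be inspected and raised from inside a Python process with the standard `resource` module (Unix only). This is a minimal sketch: the helper name `raise_open_file_limit` and the default target of 4096 are assumptions made for the example, and it is not confirmed in this thread that the limit was the cause of the failure.

```python
import resource


def raise_open_file_limit(target: int = 4096) -> int:
    """Raise the soft open-file limit toward `target` and return the soft limit in effect.

    The soft limit is never lowered and never pushed past the hard limit.
    `target` is an illustrative value, not a recommendation from the thread.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    unlimited = resource.RLIM_INFINITY

    # Nothing to do if the current soft limit already satisfies the request.
    if soft == unlimited or soft >= target:
        return soft

    # An unprivileged process may only raise the soft limit up to the hard limit.
    new_soft = target if hard == unlimited else min(target, hard)
    resource.setrlimit(resource.RLIMIT_NOFILE, (new_soft, hard))
    return new_soft
```

The change only applies to the current process and its children, so it would need to run before any data-loading workers are started.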

from av2-api.

johnwlambert avatar johnwlambert commented on June 15, 2024

Hey @tom-bu, thanks for trying out TbV. If you're still having issues with s5cmd after trying the fixes that Ben suggested, you can download the 21 tar files directly from the links listed here:

wget https://s3.amazonaws.com/argoai-argoverse/av2/tars/tbv/TbV_v1.0_shard0.tar.gz
wget https://s3.amazonaws.com/argoai-argoverse/av2/tars/tbv/TbV_v1.0_shard1.tar.gz
...
wget https://s3.amazonaws.com/argoai-argoverse/av2/tars/tbv/TbV_v1.0_shard20.tar.gz

The TbV dataset is available in two identical formats, for the convenience of users. You can either download the 21 tar.gz files directly and then use the untar_tbv.py script, or pull down all of the files in their extracted form. Depending upon your connection, one may be much faster than the other (there are almost 8 million image files in the extracted format, for example).
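The 21 shard links above follow a single pattern (shard0 through shard20), so they can be generated rather than typed out. The sketch below builds the URLs and shows one possible way to fetch them with the standard library; the function names `shard_urls` and `download_shards` are hypothetical, chosen only for this example, and the download is sequential with no retry or resume logic.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Base URL and shard count are taken from the wget commands listed above.
BASE_URL = "https://s3.amazonaws.com/argoai-argoverse/av2/tars/tbv"
NUM_SHARDS = 21


def shard_urls(base_url: str = BASE_URL, num_shards: int = NUM_SHARDS) -> list:
    """Return the URLs for shard0 .. shard{num_shards - 1}."""
    return [f"{base_url}/TbV_v1.0_shard{i}.tar.gz" for i in range(num_shards)]


def download_shards(dest_dir: str) -> None:
    """Download every shard into dest_dir, skipping files that already exist.

    Hypothetical helper: a file left partially written by an interrupted
    run would be skipped, so delete incomplete files before re-running.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    for url in shard_urls():
        target = dest / url.rsplit("/", 1)[-1]
        if not target.exists():
            urlretrieve(url, str(target))
```

Calling `download_shards("/path/to/shards")` would then fetch the archives one by one, after which the untar_tbv.py script mentioned above can be used to extract them.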

tom-bu avatar tom-bu commented on June 15, 2024

Thanks for the feedback! My s5cmd version doesn't seem to have a sync method to continue where I left off with option 1, so I'm currently trying option 2 with the tar files. I realized for option 2 I wasn't waiting long enough to download, but I believe the --no-sign-request is necessary.

tom-bu avatar tom-bu commented on June 15, 2024

Hi @johnwlambert, thanks for the info. Another question I had about the TbV dataset is where can I find the labels for change? Is there a file that indicates different log pairs and what has changed?

Thanks!

senselessdev1 avatar senselessdev1 commented on June 15, 2024

Hi @tom-bu, we'll upload more information about the train/val/test splits for TbV in the next day or two.

tom-bu avatar tom-bu commented on June 15, 2024

Thanks for the update @johnlambert-argo. Look forward to it!

senselessdev1 avatar senselessdev1 commented on June 15, 2024

@tom-bu I've provided a clustering of logs by spatial location in this PR: #26.

A few things to note:

  • Each log within a cluster shares some significant visual overlap with other logs within its cluster.
  • These are not necessarily before/after pairs. In some cases, all logs in a cluster may be "after" a change.
  • Each cluster has at least one log in the val or test set.
  • Logs of each cluster are provided in chronological order.

tom-bu avatar tom-bu commented on June 15, 2024

@johnlambert-argo great, thanks! Are all of the logs clustered? I believe there are ~1000 logs? Also, will labels be released for which logs include change/no change?

johnwlambert avatar johnwlambert commented on June 15, 2024

Hi @tom-bu, no, this is only a specific subset of logs, where some log in the cluster had a real-world change.

You could cluster all the logs spatially though, using their poses.
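One way to picture that suggestion is to reduce each log to a single 2D point (for example, the mean of its pose translations) and group logs whose points lie close together. The sketch below is an assumption-laden illustration rather than the method used for the clusters in the PR: the function name `cluster_logs`, the 50 m threshold, and the centroid-per-log simplification are all hypothetical, and it assumes every centroid is expressed in one shared coordinate frame.

```python
from __future__ import annotations

from itertools import combinations
from math import hypot


def cluster_logs(
    log_centroids: dict[str, tuple[float, float]],
    max_dist_m: float = 50.0,
) -> list[list[str]]:
    """Group logs whose centroids are within max_dist_m of each other.

    Uses single-linkage grouping via union-find: two logs share a cluster
    if a chain of pairwise-close logs connects them.
    """
    parent = {log_id: log_id for log_id in log_centroids}

    def find(x: str) -> str:
        # Follow parent links to the cluster root, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge every pair of logs that are closer than the threshold.
    for a, b in combinations(log_centroids, 2):
        (ax, ay), (bx, by) = log_centroids[a], log_centroids[b]
        if hypot(ax - bx, ay - by) <= max_dist_m:
            parent[find(a)] = find(b)

    clusters: dict[str, list[str]] = {}
    for log_id in log_centroids:
        clusters.setdefault(find(log_id), []).append(log_id)
    return sorted(sorted(members) for members in clusters.values())
```

The pairwise comparison is quadratic in the number of logs, which is manageable for roughly a thousand logs; a finer-grained variant could compare full trajectories instead of centroids.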

Labels for the val set will be released, but labels for the test set will not be (they'll be used for an online leaderboard, which you will be welcome to submit to). We'll release those val set annotations in probably 1-3 days.

tom-bu avatar tom-bu commented on June 15, 2024

Hi @johnlambert-argo,

I just realized that this map change dataset doesn't necessarily have before and after sensor data as shown in the image here. So it seems we're just checking if the corresponding vector map is up-to-date or not?

And I wanted to verify that the training/validation sets have no changes. Therefore, all we need to know is if a log is in the training/validation set to know the label?

Thanks,

Tom

benjaminrwilson avatar benjaminrwilson commented on June 15, 2024

Hi @tom-bu, were you able to get your questions answered?

tom-bu avatar tom-bu commented on June 15, 2024

Have the data splits been released yet? I think that's the only thing I'm waiting on.

benjaminrwilson avatar benjaminrwilson commented on June 15, 2024

@tom-bu These are now available here: https://github.com/argoai/av2-api/blob/main/src/av2/datasets/tbv/splits.py

Please reach out if you have any other questions!
