GithubHelp home page GithubHelp logo

scalevln's People

Contributors

wz0919 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

scalevln's Issues

Scripts for extrating features

Hello @wz0919 @YicongHong @jialuli-luka , I want to express my gratitude for your outstanding contributions to the field of VLN.
Could you please release the script for extracting features for images as you have used additional environment HM3D, Gibson..
so could you please tell in detail how you have extracted features.

Thank you so much for your time.

Feature extraction script

Hey, thank you for the amazing work. Can you pls release the script to extract the below features:

  • clip_vit-h14_mp3d_hm3d_gibson.hdf5
  • clip_vit-b16_mp3d_hm3d_gibson.hdf5

Thank You

Release of EnvDrop Speaker

Hi, thanks for sharing such a great work!
I am wondering if it is possible to share the weights of the Speaker to generate navigational instructions in ScaleVLN.
I followed the paper to train an EnvDrop Speaker with the clip feature provided on the R2R dataset. However, when generating the instructions for HM3D environments with the provided feature, the results are much worse than the instructions in ScaleVLN.
Could I know what the problem is or is there any plan to release your trained speaker?
Thanks!

How did you install Matterport3DSimulator?

Hi, thanks for your interesting work!

I just wanted to ask how you installed Matterport3DSimulator. I think the command you give is for local installation without using docker (correct me if I am wrong). Is there a convenient way to use the docker version? Because I successfully installed Matterport3DSimulator with docker, but had difficulties installing the dependencies when I attempted to do it locally (because many of the packages they used are out of date).

Thanks for your time.

The code, data, and trained models for other downstream tasks

I am very grateful for your research on VLN. Could you please release or share the code, data, and trained models for other downstream tasks, such as REVERIE, R4R, and R2R-CE? Your work could greatly benefit my ongoing projects. Thank you so much for your time.

Depth images

Hi, @wz0919
Do you have a plan to release the depth images or the depth features?

In the "Appendices B.3.Effect of Depth Modality" of the paper, I see you compared RGB data and RGBD data.

R2r test failed

Hi, when I upload the test file of r2r to the leaderboard, I fail to test the results, it display ' from df045272aeba414dbefac729c49d92f5 to f45a8a43423e45788bde4e50d4ec1e2e but the navigation graph contains no edge between these viewpoints ' . Did you meet this problem. Looking forward to your replay.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.