
Comments (4)

garrickbrazil commented on July 2, 2024

Hello!

  1. The bbox2D_trunc field is the minimum enclosing box of the projected 3D bounding box, truncated to the image bounds when necessary. We tend to prefer it for training over bbox2D_proj, which can produce very large boxes as z3d approaches 0.

  2. The R_cam matrix is a per-object annotation that rotates an object in camera space around its own origin (hence before any 3D translation to the cuboid, seen here). I'm not sure the concept of world-to-camera is as applicable here, but I see how the name could imply that. Hope that answers your question, or feel free to ask more!

  3. To get the box form (x1, y1, x2, y2) from the 8 projected vertices (an 8 x 3 Tensor), the general process is to take the min/max along each image dimension, similar to:

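# vertices: (8, 3) tensor of projected corners; columns 0 and 1 hold image x and y.
# Each column slice is 1-D, so reduce over all 8 values (min(1) would fail here).
x1 = vertices[:, 0].min()
y1 = vertices[:, 1].min()
x2 = vertices[:, 0].max()
y2 = vertices[:, 1].max()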

from omni3d.

SyanneL commented on July 2, 2024

Thanks so much for your reply! I am also wondering how I could get the truncation?

WangYueFt commented on July 2, 2024

Hey @garrickbrazil, I have a follow-up question regarding bbox2D_trunc and bbox2D_proj. Is bbox2D_trunc obtained by truncating bbox2D_proj to the image bounds? How do you get the minimal enclosed projected 3D bounding box? Thanks!

gkioxari commented on July 2, 2024

Hi @WangYueFt

To get the minimal enclosed 2D bounding box, we use rendering.
We represent the 3D bounding box as a mesh with 8 vertices and 12 triangular faces. We then render the 3D box mesh with Py3D, which renders the silhouette of the mesh, and take the minimum enclosing bounding box of the rendered silhouette. Note that with rendering, the part of the box that lies outside the image plane is not rendered, so the resulting box is truncated, as intended.
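
The last step above, going from a rendered silhouette to a box, can be sketched as follows. This is only an illustration under assumptions: the rendering itself is not shown, the binary `mask` stands in for whatever silhouette image the renderer returns, and `box_from_silhouette` is a hypothetical helper name, not part of Omni3D or of the rendering library.

```python
import numpy as np

def box_from_silhouette(mask):
    """Tight (x1, y1, x2, y2) box around the nonzero pixels of an (H, W) mask.

    Returns None when the mask is empty, i.e. the object is entirely
    outside the image.
    """
    mask = np.asarray(mask) > 0
    ys, xs = np.nonzero(mask)  # row indices are y, column indices are x
    if xs.size == 0:
        return None
    # Add 1 to the max indices so the box covers the full last pixel.
    return int(xs.min()), int(ys.min()), int(xs.max()) + 1, int(ys.max()) + 1

# Stand-in silhouette on a 6 x 8 image: the object runs off the right and
# bottom edges, so only the visible part is present in the mask.
mask = np.zeros((6, 8), dtype=np.uint8)
mask[2:6, 5:8] = 1

trunc_box = box_from_silhouette(mask)
empty_box = box_from_silhouette(np.zeros((6, 8), dtype=np.uint8))
```

Because pixels outside the image never appear in the mask, the resulting box cannot extend past the image bounds, which matches the truncation behaviour described above.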
