Hello sirs, I am trying to implement your code with my own small dataset. Is t

Sharing the scripts to make hdf5 file for custom dataset about multiview2novelview HOT 4 CLOSED

shaohua0116 commented on July 25, 2024

Sharing the scripts to make hdf5 file for custom dataset

from multiview2novelview.

Comments (4)

shaohua0116 commented on July 25, 2024

I created the datasets with IPython and I'm afraid I don't have the command history now. But you can create a dataset by following the instructions below.

Train and test your own datasets:

* Create a directory

$ mkdir ./datasets/YOUR_DATASET

* Store your data as an h5py file datasets/YOUR_DATASET/data.hy, where each data point contains
  - 'image': has shape [h, w, c], where c is the number of channels (grayscale images: 1, color images: 3)
  - 'pose': represented as a one-hot vector or a vector representing a 6DoF camera pose
* Maintain a list datasets/YOUR_DATASET/id.txt listing the ids of all data points, and split the list into train.txt and test.txt
* Modify trainer.py, including args, data_info, etc.
* Implement a data loader like ./datasets/shapenet_car.py and put it under ./datasets/
* Finally, train and test models:

$ python trainer.py --dataset YOUR_DATASET
$ python evaler.py --dataset YOUR_DATASET
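
The file-creation steps above could be sketched roughly as follows. This is not the original script (which was not kept), only a minimal illustration under some assumptions: each data point is stored as one h5py group keyed by its id, with 'image' and 'pose' datasets inside it. The function name write_dataset, the uint8/float32 dtypes, and the random 80/20 split are hypothetical choices, not something the repository prescribes.

```python
import os

import h5py
import numpy as np


def write_dataset(root, samples, test_fraction=0.2, seed=0):
    """Write data.hy, id.txt, train.txt and test.txt under `root`.

    `samples` maps an id string to a tuple (image, pose), where image has
    shape [h, w, c] and pose is a 1-D vector (one-hot or 6DoF).
    The dtypes and the split ratio below are assumptions for illustration.
    """
    os.makedirs(root, exist_ok=True)
    ids = sorted(samples)

    # One group per data point, holding 'image' and 'pose'.
    with h5py.File(os.path.join(root, "data.hy"), "w") as f:
        for sample_id in ids:
            image, pose = samples[sample_id]
            grp = f.create_group(sample_id)
            grp["image"] = np.asarray(image, dtype=np.uint8)
            grp["pose"] = np.asarray(pose, dtype=np.float32)

    # Shuffle the ids reproducibly and split them into test and train lists.
    order = np.random.default_rng(seed).permutation(len(ids))
    shuffled = [ids[i] for i in order]
    n_test = max(1, int(len(ids) * test_fraction))
    splits = {
        "id.txt": ids,
        "test.txt": shuffled[:n_test],
        "train.txt": shuffled[n_test:],
    }
    for name, lines in splits.items():
        with open(os.path.join(root, name), "w") as fp:
            fp.write("\n".join(lines) + "\n")
    return splits
```

Calling write_dataset("./datasets/YOUR_DATASET", samples) would then produce the four files the instructions mention. The data loader still has to read them in whatever way ./datasets/shapenet_car.py does, so check that file for the exact key names and dtypes it expects.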

Let me know if you still have a problem with your own dataset.

phongnhhn92 commented on July 25, 2024

@shaohua0116 Thanks for your kind explanation! I understand it now. But I have a few questions about things I didn't understand when reading your paper.

When I checked the data inside the h5py files of the KITTI and ShapeNet datasets, I found that each image key in the KITTI dataset has 3 inner keys: image, pose, and pose_matrix. But in the ShapeNet dataset there are only 2 inner keys: image and pose. Can you explain why you need pose_matrix for training?
My second question concerns the KITTI dataset. I can see that pose is a vector representing 6DoF and pose_matrix is a 3x4 matrix. Would you mind telling me how you obtained those values from the KITTI dataset? I checked the guideline at this link https://s3.eu-central-1.amazonaws.com/avg-kitti/devkit_raw_data.zip and it says that I can get poses from the GPS/IMU data. Is that how you got the pose information?
Thank you in advance!

shaohua0116 commented on July 25, 2024

Only the “pose” is used for training and testing. Each 6DoF pose vector in my KITTI dataset consists of a translation vector (x, y, z) and a rotation. Pose vectors can be computed from the 3x4 pose matrix, which is provided in the original KITTI dataset.
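
One way to compute such a vector from a 3x4 [R | t] matrix is sketched below. The reply does not say which rotation parameterization was used, so the roll/pitch/yaw Euler angles here (with R = Rz(yaw) Ry(pitch) Rx(roll)) are an assumption, and pose_matrix_to_6dof is a hypothetical helper name rather than code from the repository.

```python
import numpy as np


def pose_matrix_to_6dof(pose_matrix):
    """Convert a 3x4 [R | t] matrix to (x, y, z, roll, pitch, yaw).

    Assumes R = Rz(yaw) @ Ry(pitch) @ Rx(roll); this convention is an
    illustrative choice, not necessarily the one used by the author.
    Also accepts a flat sequence of 12 values in row-major order.
    """
    m = np.asarray(pose_matrix, dtype=np.float64).reshape(3, 4)
    rotation, translation = m[:, :3], m[:, 3]

    # |cos(pitch)|, recovered from the first column of R.
    cos_pitch = np.hypot(rotation[0, 0], rotation[1, 0])
    pitch = np.arctan2(-rotation[2, 0], cos_pitch)
    if cos_pitch > 1e-6:
        roll = np.arctan2(rotation[2, 1], rotation[2, 2])
        yaw = np.arctan2(rotation[1, 0], rotation[0, 0])
    else:
        # Gimbal lock: roll and yaw are coupled, so fix yaw to zero.
        roll = np.arctan2(-rotation[1, 2], rotation[1, 1])
        yaw = 0.0
    return np.concatenate([translation, [roll, pitch, yaw]])
```

If the rotation should instead be stored as an axis-angle vector or a quaternion, only the part after the translation changes. Whatever convention is chosen has to be applied identically to the training and test data.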

phongnhhn92 commented on July 25, 2024

Thank you for your help! I will try your suggestion.
