Comments (6)
Hi @darkdevahm, does your machine have at least 2 CPU cores available when running the training script?
If the error persists, please try setting --nworker
argument to 0
, which turns off multi-process data loading, but could potentially solve the problem.
from coperception.
Thanks for your reply. Yes I'm using 2 CPUs with 22GB memory and Tesla V100-SXM3-32GB, so on the hardware side, I think it should be enough.
I have tried today to set the --nworkers
to 0
but the issue still persists. It trains for a while (till ~33% of epoch 1), and then throws the error.
Which Pytorch version did you use?
from coperception.
Mine was 1.11.0
from coperception.
I have tried with pytorch 1.11.0
, and the same error still exists.
from coperception.
That's kind of strange...
We tested our code on Ubuntu and RHEL machines, and we also trained the models using V100 before, but we haven't encountered this issue.
I think it is some problem with PyTorch itself, but I searched online and didn't find any definite explanation or solution.
I would recommend trying to test using another machine if the error persists.
from coperception.
I also encountered the same problem. Have you solved it? @darkdevahm
from coperception.
Related Issues (20)
- Code Questions for the loss HOT 3
- question for trainning result HOT 9
- Questions for DiscoNet training on V2X-Sim-seg HOT 2
- Not able to download checkpoints HOT 2
- Some question about tracking dataset HOT 5
- Issue with Downloading the complete Dataset HOT 2
- Change the Perception Range HOT 2
- error in : make train_disco HOT 7
- Question about downloading dataset HOT 3
- Disco Test result query HOT 2
- Question about Downloading cam_int.zip HOT 5
- Question about V2VNet HOT 2
- DiscoNet test result : Error in IoU 0.5 & 0.7 reults HOT 1
- Plotting edge weights HOT 1
- About SyncNet HOT 3
- issue to train the tracker HOT 2
- question about data format HOT 2
- Question about dataset preprocessing HOT 5
- Question about dataset preprocessing HOT 5
- Question about Tracking HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from coperception.