Comments (5)
Please follow this: #36 (comment)
from upsnet.
Thank you!
Hello, I am using one GPU, but this error occurred:
ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm).
ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm).
ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm).
ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm).
Traceback (most recent call last):
  File "upsnet/upsnet_end2end_train.py", line 414, in <module>
    upsnet_train()
  File "upsnet/upsnet_end2end_train.py", line 268, in upsnet_train
    data, label, _ = train_iterator.next()
  File "/root/anaconda3/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 330, in next
    idx, batch = self._get_batch()
  File "/root/anaconda3/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 309, in _get_batch
    return self.data_queue.get()
  File "/root/anaconda3/lib/python3.7/multiprocessing/queues.py", line 352, in get
    res = self._reader.recv_bytes()
  File "/root/anaconda3/lib/python3.7/multiprocessing/connection.py", line 216, in recv_bytes
    buf = self._recv_bytes(maxlength)
  File "/root/anaconda3/lib/python3.7/multiprocessing/connection.py", line 407, in _recv_bytes
    buf = self._recv(4)
  File "/root/anaconda3/lib/python3.7/multiprocessing/connection.py", line 379, in _recv
    chunk = read(handle, remaining)
  File "/root/anaconda3/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 227, in handler
    _error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 31613) is killed by signal: Bus error. Details are lost due to multiprocessing. Rerunning with num_workers=0 may give better error trace.
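The error text points at shared memory (shm) and suggests rerunning with num_workers=0. A quick way to see how much shared memory is actually free before picking a worker count might look like the sketch below. It uses only the Python standard library. The helper names (shm_free_bytes, suggest_num_workers) and the 512 MiB-per-worker threshold are assumptions made for illustration; they are not part of upsnet or PyTorch, and the right threshold depends on batch and image size. If this is running inside a Docker container, the shm mount is often small by default and can be raised with the --shm-size option of docker run.

```python
import os
import shutil


def shm_free_bytes(path="/dev/shm"):
    """Return the free bytes on the shared-memory mount, or None if it is missing."""
    if not os.path.isdir(path):
        return None
    return shutil.disk_usage(path).free


def suggest_num_workers(requested, free_bytes, min_bytes_per_worker=512 * 1024**2):
    """Cap the worker count so each worker gets at least min_bytes_per_worker of shm.

    The 512 MiB default is an arbitrary illustrative threshold, not a measured value.
    Returns 0 (load data in the main process) when shm is unavailable.
    """
    if free_bytes is None:
        return 0
    return max(0, min(requested, free_bytes // min_bytes_per_worker))


if __name__ == "__main__":
    free = shm_free_bytes()
    print("free shm bytes:", free)
    print("suggested num_workers:", suggest_num_workers(4, free))
```

If the suggested value comes out as 0, that matches the advice in the error message: data loading runs in the main process, which avoids shared memory between workers and also gives a clearer traceback.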
Can I train on a single GPU with 12 GB of memory? Which parts of the code need to change?
Thank you very much!
Hello, can you run the code successfully on a single GPU?
Thank you for the great work. What if I use Horovod on a single-GPU machine? I tried it and found it faster than running without Horovod. Does this cause any problems? Also, how can I run multiple Horovod workers to mimic multiple GPUs on a single-GPU machine? Thanks a lot; I look forward to your reply.