Comments (5)
python2 tools/train_net.py \
    --cfg configs/getting_started/tutorial_8gpu_e2e_faster_rcnn_R-50-FPN.yaml \
    OUTPUT_DIR /tmp/detectron-output
Problem: GPU utilization drops to zero, while the CPU is using a lot of resources.
from detectron.
terminate called after throwing an instance of 'caffe2::EnforceNotMet'
what(): [enforce fail at context_gpu.h:230] error == cudaSuccess. 4 vs 0. Error at: /home/pgao/caffe2/caffe2/caffe2/core/context_gpu.h:230: unspecified launch failure Error from operator:
input: "gpu_0/res5_0_branch2c_w_grad" output: "gpu_2/res5_0_branch2c_w_grad" name: "" type: "Copy" device_option { device_type: 1 cuda_gpu_id: 2 }
terminate called recursively
*** Aborted at 1516782983 (unix time) try "date -d @1516782983" if you are using GNU date ***
terminate called recursively
from detectron.
Same problem here.
from detectron.
@gaopeng-eugene, @terrychenism: can you try switching to the NCCL implementation of AllReduce to see if that resolves the problem? Instructions for building Caffe2 with NCCL support and enabling NCCL in Detectron can be found in #32.
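For readers trying this, here is a minimal sketch of how the training command from the original post might look with NCCL switched on. It assumes that `USE_NCCL` is the Detectron config key referred to in #32 and that it can be overridden on the command line as a `KEY VALUE` pair, the same way `OUTPUT_DIR` is above. It also assumes Caffe2 was built with NCCL support. `build_train_command` is a hypothetical helper written for illustration (run here under Python 3), not part of Detectron.

```python
def build_train_command(cfg_path, output_dir, use_nccl=True):
    """Assemble the train_net.py command line from the original post.

    Hypothetical helper: it only builds the argument list, it does not
    launch training. USE_NCCL is assumed to be the relevant config key.
    """
    cmd = [
        "python2", "tools/train_net.py",
        "--cfg", cfg_path,
        "OUTPUT_DIR", output_dir,
    ]
    if use_nccl:
        # Config overrides are appended as KEY VALUE pairs after --cfg.
        cmd += ["USE_NCCL", "True"]
    return cmd


cmd = build_train_command(
    "configs/getting_started/tutorial_8gpu_e2e_faster_rcnn_R-50-FPN.yaml",
    "/tmp/detectron-output",
)
# Print the command so it can be copied into a shell.
print(" ".join(cmd))
```

If the crash goes away with the override in place, that points at the default AllReduce path rather than the model or the data.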
from detectron.
Fixed it by using NCCL.
from detectron.