Comments (3)
Good advice.
Intuitively, I think that sigmoid would affect gradient propagation and computation speed. However, we have not yet run any experiments on this.
If we have time, we will run this experiment.
from van-classification.
Thanks a lot. I'd be glad to hear from you about any further improvements.
Hi, we used your great backbone on our internal detection dataset and found two potential problems:
(1) When we export the model to LibTorch and run inference in FP16, the model output is NaN. The source is the LKA module: its output values overflow because there is no normalization op (FP32 works fine).
(2) Training is unstable: with the same optimizer config, VAN-base works well, but VAN-tiny and VAN-small fail to converge.
FYI
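To make the FP16 overflow in (1) concrete, here is a minimal sketch in plain Python. It is not the VAN or LKA code: `to_fp16`, `fp16_mul`, and the input values 300 and 400 are hypothetical, chosen only to show how an unbounded element-wise product can exceed the half-precision range (largest finite value 65504) and then turn into NaN. The sigmoid variant is one possible way to bound the attention term, and, as noted earlier in this thread, its effect on accuracy has not been tested.

```python
import math
import struct


def to_fp16(x: float) -> float:
    """Round x to IEEE 754 half precision; overflow becomes +/-inf."""
    if math.isnan(x) or math.isinf(x):
        return x
    try:
        # The "e" struct format is binary16, so pack + unpack rounds x to FP16.
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except (OverflowError, struct.error):
        # Python raises instead of saturating; emulate hardware behaviour.
        return math.copysign(math.inf, x)


def fp16_mul(a: float, b: float) -> float:
    """Multiply with both inputs and the result rounded to FP16."""
    return to_fp16(to_fp16(a) * to_fp16(b))


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


# Hypothetical activations: a feature value u and an attention value attn.
u, attn = 300.0, 400.0

# Unbounded gating (u * attn): 120000 exceeds 65504, so FP16 yields inf.
unbounded = fp16_mul(u, attn)

# Any later inf - inf (for example inside a normalization) produces NaN.
follow_up = to_fp16(unbounded - unbounded)

# Bounded gating (u * sigmoid(attn)): the result never exceeds |u|.
bounded = fp16_mul(u, sigmoid(attn))

print(unbounded, follow_up, bounded)
```

In FP32 the same product (120000) is far below the representable maximum, which is consistent with the report that FP32 inference works.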