Comments (5)
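1. That should not happen; my code reads the data in random order every time.
2. I see you set batch_size to 112. Why so large? 32 is about as large as you need. Going larger does not help during training, because the resulting gradient differs little from the gradient computed on 32 samples.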
from asrt_speechrecognition.
Hello, how can one determine that the gradient computed from 32 samples differs little from the gradient computed from more than 32 samples?
@songmianmian
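32 is the standard batch size commonly used by researchers in image processing and computer vision. Several articles and course videos explain why mini-batch gradient descent is used instead of batch gradient descent over the whole dataset at once, and they also discuss which batch size works best in different fields. You may want to take a look at them.
In fact, speech recognition may not even need 32, but since the method I use is borrowed from computer vision, I recommend 32 here as well.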
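One way to check the gradient question empirically is to compare mini-batch gradients against the gradient over the whole dataset for several batch sizes. The following is only a minimal sketch on a toy linear regression problem, not the ASRT model: the data, the function names, and the choice of sizes 8, 32, and 112 are all assumptions made for illustration. For randomly drawn samples, the noise in a mini-batch average typically shrinks roughly with the square root of the batch size, so a larger batch reduces the error, but with diminishing returns.

```python
import math
import random


def per_sample_gradients(w, data):
    """Gradient of 0.5 * (w.x - y)^2 with respect to w, one vector per sample."""
    grads = []
    for x, y in data:
        err = sum(wi * xi for wi, xi in zip(w, x)) - y
        grads.append([err * xi for xi in x])
    return grads


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def relative_error(batch_grad, full_grad):
    """Euclidean distance to the full gradient, divided by the full gradient norm."""
    diff = math.sqrt(sum((b - f) ** 2 for b, f in zip(batch_grad, full_grad)))
    norm = math.sqrt(sum(f * f for f in full_grad))
    return diff / norm


def average_gradient_error(grads, batch_size, trials, rng):
    """Mean relative error of random mini-batch gradients versus the full gradient."""
    full = mean_vector(grads)
    total = 0.0
    for _ in range(trials):
        batch = rng.sample(grads, batch_size)  # random mini-batch, no replacement
        total += relative_error(mean_vector(batch), full)
    return total / trials


# Hypothetical toy dataset: y = true_w . x + noise (an assumption, not ASRT data).
rng = random.Random(0)
dim = 10
true_w = [rng.gauss(0, 1) for _ in range(dim)]
data = []
for _ in range(2000):
    x = [rng.gauss(0, 1) for _ in range(dim)]
    y = sum(wi * xi for wi, xi in zip(true_w, x)) + rng.gauss(0, 0.5)
    data.append((x, y))

w = [0.0] * dim  # untrained weights
grads = per_sample_gradients(w, data)
errors = {b: average_gradient_error(grads, b, 200, rng) for b in (8, 32, 112)}
for b, e in errors.items():
    print(f"batch_size={b:4d}  mean relative gradient error={e:.3f}")
```

Running the same comparison on a real model would require per-batch gradients from that model; whether the remaining difference between 32 and 112 matters in practice depends on the task and the learning rate, which this sketch does not address.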
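Here is an update on my latest progress, for everyone's reference:
I loaded the model that had been trained for 200k steps with batch_size 112 and continued training it with batch_size 32 for one day. It has now trained for 134k steps; the loss has risen from about 16 to about 26, and the error rate has risen from about 20% to 30%-40% and is very unstable.
Judging from these results, batch_size 112 seems to work better than 32.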
How about training from scratch instead?