GithubHelp home page GithubHelp logo

Comments (4)

YUZHIWANG-bug avatar YUZHIWANG-bug commented on June 11, 2024

补充:我使用了您提供的如下的方法转化了音频,但是还是出现上述错误
image

from asrt_speechrecognition.

nl8590687 avatar nl8590687 commented on June 11, 2024

很明显,这是音频文件的时间长度过长导致的,可以参考ASRT项目文档上所述的内容,一条语音数据的最长时间长度当前限制为不能超过16秒,超过的话很容易导致模型的数据尺寸过大进而引发Memory不足的问题,尤其是在使用不太先进的GPU运行的时候。如果存在较长时间的音频,首先应当切割为一段段比较短的音频片段。

from asrt_speechrecognition.

YUZHIWANG-bug avatar YUZHIWANG-bug commented on June 11, 2024

作者您好,我按照您说的,切割了一段15秒的音频,转成了wav格式,但是还是显示之前的错误,甚至第一个数据更大了,很是奇怪,我将再试试缩短,感谢您的回复,谢谢!
image

from asrt_speechrecognition.

YUZHIWANG-bug avatar YUZHIWANG-bug commented on June 11, 2024

很明显,这是音频文件的时间长度过长导致的,可以参考ASRT项目文档上所述的内容,一条语音数据的最长时间长度当前限制为不能超过16秒,超过的话很容易导致模型的数据尺寸过大进而引发Memory不足的问题,尤其是在使用不太先进的GPU运行的时候。如果存在较长时间的音频,首先应当切割为一段段比较短的音频片段。

您好,我又尝试了一个8s的视频,我的转换过程如下:
image
最终得到一个wav文件,但是送入预测时,还是显示一下错误:
image
如果是时长问题的话,8s应该满足条件了,是不是我视频转音频的处理过程有误,还请您指教一下,感谢!

from asrt_speechrecognition.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.