patui / nosub Goto Github PK

C++ Subtitle Editor (全新C++字幕软件，https://www.nosub.cn)

C++ 100.00%

nosub's Introduction

Nosub Official Website: https://www.nosub.cn

繁体中文Readme

Developer's vision: to make subtitling really easy for newbies;

Thoughts about the new version:

The new version of V2 has begun to subtract, focusing entirely on subtitle editing software that for the ultimate in subtitle editing and tweaking efficiency;

Improvements of V2 relative to V1 version:

Build cross-platform programs based on Cmake, easy to maintain and refactor;
Developed based on QT6.5 and C++ 20;
Rewrite the video player (the new version is based on libmpv), support hardware decoding and variable speed playback, greatly improve SEEK speed and compatibility;
Improve subtitle editor, support subtitle find and replace function;
Optimize the interaction logic of subtitle editor to improve the efficiency of subtitle editing;
Dramatically improve subtitle loading speed (parsing speed);
Cut down most of the non-core businesses, such as speech recognition, video compression, viewing video information, exporting MP3, etc.;
Simplify UI interaction logic;

(i) Nosub's supported platforms:

[ x ] Windows
MacOS (not supported yet, please be patient)
Linux (not supported at the moment, please be patient)

(ii) Core components of Nosub:

[ x] Video Player
Timeline (waveform charts)
Subtitle editor; (subtitle cutting, merging, deleting, modifying, timing)

(c) Nosub V2 version third-party library:

QT cross-platform GUI library ; (QT is a comprehensive C++ library , not just GUI)
MPV video player;
FFmpeg audio and video decoding library ;
Uchardet text encoding detection library;
Iconv text encoding conversion library.
SDL2 multimedia rendering library;
SDL2-ttf subtitle rendering library.

nosub's People

Contributors

Stargazers

Watchers

nosub's Issues

如何导出软件字幕

Hello Patui,

你好，我想问如何导出capcut （剪映海外版）手机软件的外语的字幕，比如日语韩语字幕。因为之前有使用SpeechJsonParser插件而可以顺利导出剪映的字幕。因此想问你要如何在插件内设定capcut以便在在emulator里面导出字幕，不知你可以协助？谢谢

新特性需求：增加快捷键

快捷键分几个层次，建议考虑，谢谢。
L1: 针对当前功能定义一下快捷键，希望增加上一句，下一句的挑战快捷键；
L2: 快捷键可自定义
L3: 可自定义类似vim的单键快捷键，j/k跳转到上/下句，i进入当前句子字幕编辑

Nosub Server语音任务失败客户端一直等待查询中。

后端语音识别引擎两个问题：
1.腾讯语音任务识别，没有标注任务识别的原因；
2.客户端，任务失败后，还一直显示查询中；

查询asr_task表中的任务：

"id": 876,
"task_id": "1753459270"
"asr_task_id": "776521430199424",

希望加入Automatic Forced Alignment（音文对齐）和Spleeter（音轨分离）

背景：
目前ASR效果还可以（提前Spleeter不知到能不能提高识别率）。
但是断句太烂，音频识别出文本后，可以考虑手动/只能断句，最后进行AFA（Automatic Forced Alignment）。

看了音文强制对齐，识别率不低啊，感觉可以语音转文字，支持的语言也多
即使是影视素材，英文识别率也挺高的，就有个思路，不知道大佬有没有兴趣：
1、语音转文本：Montreal Forced Aligner （可以用其他转写）
2、英文智能标点：http://bark.phon.ioc.ee/punctuator （中文没了解）
3、手动/半自动断句（.!?等），部分修正（"i'""i "等）
4、带时间戳音文对齐：Montreal Forced Aligner、aeneas、YouTube（添加字幕有自动对齐）

音文对齐工具
1、Montreal Forced Aligner（支持挺多语言的）：
https://zhuanlan.zhihu.com/p/86657478

效果：
https://www.youtube.com/watch?v=LgrX5gNgxx0&ab_channel=MahdiChtourou
https://www.youtube.com/watch?v=VONAIXelJYg&ab_channel=AdvancedSkeleton
https://www.youtube.com/watch?v=OLXrlcnndBs&ab_channel=YutingHsueh

2、aeneas（多语言支持，有free Webapp）：
https://www.readbeyond.it/aeneas/
使用感受：声音干净的素材对得很齐（如Audiobook，讲解等），影视在干净的地方还很准，嘈杂处对得很糟糕。
配合Spleeter/iZotope RX.8，人声分离后，准度会提高很多。
经过测试，Spleeter有600s持续运行时间限制，增加时长容易引起内存溢出，16G内存能跑800s左右。
SpleeterGui，傻瓜式安装，支持批量拖入，最高能跑700s左右。
长音频还是用iZotope RX.8吧。

新特性：增加导出当前语句的功能

需求说明：
NoSub在断句方面的速度很快，希望能增加一个导出当前句子对应的视频的功能。这样方便针对复杂句子反复听读，NoSub也可以当外语学习软件了。
个人看ffmpeg有个命令行，可以比较容易实现。建议放到右侧列表右键菜单或者底部波形图右键菜单中。
希望能增加此特性。谢谢。

语音识别上传文件报错

视频：RC01.mp4
识别引擎：Nosub后端服务语言识别；

<html>
<head><title>413 Request Entity Too Large</title></head>
<body>
<center><h1>413 Request Entity Too Large</h1></center>
<hr><center>nginx/1.20.1</center>
</body>
</html>

訊飛语音\訊飛语音引擎

Hello,

已在configcore改了訊飛

"AudioFileASREngineGroup": {
"IFlyAudioFileASREngine": {
"EngineGroup": [
{
"Name": "Wong xXXXXXX",
"APPID": "ldjflkjasd",
"APISecret": "44391aXXXXXXXXXXXXXXXXXX",
"APIKEY": " eb861XXXXXXXXXXXXXXXX",
"Language": "zh"

但走不下去.

感恩回覆....

Eric Wong

人工字幕有听不清的地方做了标注。

视频压制失败的问题分析；

FFmpeg Log日志：
`[Parsed_ass_0 @ 000002580dd21b00] ass_read_file(C:\Users\濮滃浗鍚沑AppData\Local\Temp\2021_03_31_09_53_47_941.ass): fopen failed
[Parsed_ass_0 @ 000002580dd21b00] Could not create a libass track when reading file 'C:\Users\濮滃浗鍚沑AppData\Local\Temp\2021_03_31_09_53_47_941.ass'
[AVFilterGraph @ 000002581039a7c0] Error initializing filter 'ass' with args ''C:\Users\濮滃浗鍚沑AppData\Local\Temp\2021_03_31_09_53_47_941.ass''

Error reinitializing filters!
Failed to inject frame into filter network: Invalid argument
Error while processing the decoded data for stream #0:0

Conversion failed!`

通过分析：
应该是ASS路径和编码格式导致的问题；

可以参考这样的压缩格式：
ffmpeg -i D:/TempDemo/AC99.mp4 -vcodec libx264 -filter_complex subtitles='C\:/ProgramData/AC99.ass:charenc=UTF-8' -y D:/TempDemo/Test_Video/UP_video/AC99_H264.mp4

上面省略了部分压制参数；

Windows下使用的是反斜杠"\"。

Linux下使用的是正斜杠"/"

另外注意ASS路径在单引号之间，而且盘符的冒号前面要加转移符\，非UTF-8编号，通过charenc=UTF-8强制转换为UTF-8格式；