dash subtitle extractor

Translated from the shaka-player project by xhlove.

output sample

Python port

If the input is a single file, split it into segments with mp4split.exe first.

usage

pip install argparse

usage: python -m pyshaka.main [OPTION]...

A tool that to parse subtitle embedded in DASH stream

optional arguments:
  -h, --help            show this help message and exit
  -debug, --debug       debug is needed
  -type TYPE, --type TYPE
                        subtitle codec, only support wvtt and ttml now
  -timescale TIMESCALE, --timescale TIMESCALE
                        set timescale manually if no init segment
  -init-path INIT_PATH, --init-path INIT_PATH
                        init segment path
  -segments-path SEGMENTS_PATH, --segments-path SEGMENTS_PATH
                        segments folder path
  -segment-time SEGMENT_TIME, --segment-time SEGMENT_TIME
                        single segment duration, usually needed for ttml content, calculation method: d / timescale

e.g.

python -m pyshaka.main --init-path "test/dashvtt_subtitle_WVTT_zh-TW/init.mp4" --segments-path "test/dashvtt_subtitle_WVTT_zh-TW" --type wvtt
python -m pyshaka.main --segments-path "test/ismttml_text_TTML_pol" --segment-time 60 --type ttml
python -m pyshaka.main --segments-path "test/new" --type ttml
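The help text gives the value for --segment-time as d / timescale. A minimal sketch of that arithmetic follows, assuming d is a segment duration expressed in timescale units (for example the d attribute of an S element in the manifest) and timescale is the number of units per second. The function name and the sample numbers are hypothetical and do not come from pyshaka.

```python
def segment_time(d: int, timescale: int) -> float:
    """Return the duration of one segment in seconds, i.e. d / timescale."""
    if timescale <= 0:
        raise ValueError("timescale must be a positive integer")
    return d / timescale


# Hypothetical values: a 60-second segment at 10,000,000 units per second.
print(segment_time(600_000_000, 10_000_000))  # 60.0
```

The result is the number to pass as --segment-time, as in the second example command above.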

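The Python port covers only part of the functionality. The earlier approach was to port the original code so that it runs under Node; if you are interested in how to run it locally with Node, see the section below.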

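Node local port

Reference: porting the subtitle-parsing part of shaka-player to a local program

A demo is complete for now; adaptation is still in progress.

If you modify parser.js yourself, remember to recompile it. For the setup needed before compiling, see the porting document referenced above.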

npx google-closure-compiler --js parser.js --js shaka/**/*.js --js=node_modules/xmldom/**/*.js --js=node_modules/google-closure-library/**/*.js --js=!**/goog/asserts/asserts.js --dependency_mode=PRUNE --entry_point=goog:parser --js_output_file=parser_compiled.js

Usage commands

node parser_compiled.js --init-segment=test/dashvtt_subtitle_WVTT_zh-TW/init.mp4 --segments-path=test/dashvtt_subtitle_WVTT_zh-TW --type=wvtt

node parser_compiled.js --init-segment=test/ttml_test/000.mp4 --segments-path=test/ttml_test --type=ttml

node parser_compiled.js --segments-path=test/ismttml_text_TTML_pol --type=ttml

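Do not use backslashes in path arguments
Each option must be followed by =
--init-segment is the path to the init file; it is not required for TTML
--segments-path is the path to the folder that contains the segment files
--type sets the subtitle type, either wvtt or ttml
--debug prints some debug information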
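The rules above (forward slashes only, every option joined to its value with =, --init-segment optional for TTML) can be applied mechanically when the command is built from a script. The following is a sketch only: build_node_command is a hypothetical helper, not part of this project, and it leaves out --debug because the notes do not say whether that flag takes a value.

```python
from typing import List, Optional


def build_node_command(segments_path: str, sub_type: str,
                       init_segment: Optional[str] = None) -> List[str]:
    """Build the argument list for parser_compiled.js following the notes above."""
    if sub_type not in ("wvtt", "ttml"):
        raise ValueError("type must be wvtt or ttml")

    def forward(path: str) -> str:
        # The parser expects forward slashes, so convert any backslashes.
        return path.replace("\\", "/")

    cmd = ["node", "parser_compiled.js"]
    if init_segment is not None:
        # Optional: the notes say TTML does not require an init segment.
        cmd.append(f"--init-segment={forward(init_segment)}")
    cmd.append(f"--segments-path={forward(segments_path)}")
    cmd.append(f"--type={sub_type}")
    return cmd


print(" ".join(build_node_command("test\\ismttml_text_TTML_pol", "ttml")))
```

The returned list can be passed to subprocess.run as-is, which avoids shell quoting issues with paths.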

To run the test commands, remember to unzip dashvtt_subtitle_WVTT_zh-TW.zip and ttml_test.zip first.
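If no unzip tool is at hand, the archives can be unpacked with Python's standard zipfile module. This is a sketch under assumptions: the archive locations and the destination folder "test" are guesses based on the paths used in the test commands, and extract_test_archives is a hypothetical helper name.

```python
import zipfile
from pathlib import Path


def extract_test_archives(archives, dest="test"):
    """Extract each existing zip archive into dest; return the names extracted."""
    dest_dir = Path(dest)
    dest_dir.mkdir(parents=True, exist_ok=True)
    extracted = []
    for name in archives:
        archive = Path(name)
        if not archive.is_file():
            # Skip archives that are not where we assumed they would be.
            print(f"skipping missing archive: {archive}")
            continue
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest_dir)
        extracted.append(archive.name)
    return extracted


if __name__ == "__main__":
    # Assumed locations; adjust the paths to wherever the archives actually are.
    extract_test_archives(["dashvtt_subtitle_WVTT_zh-TW.zip", "ttml_test.zip"])
```

After extraction, check that the folder names match the paths used in the test commands before running them.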
