
o7q / scribe


A compact (offline) GUI media transcriber that enables you to search for local content based on its spoken words.

Home Page: https://o7q.github.io/Scribe

License: MIT License

Languages: C# 99.72%, Batchfile 0.28%

Topics: ffmpeg, openai-whisper, speech-to-text, stable-ts, transcriber, whisper

scribe's Introduction

James



  • 🧼 I'm mainly working on MediaDownloader
  • 🤖 The coding languages I'm currently interested in are C++, C#, and Java
  • 🎨 The programs I'm currently interested in are Blender, Unreal Engine, and Houdini
  • 📞 You can reach me through Discord: o7q

· ◡ · 👉 https://o7q.github.io/o7q



scribe's People

Contributors

o7q

Forkers

achyun 5l1v3r1

scribe's Issues

README if you are having issues with the latest version

There is currently a bug involving numpy that stops whisper from working. Replacing the installed numpy with a version below 2.0 fixes it.

To fix this:

  • Navigate to Scribe\engine\base\Scripts and open a command window in this location.
  • Type activate
  • Type pip uninstall numpy
  • Type pip install "numpy<2.0"

That's it!
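
To confirm that the downgrade took effect, the installed numpy version can be checked from the same activated environment. The following is a minimal sketch, not part of Scribe itself: the function names `is_numpy_2_or_newer` and `installed_numpy_version` are hypothetical helpers, and the script assumes it is run with the Python interpreter from Scribe\engine\base after typing activate.

```python
from importlib import metadata
from typing import Optional


def is_numpy_2_or_newer(version: str) -> bool:
    """Return True if the version string has a major version of 2 or higher."""
    major = int(version.split(".")[0])
    return major >= 2


def installed_numpy_version() -> Optional[str]:
    """Return the installed numpy version, or None if numpy is not installed."""
    try:
        return metadata.version("numpy")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_numpy_version()
    if version is None:
        print("numpy is not installed in this environment")
    elif is_numpy_2_or_newer(version):
        # Hypothetical message: this is the case the steps above are meant to fix.
        print(f"numpy {version} found; run: pip install \"numpy<2.0\"")
    else:
        print(f"numpy {version} found; no downgrade needed")
```

If the script still reports a 2.x version, the pip commands were most likely run outside the activated environment.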

the system cannot find the path specified

This would be brilliant, thanks.

However, I get "The system cannot find the path specified" when the txt files should be written, and no txt files are produced.
x.sctemp files are written to the storage directory, but as far as I can tell they are empty.
I am on Windows, with Python installed according to the specs.
This seems to be an ffmpeg issue (cf. https://stackoverflow.com/questions/75644012/the-system-cannot-find-the-file-specified-error-when-trying-to-execute-ffmpeg-co)
I also tried running the file with admin rights and on drive C ...

ffmpeg version 6.1-essentials_build-www.gyan.dev Copyright (c) 2000-2023 the FFmpeg developers
built with gcc 12.2.0 (Rev10, Built by MSYS2 project)
configuration: --enable-gpl --enable-version3 --enable-static --pkg-config=pkgconf --disable-w32threads --disable-autodetect --enable-fontconfig --enable-iconv --enable-gnutls --enable-libxml2 --enable-gmp --enable-bzlib --enable-lzma --enable-zlib --enable-libsrt --enable-libssh --enable-libzmq --enable-avisynth --enable-sdl2 --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxvid --enable-libaom --enable-libopenjpeg --enable-libvpx --enable-mediafoundation --enable-libass --enable-libfreetype --enable-libfribidi --enable-libharfbuzz --enable-libvidstab --enable-libvmaf --enable-libzimg --enable-amf --enable-cuda-llvm --enable-cuvid --enable-ffnvcodec --enable-nvdec --enable-nvenc --enable-dxva2 --enable-d3d11va --enable-libvpl --enable-libgme --enable-libopenmpt --enable-libopencore-amrwb --enable-libmp3lame --enable-libtheora --enable-libvo-amrwbenc --enable-libgsm --enable-libopencore-amrnb --enable-libopus --enable-libspeex --enable-libvorbis --enable-librubberband
libavutil 58. 29.100 / 58. 29.100
libavcodec 60. 31.102 / 60. 31.102
libavformat 60. 16.100 / 60. 16.100
libavdevice 60. 3.100 / 60. 3.100
libavfilter 9. 12.100 / 9. 12.100
libswscale 7. 5.100 / 7. 5.100
libswresample 4. 12.100 / 4. 12.100
libpostproc 57. 3.100 / 57. 3.100
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'G:\My Drive\ABSA_Tweets-master\01HAPNHB8BPAXHQ1KM28DYR3SD-1592574611\bennett#TSP048 - Tom Bennett\1.m4a':
Metadata:
major_brand : isom
minor_version : 512
compatible_brands: isomiso2mp41
title : #TSP048 - Tom Bennett
artist : The Two Shot Podcast
date : 2018
encoder : Lavf58.39.101
comment : https://www.youtube.com/watch?v=qkk0CFuc3Ew
Duration: 00:48:02.99, start: 0.000000, bitrate: 126 kb/s
Stream #0:00x1: Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, stereo, fltp, 125 kb/s (default)
Metadata:
handler_name : SoundHandler
vendor_id : [0][0][0][0]
[out#0/mp3 @ 000001f772f600c0] No explicit maps, mapping streams automatically...
[aost#0:0/libmp3lame @ 000001f7714c0300] Created audio stream from input stream 0:0
Stream mapping:
Stream #0:0 -> #0:0 (aac (native) -> mp3 (libmp3lame))
Press [q] to stop, [?] for help
[graph_0_in_0_0 @ 000001f771531540] tb:1/44100 samplefmt:fltp samplerate:44100 chlayout:stereo
Output #0, mp3, to 'Scribe\storage\1.m4a.mp3':
Metadata:
major_brand : isom
minor_version : 512
compatible_brands: isomiso2mp41
TIT2 : #TSP048 - Tom Bennett
TPE1 : The Two Shot Podcast
TDRC : 2018
comment : https://www.youtube.com/watch?v=qkk0CFuc3Ew
TSSE : Lavf60.16.100
Stream #0:0(und): Audio: mp3, 44100 Hz, stereo, fltp, delay 1105, 128 kb/s (default)
Metadata:
handler_name : SoundHandler
vendor_id : [0][0][0][0]
encoder : Lavc60.31.102 libmp3lame
[in#0/mov,mp4,m4a,3gp,3g2,mj2 @ 000001f7714bb2c0] EOF while reading input
[in#0/mov,mp4,m4a,3gp,3g2,mj2 @ 000001f7714bb2c0] Terminating demuxer thread
[aist#0:0/aac @ 000001f772fe6540] Decoder thread received EOF packet
[aist#0:0/aac @ 000001f772fe6540] Decoder returned EOF, finishing
[aist#0:0/aac @ 000001f772fe6540] Terminating decoder thread
No more output streams to write to, finishing.
[out#0/mp3 @ 000001f772f600c0] All streams finished
[out#0/mp3 @ 000001f772f600c0] Terminating muxer thread
[AVIOContext @ 000001f771525ec0] Statistics: 46129605 bytes written, 2 seeks, 177 writeouts
[out#0/mp3 @ 000001f772f600c0] Output file #0 (Scribe\storage\1.m4a.mp3):
[out#0/mp3 @ 000001f772f600c0] Output stream #0:0 (audio): 110365 frames encoded (127139840 samples); 110366 packets muxed (46128483 bytes);
[out#0/mp3 @ 000001f772f600c0] Total: 110366 packets (46128483 bytes) muxed
[out#0/mp3 @ 000001f772f600c0] video:0kB audio:45047kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.001528%
size= 45048kB time=00:48:02.97 bitrate= 128.0kbits/s speed=70.5x
[in#0/mov,mp4,m4a,3gp,3g2,mj2 @ 000001f7714bb2c0] Input file #0 (G:\My Drive\ABSA_Tweets-master\01HAPNHB8BPAXHQ1KM28DYR3SD-1592574611\bennett#TSP048 - Tom Bennett\1.m4a):
[in#0/mov,mp4,m4a,3gp,3g2,mj2 @ 000001f7714bb2c0] Input stream #0:0 (audio): 124160 packets read (45258713 bytes); 124160 frames decoded; 0 decode errors (127139840 samples);
[in#0/mov,mp4,m4a,3gp,3g2,mj2 @ 000001f7714bb2c0] Total: 124160 packets (45258713 bytes) demuxed
[AVIOContext @ 000001f7714c4f80] Statistics: 45816060 bytes read, 2 seeks
The system cannot find the path specified.
Could Not Find C:\Users\gnmarten\Downloads\Scribe\storage\1.m4a.txt
Press any key to continue . . .
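
In the log above, ffmpeg finishes the conversion and writes to the relative path 'Scribe\storage\1.m4a.mp3', while the missing txt file is looked up under C:\Users\gnmarten\Downloads\Scribe\storage. One way to narrow down a "cannot find the path" error is to check which part of a path is the first that does not exist. The following is a minimal sketch under that assumption, not a confirmed diagnosis of this issue; `first_missing_part` is a hypothetical helper and not part of Scribe.

```python
from pathlib import Path
from typing import Optional


def first_missing_part(path: str) -> Optional[Path]:
    """Return the shallowest component of `path` that does not exist.

    Returns None if the full path exists.
    """
    target = Path(path)
    # Walk from the outermost ancestor down to the path itself.
    for candidate in [*reversed(target.parents), target]:
        if not candidate.exists():
            return candidate
    return None


if __name__ == "__main__":
    # Paths taken from the log above; results depend on the machine
    # and on the current working directory.
    for p in (
        r"Scribe\storage\1.m4a.mp3",
        r"C:\Users\gnmarten\Downloads\Scribe\storage\1.m4a.txt",
    ):
        print(p, "->", first_missing_part(p))
```

Running it from the same working directory that Scribe uses shows whether the relative storage path and the absolute one resolve to the same folder.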
