GithubHelp home page GithubHelp logo

byigitt / transcriptor Goto Github PK

View Code? Open in Web Editor NEW
5.0 1.0 0.0 1.76 MB

create transcripts with youtube links on google colab using whisper ai

Python 91.89% Shell 8.11%
python python3 transcript transcription whisper whisper-ai

transcriptor's Introduction

Transcriptor

This python project lets you create multiple transcripts with youtube links on Google Colab with Whisper AI.

Questions

Why Google Colab?

I used Google Colab because of my internet speed & free GPU usage. It works flawlessly in Turkish language (which I used this to get the transcripts on Google Oyun ve Uygulama Akademisi education videos).

Features

  • Downloads multiple links on the youtube_urls.txt file.
  • Creates transcripts for every mp3 file it downloaded.
  • Automatically deletes mp3 files after creating its transcript.
  • Uses youtube-dl as in nightly mode to remove some bugs from the new latest version.

Dependencies

  • Python 3.x (which Google Colab has)
  • whisper
  • torch

Executing the Program

  • Firstly, create yourself a google colab and change runtime type to make it as a GPU.
  • After that, create a youtube_urls.txt - You can find the example in our repo.
  • After creating youtube_urls.txt, use !git clone https://github.com/byigitt/transcriptor.git to get the source files.
  • You need to install our dependencies main.sh - In order to get it working, you need to do !chmod 755 main.sh and do !./main.sh in order to install everything.
  • After installing everything, program will open itself and do its job, you can download the .txt files and you are good to go!
  • Do not forget to open the tab while it does it job, otherwise your files in colab will be deleted!

For Issues and Questions

Dont feel shy to ask your questions/problems in issues tab! You can also contribute the code in Pull requests tab.

transcriptor's People

Contributors

byigitt avatar

Stargazers

 avatar  avatar Efe Çelik avatar Mikael KARA avatar ali avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.