GithubHelp home page GithubHelp logo

aguerrerolopez / transcriptioneditor Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 36 KB

Manual Transcription Editor: A tool for manually correcting automatic transcriptions

License: MIT License

Python 39.38% Shell 9.41% HTML 44.74% Batchfile 6.47%

transcriptioneditor's Introduction

๐Ÿ“œ Manual Transcription Editor

This is a simple yet useful app that helps you manually correct automatic transcriptions made by Whisper. The app allows you to play audio files and edit their corresponding transcriptions.

โœจ Features

  • ๐Ÿ”Š Play audio files directly from the browser.
  • โœ๏ธ Edit and save transcriptions.
  • ๐Ÿ“ Automatically saves the old version of the transcription with _oldversion.txt.

๐Ÿ›  Prerequisites

  • ๐Ÿ Python 3.x

๐Ÿ’พ Installation

  1. Clone this repository to your local machine.
  2. Place your audio dataset in the audios folder. For each .wav file, the transcription MUST have the same name but with a .txt extension.

Steps

  1. Run the Installation Script

    This script will create a virtual environment, install the necessary dependencies, and run the Flask application.

    ./install.sh
  2. Run the Application

    Use the following command to run the application if it's not already running from the installation script.

    ./run.sh
  3. Access the Application

    Open your web browser and go to http://127.0.0.1:5000/.

๐Ÿ“‚ Directory Structure

Your project directory should look like this:

project/
โ”œโ”€โ”€ app.py
โ”œโ”€โ”€ requirements.txt
โ”œโ”€โ”€ install.sh
โ”œโ”€โ”€ run.sh
โ”œโ”€โ”€ README.md
โ”œโ”€โ”€ static/
โ”‚ โ””โ”€โ”€ play.png # (Optional: sound icon if needed)
โ”œโ”€โ”€ templates/
โ”‚ โ””โ”€โ”€ index.html
โ”œโ”€โ”€ audios/ # Directory containing subfolders with .wav and .txt files

๐Ÿš€ Usage

  1. Place your .wav audio files and their corresponding .txt transcription files in the audios directory. Ensure that each .wav file has a corresponding .txt file with the same name.
  2. Run the application using the installation script.
  3. Use the web interface to play audio files and manually correct transcriptions.
  4. Press the "Next" button to save the edited transcription and proceed to the next file. The old version of the transcription will be saved with _oldversion.txt.

๐Ÿ™ How to cite or acknowledge

If you find this tool useful in your research or work, please consider citing or acknowledging it as follows:

Guerrero-Lรณpez, Alejandro. 2024. Manual Transcription Editor: A tool for manually correcting automatic transcriptions. Available at https://github.com/aguerrerolopez/TranscriptionEditor.

๐Ÿ™ Acknowledgements

Thanks to GPT for programming almost all of this project. Am I now a prompt engineer? ๐Ÿ˜‰

๐Ÿ“œ License

This project is licensed under the MIT License.

transcriptioneditor's People

Contributors

aguerrerolopez avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.