This is a simple yet useful app that helps you manually correct automatic transcriptions made by Whisper. The app allows you to play audio files and edit their corresponding transcriptions.
- ๐ Play audio files directly from the browser.
- โ๏ธ Edit and save transcriptions.
- ๐ Automatically saves the old version of the transcription with
_oldversion.txt
.
- ๐ Python 3.x
- Clone this repository to your local machine.
- Place your audio dataset in the
audios
folder. For each.wav
file, the transcription MUST have the same name but with a.txt
extension.
-
Run the Installation Script
This script will create a virtual environment, install the necessary dependencies, and run the Flask application.
./install.sh
-
Run the Application
Use the following command to run the application if it's not already running from the installation script.
./run.sh
-
Access the Application
Open your web browser and go to
http://127.0.0.1:5000/
.
Your project directory should look like this:
project/
โโโ app.py
โโโ requirements.txt
โโโ install.sh
โโโ run.sh
โโโ README.md
โโโ static/
โ โโโ play.png # (Optional: sound icon if needed)
โโโ templates/
โ โโโ index.html
โโโ audios/ # Directory containing subfolders with .wav and .txt files
- Place your
.wav
audio files and their corresponding.txt
transcription files in theaudios
directory. Ensure that each.wav
file has a corresponding.txt
file with the same name. - Run the application using the installation script.
- Use the web interface to play audio files and manually correct transcriptions.
- Press the "Next" button to save the edited transcription and proceed to the next file. The old version of the transcription will be saved with
_oldversion.txt
.
If you find this tool useful in your research or work, please consider citing or acknowledging it as follows:
Guerrero-Lรณpez, Alejandro. 2024. Manual Transcription Editor: A tool for manually correcting automatic transcriptions. Available at https://github.com/aguerrerolopez/TranscriptionEditor.
Thanks to GPT for programming almost all of this project. Am I now a prompt engineer? ๐
This project is licensed under the MIT License.