GithubHelp home page GithubHelp logo

gianlucatruda / auct Goto Github PK

View Code? Open in Web Editor NEW

This project forked from fergusdixon/auct

0.0 3.0 0.0 140.07 MB

UCT 3rd year capstone project providing an audio capture, segmentation and labelling tool

Home Page: http://bit.ly/auctPaper

License: MIT License

Java 4.61% Shell 0.27% CSS 65.08% HTML 7.96% JavaScript 22.07%

auct's Introduction

AuCT

(Audio Capture Tool)

Read the entire whitepaper.

South Africa has eleven official languages. Building out tools to support small-scale voice recognition across a large number of languages requires having a large amount of labeled audio files for training and testing. Your task is to create a web-based, or combination web- and mobile-based tool to support the collection and in situ segmentation and labeling of audio data. The use case is: a participant would use your tool to record a set of words from a list. These would then be segmented, with some control to throw out mistakes that may have been made during data capture, resulting in a set of individual, labeled audio files.

Core features

  • Input/select the word list that user will read aloud
  • Record audio while user reads the list
  • Fragment audio per word
  • Editor to sample audio files and label which word they correspond to
  • Some way to save the fragmented audio files (database, zip download, etc)
  • Audio playback
  • UNICODE text support

Optional features

  • “Whatever you want”
  • Can be Algorithmic or User-based splicing (or a combination)
    • ALGO : Record big chunks and process
    • USER : Capture button presses for words and mistakes WHILST recording audio
  • Adjust playback speed, scrubbing, skipping
  • Optional downloading at lower audio quality (bitrate change)
  • Customise UI for other languages
  • Choose language when first visiting site (on landing page)
  • Support different unit sizes (words, phrases, sentences)
  • Trim silent ends of words to remove all silence from a clip

Usage

Backend

Gradle handles the build, so make sure you have Gradle installed sudo apt-get install gradle

The wrapper is preconfigured, so to build & run from the AuctJavaServer root dir:

  • ./gradlew build
  • ./gradlew run

This will:

  • Check the Firebase DB for any files not yet marked as spliced
  • Download each file, split it per word, and upload back to the Cloud Storage bucket under "Output"
  • After each file has been uploaded, it adds a record of each segment in the DB/segments directory under the original file's name
  • The backend keeps no local files after termination

auct's People

Contributors

brigittesprung avatar fergusdixon avatar gianlucatruda avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.