
This project forked from protyposis/audioalign


Audio Synchronization and Analysis Tool

License: GNU Affero General Public License v3.0


AudioAlign: Audio Synchronization And Analysis Tool

AudioAlign is a research tool to investigate automatic synchronization of audio and video recordings that have either been recorded in parallel at the same event or contain the same aural information. It is designed as a GUI for the Aurio library.

To see what AudioAlign can do, check the demo videos linked below.

Screenshot of the GUI

Use cases

AudioAlign was developed for a research project whose goal was to automatically synchronize audio and video recordings made at the same time at the same event, e.g. a speech or a music concert. The idea was to synchronize all the videos taken from the crowd and combine them to

  • generate multicamera cuts by switching between perspectives (video),
  • create videos with full or at least better event coverage,
  • replace bad quality audio or video tracks with better ones, or
  • detect interesting moments (where many recordings have been captured at the same time).
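
To illustrate the last point: once recordings are synchronized, each one occupies an interval on a shared timeline, and "interesting moments" are the spans where many intervals overlap. The following is a minimal sketch of that idea, not code from AudioAlign or Aurio. The function name `busiest_spans`, the `(start, end)` tuple layout in seconds, and the `min_count` threshold are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical representation: (start, end) of a recording in seconds
# on the shared, already synchronized timeline.
Interval = Tuple[float, float]


def busiest_spans(recordings: List[Interval], min_count: int) -> List[Tuple[float, float, int]]:
    """Return (start, end, count) spans covered by at least min_count recordings."""
    # Sweep-line events: +1 where a recording starts, -1 where it ends.
    events = []
    for start, end in recordings:
        if end <= start:
            raise ValueError("each recording needs end > start")
        events.append((start, 1))
        events.append((end, -1))
    # At equal times, ends (-1) sort before starts (+1), so recordings
    # that merely touch are not counted as overlapping.
    events.sort(key=lambda e: (e[0], e[1]))

    spans: List[Tuple[float, float, int]] = []
    active = 0          # number of recordings covering the current span
    prev_time = None    # time of the previous event
    for time, delta in events:
        if prev_time is not None and time > prev_time and active >= min_count:
            if spans and spans[-1][1] == prev_time and spans[-1][2] == active:
                # Extend the previous span when it is contiguous with the same count.
                spans[-1] = (spans[-1][0], time, active)
            else:
                spans.append((prev_time, time, active))
        active += delta
        prev_time = time
    return spans


if __name__ == "__main__":
    recordings = [(0, 60), (30, 90), (45, 120), (200, 260)]
    print(busiest_spans(recordings, min_count=2))
    # [(30, 45, 2), (45, 60, 3), (60, 90, 2)]
```

In this example the span from 45 s to 60 s is covered by three recordings, which makes it the most likely candidate for an interesting moment.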

AudioAlign can be used for many more use cases though, including

  • video mashups (video)
  • comparison of live performances (video)
  • synchronization of different cover interpretations (video)
  • voice dubbing (video)
  • ground truth creation (website)
  • evaluation of fingerprinting algorithms
  • ...

What's new

See CHANGELOG.

Development Requirements

  • Visual Studio 2022
  • .NET SDK 6.0

Documentation

Controls & Shortcuts

  • Audio and video files can be dragged and dropped onto the timeline
    • Multiple files will be added as multiple tracks
    • Hold SHIFT to add the files as a single concatenated track
  • Navigating the multitrack view
    • Click anywhere or drag the caret in the time scale to set the current (playback) position
    • Press SPACE to start/pause playback
    • Vertically drag the bottom of a track to resize its height
    • Scroll the mouse wheel to scale the time resolution (zoom into/out of the timeline) at the current position
      • Hold CTRL to smoothly scroll the timeline
      • Hold CTRL + SHIFT to page through the timeline
    • Select a track by clicking its title bar
      • Hold CTRL to select multiple tracks
      • Hold SHIFT to select a range of tracks
    • Move tracks on the timeline by dragging them horizontally

Publications

Mario Guggenberger, Mathias Lux, and Laszlo Böszörmenyi. 2012. AudioAlign – Synchronization of A/V-Streams Based on Audio Data. In 2012 IEEE International Symposium on Multimedia. Irvine, CA, USA, 382-383. DOI=http://dx.doi.org/10.1109/ISM.2012.79

Mario Guggenberger. 2015. Aurio: Audio Processing, Analysis and Retrieval. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 705-708. DOI=http://dx.doi.org/10.1145/2733373.2807408

Support

For questions and problems, please open an issue on the issue tracker. Commercial support, development, and consultation are available through Protyposis Multimedia Solutions.

License

Copyright (C) 2010-2023 Mario Guggenberger. This project is released under the terms of the GNU Affero General Public License. See LICENSE for details.
