GithubHelp home page GithubHelp logo

yuan-manx / audiotsm Goto Github PK

View Code? Open in Web Editor NEW

This project forked from muges/audiotsm

1.0 1.0 0.0 90.92 MB

A python library for real-time audio time-scale modification procedures

Home Page: https://audiotsm.readthedocs.io/

License: MIT License

Python 100.00%

audiotsm's Introduction

A real-time audio time-scale modification library

Documentation Status

Build Status

AudioTSM is a python library for real-time audio time-scale modification procedures, i.e. algorithms that change the speed of an audio signal without changing its pitch.

Documentation:

https://audiotsm.readthedocs.io/

Examples:

https://muges.github.io/audiotsm/

Source code repository and issue tracker:

https://github.com/Muges/audiotsm/

Python Package Index:

https://pypi.python.org/pypi/audiotsm/

License:

MIT -- see the file LICENSE for details.

Installation

Audiotsm should work with python 2.7 and python 3.4+.

You can install the latest version of audiotsm with pip:

pip install audiotsm

If you want to use the gstreamer plugins, you should install PyGObject and python-gst, and use the following command to install audiotsm:

pip install audiotsm[gstreamer]

If you want to play the output of the TSM procedures in real time, or to use the examples, you should install audiotsm as follow:

pip install audiotsm[stream]

Basic usage

The audiotsm package implements several time-scale modification procedures:

  • OLA (Overlap-Add);
  • WSOLA (Waveform Similarity-based Overlap-Add);
  • Phase Vocoder.

The OLA procedure should only be used on percussive audio signals. The WSOLA and the Phase Vocoder procedures are improvements of the OLA procedure, and should both give good results in most cases.

If you are unsure which procedure to choose, the Phase Vocoder should sound best in most cases. You can listen to the output of the different procedures on various audio files and at various speeds on the examples page.

Python API

Below is a basic example showing how to reduce the speed of a wav file by half using the phase vocoder procedure:

from audiotsm import phasevocoder
from audiotsm.io.wav import WavReader, WavWriter

with WavReader(input_filename) as reader:
    with WavWriter(output_filename, reader.channels, reader.samplerate) as writer:
        tsm = phasevocoder(reader.channels, speed=0.5)
        tsm.run(reader, writer)

A complete example can be found in the examples/audiotsmcli.py file. Read the documentation for more details.

GStreamer plugins

The TSM procedures are also available as GStreamer plugins. A simple example implementing a basic GStreamer pipeline can be found in the examples/audiotsmcli_gst.py file, and a more complete one showing how to use the plugins in a GTK audio player can be found in the examples/audiotsmgtk.py file. Read the documentation for more details.

Thanks

If you are interested in time-scale modification procedures, I highly recommend reading A Review of Time-Scale Modification of Music Signals by Jonathan Driedger and Meinard Müller.

audiotsm's People

Contributors

muges avatar

Stargazers

Yuan-Man avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.