The homepage shows only 10-second clips for all the audio samples. I want to know if the audio length

Question: can we generate 1sec/ms length audio? (tango, closed)

declare-lab commented on May 21, 2024

Question: can we generate 1sec/ms length audio?

from tango.

Comments (4)

deepanwayx commented on May 21, 2024

Do you mean trimming the audio to a desired length after generating a 10-second long sample? This is easily doable by truncating the generated wave in tango.py:

def generate(self, prompt, steps=100, guidance=3, samples=1, disable_progress=True, desired_length_in_seconds=10):
    """Generate audio for a single prompt string."""
    with torch.no_grad():
        latents = self.model.inference([prompt], self.scheduler, steps, guidance, samples, disable_progress=disable_progress)
        mel = self.vae.decode_first_stage(latents)
        wave = self.vae.decode_to_waveform(mel)
        # Sampling rate is 16 kHz: keep only the first desired_length_in_seconds of audio.
        # Note the slice (":N"); indexing with a bare N would select a single sample.
        wave = wave[:, :int(desired_length_in_seconds * 16000)]
    return wave[0]

However, making the events described in the text appear within the first n seconds is not straightforward to control. Because of the nature of the training dataset, the events described in the text prompt tend to be spread over the entire 10-second duration of the generated audio.
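
The trimming step above can be tried in isolation, without loading the model. The following is a minimal sketch, assuming the generated wave is a NumPy array with time on the last axis and a 16 kHz sampling rate as stated in the code comment. `trim_wave` is a hypothetical helper name, not part of tango.py, and the zero array only stands in for real generated audio.

```python
import numpy as np

SAMPLE_RATE = 16_000  # 16 kHz, as stated in the snippet above


def trim_wave(wave: np.ndarray, seconds: float, sample_rate: int = SAMPLE_RATE) -> np.ndarray:
    """Keep the first `seconds` of audio along the last axis (hypothetical helper)."""
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    # Convert the duration to a whole number of samples; fractions of a second are allowed.
    n_samples = int(round(seconds * sample_rate))
    return wave[..., :n_samples]


# Stand-in for a generated batch: 1 sample, 10 seconds of silence.
wave = np.zeros((1, 10 * SAMPLE_RATE), dtype=np.float32)
one_second = trim_wave(wave, 1.0)   # 16000 samples
fifty_ms = trim_wave(wave, 0.05)    # 800 samples
```

Passing a float duration is what makes millisecond-scale cuts possible. Asking for more than the available length simply returns the whole clip, because slicing past the end of an array is not an error.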

wassimbj commented on May 21, 2024

Yes, I meant getting the events within n seconds. Do you mean that if I train it on short audio files, I will get short results too? What length should the dataset clips be, in your opinion? And what do you think should be done to control the length of the audio?

deepanwayx commented on May 21, 2024

You need to train on shorter audio samples to achieve this control. The duration variable in train.py specifies the length of the audio in seconds. It is set to 10, which you can reduce to a smaller number and then train with appropriately short audio samples.
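
As a rough illustration of preparing "appropriately short" samples, the sketch below crops or zero-pads a clip to a fixed duration. This is an assumption about how one might prepare data, not Tango's own preprocessing: `fit_to_duration` is a hypothetical helper, the 16 kHz rate is taken from earlier in the thread, and the 2-second target is an arbitrary example value.

```python
import numpy as np

SAMPLE_RATE = 16_000  # assumed to match the 16 kHz rate mentioned earlier in the thread


def fit_to_duration(clip: np.ndarray, duration: float, sample_rate: int = SAMPLE_RATE) -> np.ndarray:
    """Crop or zero-pad a clip to exactly `duration` seconds (hypothetical helper)."""
    target = int(round(duration * sample_rate))
    current = clip.shape[-1]
    if current >= target:
        # Clip is long enough: keep only the first `target` samples.
        return clip[..., :target]
    # Clip is too short: append zeros (silence) on the time axis only.
    pad_width = [(0, 0)] * (clip.ndim - 1) + [(0, target - current)]
    return np.pad(clip, pad_width)


duration = 2  # example target length in seconds
long_clip = np.ones(5 * SAMPLE_RATE, dtype=np.float32)
short_clip = np.ones(1 * SAMPLE_RATE, dtype=np.float32)
cropped = fit_to_duration(long_clip, duration)
padded = fit_to_duration(short_clip, duration)
```

Cropping a long recording only helps if the described event actually falls inside the kept window, so clips that are short to begin with are likely a better fit than truncated 10-second ones.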

wassimbj commented on May 21, 2024

Thanks 😁
