GithubHelp home page GithubHelp logo

lucab85 / avg Goto Github PK

View Code? Open in Web Editor NEW
4.0 4.0 4.0 49 KB

Source of Docker image lucab85/avg https://quay.io/lucab85/avg

Home Page: https://hub.docker.com/r/lucab85/avg

License: MIT License

Dockerfile 44.35% R 41.59% Shell 14.06%

avg's People

Contributors

jmernin avatar lucab85 avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

avg's Issues

Add language support

In addition to being able to specify different voices, perhaps we could allow different languages to be specified as well. I suggest this because I saw that Amazon Polly supports both voices and languages, which are technically different things.

That way, it could be possible to produce multiple different videos of the same presentation, in different languages but using an accent that is suited to each language.

Clarify which AWS service is doing the text-to-speech conversion

Provide some additional detailed in the README about how the text-to-speech conversion is being done, so people can better understand the other values they could provide in the default.env file, for example.

In fact, perhaps we could add comments into default.env, pointing people to the relevant AWS page to see the available values for each variable.

Create empty "shared" folder for quicker setup

How about creating an empty "share" folder in the AVG repo so that the step to create this (during the setup phase) is not actually needed?

For sure, people can select/use a different folder later on but for most use cases (and in the interest in increased adoption), maybe this would help?

Custom font support

If your slide deck used custom fonts (e.g. Red Hat Display), which are not available inside the docker container, then the resulting video may contain incorrectly formatted slides. It may be worth exploring if there's any way to add the .ttf files for the fonts in question to the ARI ecosystem.

Guidance on how long rendering takes

Add some details to the README to help people understand how long it takes to render each video. Clearly, it depends on the length of the audio files produced from the speaker notes, but is it exactly 100% of their combined length or somewhere in between?

Support for Apple M1 Silicon chip architecture

I recently switched to a new MacBook Pro powered by the Apple M1 Silicon chipset and I ran into some problems while trying to set up the AVG environment there. Specifically, I got this warning when I ran the docker run command:

$ docker run --name="avg" -dit --mount type=bind,source=$(pwd)/share,target=/share --env-file=default.env  lucab85/avg
WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested

Could it be that we need to build an alternative version of the the base image, that suits the M1 chipset?

Ability to control voice playback speed

A few people have commented on recent demos (that I've shared) saying that the voice we've used speaks too quickly. Personally, the voice in question (Joanna) is probably the clearest one I've heard but I'll admit that it could indeed be a little slower when talking.

It would be good if Amazon Polly allowed you to control the pitch (or speed) of the voice.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.