GithubHelp home page GithubHelp logo

noco-ai / spellbook-docker Goto Github PK

View Code? Open in Web Editor NEW
107.0 2.0 5.0 2.44 MB

AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models

Home Page: https://github.com/noco-ai/spellbook-docker/wiki

License: Open Software License 3.0

Dockerfile 37.03% Shell 62.24% TypeScript 0.73%
llama2 llm-inference mixtral stable-diffusion automatic-speech-recognition text-to-speech bark musicgeneration whisper xttsv2

spellbook-docker's People

Contributors

noco-ai avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

spellbook-docker's Issues

containers stop with errors

Windows 11 - Docker Desktop
WSL2 backend - Ubuntu 22.04 LTS
1 minor issue resolved
multiple other failures

Minor:
\spellbook-docker> docker compose up
[+] Running 1/0
โœ˜ Network spellbook-docker_spellbook-net Error 0.0s
failed to create network spellbook-docker_spellbook-net: Error response from daemon: invalid network config:
invalid subnet 10.23.82.0/16: it should be 10.23.0.0/16

*Resolved by editing docker-compose.yml and correcting the subnet address

Failures:

spellbook_rabbitmq | /usr/local/bin/docker-entrypoint.sh: /amqp-init.sh: /bin/bash^M: bad interpreter: No such file or directory
spellbook_rabbitmq | /usr/local/bin/docker-entrypoint.sh: line 50: /amqp-init.sh: Success
spellbook_rabbitmq exited with code 127

spellbook_mariadb | /usr/local/bin/docker-entrypoint.sh: /docker-entrypoint-initdb.d/1-db-init.sh: /bin/bash^M: bad interpreter: No such file or directory
spellbook_mariadb exited with code 126

spellbook_arcane_bridge | Starting child process with 'node dist/arcane.bridge.js -vh http://10.23.82.2:8200 -vt /vault_share/write-token -cs'
spellbook_arcane_bridge | failed to start vault
spellbook_arcane_bridge | ๐Ÿ’ฃ service class VaultService did not start correctly, exiting program ๐Ÿ’ฃ
spellbook_arcane_bridge | Program node dist/arcane.bridge.js -vh http://10.23.82.2:8200 -vt /vault_share/write-token -cs exited with code 0

the only log entry in spellbook-docker-vault is:
exec /usr/local/bin/unseal.sh: no such file or directory

The build log for Vault was completed normally

I may try running this directly in WSL2 Ubuntu 22.04 and see if that makes a difference as I usually run all my docker stuff on other Ubuntu VMs but my Windows machine is where my GPUs are.

Nice.

Really cool project.

I'm working on something similar (structurally at least), a manga-to-anime pipeline.
It involves a lot of different steps/models, similar to this project:

  • Pre processing (alignment, upscaling, coloring).
  • Separating pages into panels.
  • Ordering the panels in the right reading order (took so much more effort than I thought...)
  • Segmentation (using segment-anything)
  • Extracting bubbles, the tails of bubbles/their vector, faces, bodies, backgrounds. Most of that necessitated training custom models.
  • Assigning a character identity to each face/body.
  • Making a naive association between faces and bubbles.
  • Reading the text of bubbles.
  • I feed all that data to GPT4-V, and ask it to "read" each panel, telling it what happened in previous panels, what bubble is associated with what face, etc, asking it to "understand" what is happening in the panel, and to "deduce" some associations between the items, the tone of voice, etc. I tried "just" asking GPT4-V to read manga pages without all the steps above, and it was terrible at it. But with all the provided info (which causes easily 10k-token prompts, just for the text), it gets much better at it. It's sort of "pre-chewing" the work for him.
  • That's where Iย am at now, the next step is going to be generating voice (what I'm working on now, bark/whisper/other models), sound effects, and then generating animation and special effects, and finally assembling all that into video.

I'll be looking closer into your project, in particular how it's organized, thanks a lot for sharing.
I'd be curious if you have any insights on how you'd do manga reading if you had to.

Cheers!

masked

panel

prompt.json

prompt.txt

reading.json

response.txt

result.json

6253

6254

Ollama support?

Hey, very exciting project, I feel like I wanted to build something like this for a while now.

I've recently started playing with local models using ollama (hosted by a small computer on my network).

Wondering if it is possible to connect this to ollama? (maybe I'm asking if it is possible to use this without the router that requires nvidia hardware)

Also, beyond installation instructions and high level design, is there a wiki where it is explained how individual parts work?

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.