GithubHelp home page GithubHelp logo

chattymppromnica / generativeaiexamples Goto Github PK

View Code? Open in Web Editor NEW

This project forked from nvidia/generativeaiexamples

0.0 0.0 0.0 9.68 MB

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

License: Apache License 2.0

Shell 0.34% JavaScript 0.03% Python 53.21% Go 17.17% CSS 0.06% Makefile 2.74% HTML 0.96% Smarty 0.16% Jupyter Notebook 23.88% Dockerfile 0.66% Jinja 0.80%

generativeaiexamples's Introduction

NVIDIA Generative AI Examples

Introduction

State-of-the-art Generative AI examples that are easy to deploy, test, and extend. All examples run on the high performance NVIDIA CUDA-X software stack and NVIDIA GPUs.

NVIDIA NGC

Generative AI Examples uses resources from the NVIDIA NGC AI Development Catalog.

Sign up for a free NGC developer account to access:

  • The GPU-optimized NVIDIA containers, models, scripts, and tools used in these examples
  • The latest NVIDIA upstream contributions to the respective programming frameworks
  • The latest NVIDIA Deep Learning and LLM software libraries
  • Release notes for each of the NVIDIA optimized containers
  • Links to developer documentation

Retrieval Augmented Generation (RAG)

A RAG pipeline embeds multimodal data -- such as documents, images, and video -- into a database connected to a Large Language Model. RAG lets users use an LLM to chat with their own data.

Name Description LLM Framework Multi-GPU Multi-node Embedding TRT-LLM Triton VectorDB K8s
Linux developer RAG Single VM, single GPU llama2-13b Langchain + Llama Index No No e5-large-v2 Yes Yes Milvus No
Windows developer RAG RAG on Windows llama2-13b Llama Index No No NA Yes No FAISS NA
Developer LLM Operator for Kubernetes Single node, single GPU llama2-13b Langchain + Llama Index No No e5-large-v2 Yes Yes Milvus Yes

Large Language Models

NVIDIA LLMs are optimized for building enterprise generative AI applications.

Name Description Type Context Length Example License
nemotron-3-8b-qa-4k Q&A LLM customized on knowledge bases Text Generation 4096 No NVIDIA AI Foundation Models Community License Agreement
nemotron-3-8b-chat-4k-steerlm Best out-of-the-box chat model with flexible alignment at inference Text Generation 4096 No NVIDIA AI Foundation Models Community License Agreement
nemotron-3-8b-chat-4k-rlhf Best out-of-the-box chat model performance Text Generation 4096 No NVIDIA AI Foundation Models Community License Agreement

Integration Examples

NVIDIA support

In each of the READMEs, we indicate the level of support provided.

Feedback / Contributions

We're posting these examples on GitHub to better support the community, facilitate feedback, as well as collect and implement contributions using GitHub Issues and pull requests. We welcome all contributions!

Known issues

  • In each of the READMEs, we indicate any known issues and encourage the community to provide feedback.
  • The datasets provided as part of this project is under a different license for research and evaluation purposes.
  • This project will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.

generativeaiexamples's People

Contributors

shubhadeepd avatar sumitkbh avatar fciannella avatar dependabot[bot] avatar dharmendrach avatar jliberma avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.