GithubHelp home page GithubHelp logo

premai-io / state-of-open-source-ai Goto Github PK

View Code? Open in Web Editor NEW
1.4K 22.0 83.0 745 KB

:closed_book: Clarity in the current fast-paced mess of Open Source innovation

Home Page: https://book.premai.io/state-of-open-source-ai

License: Other

TeX 81.56% HTML 2.29% Python 9.79% JavaScript 2.22% CSS 4.14%
book jupyter-book ai ml mlops open-source hacktoberfest

state-of-open-source-ai's People

Contributors

anindyadeep avatar biswaroop1547 avatar casperdcl avatar devsjr avatar estherbester avatar filopedraz avatar flaxsearch avatar janaka-steph avatar johnsbeharry avatar mudler avatar nsosio avatar rafaelbod avatar richiejp avatar skandavivek avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

state-of-open-source-ai's Issues

Decentralized AI

Type

new chapter

Chapter/Page

Something else

Description

No response

email: `Enter` to submit

Follow-up to #42

  • should we have :valid and :invalid CSS?
  • also can hitting the Enter key work (for keyboard warriors who don't like clicking buttons)?

Have had reports of people giving up on accessing the book because they didn't realise they currently have to use the mouse to click Submit

Move RAG to different as a totally different chapter.

Type

new chapter

Chapter/Page

Something else

Description

Right now RAG is written under Fine-tuning section. However, I am not very much sure, whether that would be the correct place. Since, RAG does not change any parameters of the model or adds new parameters.

So, it would be much better to add it under different section or a dedicated chapter or add it to the VectorDatabase chapter. The concept of RAG is actually not very new. FAISS was used readily before the surge if LLMs (mostly in image search or semantic search).

In the RAG chapter we can populate by these contents:

  1. What is RAG (already covered).
  2. How RAG works on a high level.
  3. RAG in LLMs.
  4. RAG/vector dbs in computer vision.

Resume Email Wall, but include verification

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

Something else

Description

Description

We need to go back to the email wall, but we need to make sure that email provided are valid.

Fine-tuning tools and frameworks

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

fine-tuning

Description

The fine-tuning chapter should include all the tools and frameworks in order to do fine-tuning.

Different evaluation frameworks for LLMs

Type

new chapter

Chapter/Page

eval-datasets

Description

The evaluation page is really good, however, it would be awesome if we could add some information on the following evaluation frameworks.

  1. HELM by Stanford.
  2. LM Evaluation Harness by Eluther AI.
  3. Code Evaluation Harness by BigCode.

The content should be mainly regarding how they are trying to do evaluation and how to get started with each.

v1 framework

Follow-up to #11

v1.1

chapter: model-formats

  • create model-formats.md (see e.g. 70892bd for adding a chapter)

Use markdown-MyST syntax (e.g. source ➡️ rendered)

Potential subheadings:

  • GGML
  • ONNX
  • TVM

Please also report any jupyter-book build framework-related problems/questions in #12 :)

Update logo and guidelines

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

Something else

Description

Use new Prem logo and guidelines.

Adding Parameter efficient Fine-tuning techniques for LLM

Type

new chapter

Chapter/Page

fine-tuning

Description

The contents on this chapter should be the following:

  1. What is Parameter efficient fine-tuning methods and why we do it.
  2. Different PEFT methods:
    a. Prompt Tuning
    b. LoRA
    c. QLoRA
    etc

WebGPU

Type

new chapter

Chapter/Page

Something else

Description

No response

Include quantization techniques

Type

new chapter

Chapter/Page

Something else

Description

I suggest that we have a separate chapter on Quantization. The chapter should include all the new methods created to handle quantization and the most famous libraries associated with them.

  • GPTQ
  • AWQ

Audiobook format (or an text-to-speech friendly format)

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

Something else

Description

For many people the most effective way to read a book is in audiobook format.
I would like to ask to have some text-to-speech transcription available, or, considering the involved costs, a text-to-speech friendly artifact available, so I can execute the TTS by myself.
I volunteer to provide the generated "audiobook" to be shared among this book audience.

Model Parallelism

Type

new chapter

Chapter/Page

Something else

Description

No response

Inference Optimization Chapter

Type

new chapter

Chapter/Page

Something else

Description

Doing training or inference models are fairly easy, when we have smaller number of parameters. But when the scale of parameters and data increases, it becomes increasingly difficult to optimize that (interms of compute and performance). So a dedicated chapter on inference optimization and arithmetics on resource calculations becomes very useful.

Some reference links:

Mixture of Experts (MoE)

Type

new chapter

Chapter/Page

models

Description

Although MoE has been there in general Deep learning, just now Mistral Released their second models with MoE, so now it is very important to release those also.

chapter: mlops-engines

  • create mlops-engines.md (see e.g. 70892bd for adding a chapter)

Use markdown-MyST syntax (e.g. source ➡️ rendered)

Potential subheadings:

  • Difficulties of Working with OpenSource MLOps
  • Python Bindings and More
  • PyTorch Toolchain - From C/C++ to Python
  • llama.cpp
  • ONNX Runtime
  • Apache TVM

Please also report any jupyter-book build framework-related problems/questions in #12 :)

Cannot place static html page in Sphinx template

Hey Team,

As you know, we've been trying to incorporate a custom index.html page in our [Jupyter Book/Sphinx] book that does not adhere to the standard theme and I've hit a roadblock.

Challenge

After exploring various options, it appears that Jupyter Book and Sphinx are not designed to easily allow a completely custom index.html that diverges from the set themes so the https://premai-book.netlify.app seems unusable at this point.

Proposed Solution

The fastest way to implement our custom index.html seems to be post-build where we'd:

  1. Run the standard Jupyter Book or Sphinx build command to generate the documentation.
  2. Execute a post-build script to replace the default index.html with our custom version.

Next Steps

  • I wouldn't want to break anything with the current site so would need someone to help with this script

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.