premai-io / state-of-open-source-ai Goto Github PK

View Code? Open in Web Editor NEW

1.4K 22.0 83.0 745 KB

:closed_book: Clarity in the current fast-paced mess of Open Source innovation

Home Page: https://book.premai.io/state-of-open-source-ai

License: Other

TeX 81.56% HTML 2.29% Python 9.79% JavaScript 2.22% CSS 4.14%

book jupyter-book ai ml mlops open-source hacktoberfest

state-of-open-source-ai's People

Contributors

Stargazers

Watchers

Forkers

filopedraz anindyadeep architectureofthings mekongdelta-mind moseti1 harris44 seshakiran trainmachines shashankpathak95 xiaozhangzaima hertera1 alejouribe manjunathshiva mz0in dkmahto morganzhh productinfo tspannhw eugeniomagana ssarswat arvindmits anhandouts jeffara weiplanet kushal-h lyhiving nishat-khan stoneshao blueoceandevops flaxsearch darcstar-solutions-tech roapple10 cjephuneh tuananhnguyenkim essammahmood imiracle thinkerchina ufukemre ptzagk ailabteam hxdon dani-el-lo britto10 f901107 tomhuynhsg supreeth-c pizchy-wachida tranminhquan c183rcr3470r benlebovitz amit1nayak erohin vishwa15 restevesd keyman9848 himorik eccstartup 5l1v3r1 richiejp janaka-steph jeremyw-dobeu emarc permo01 andyle0302 anthonyhyphen mryoupiter rencire rafaelbod estherbester allthingsllm devsjr susheelshetty2 jakecho holisticcoder wannaphong iflab dueprincipati andreydelpozo2

state-of-open-source-ai's Issues

Type

new chapter

Chapter/Page

Something else

Description

No response

email: `Enter` to submit

Follow-up to #42

should we have :valid and :invalid CSS?

also can hitting the Enter key work (for keyboard warriors who don't like clicking buttons)?

Have had reports of people giving up on accessing the book because they didn't realise they currently have to use the mouse to click Submit

state-of-open-source-ai/licences/

Licences — State of Open Source AI Book

https://book.premai.io/state-of-open-source-ai/licences/

Move RAG to different as a totally different chapter.

Type

new chapter

Chapter/Page

Something else

Description

Right now RAG is written under Fine-tuning section. However, I am not very much sure, whether that would be the correct place. Since, RAG does not change any parameters of the model or adds new parameters.

So, it would be much better to add it under different section or a dedicated chapter or add it to the VectorDatabase chapter. The concept of RAG is actually not very new. FAISS was used readily before the surge if LLMs (mostly in image search or semantic search).

In the RAG chapter we can populate by these contents:

What is RAG (already covered).
How RAG works on a high level.
RAG in LLMs.
RAG/vector dbs in computer vision.

Adding different Parameter efficient fine-tuning methods for Generative Vison models.

Type

new chapter

Chapter/Page

fine-tuning

Description

Similar to LLMs, we use similar fine-tuning strategies for diffusion models too. So this chapter should include different ways of fine-tuning those models.

fix licence

I'm not aware of any licence that works for both code & text. I'd suggest separate licences for each (this is quite common elsewhere):

Text: CC-BY-4.0
Code: Apache-2.0

thoughts @filopedraz?

Resume Email Wall, but include verification

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

Something else

Description

We need to go back to the email wall, but we need to make sure that email provided are valid.

jupyter-book framework

state-of-open-source-ai/sdk/

Software Development toolKits — State of Open Source AI Book

https://book.premai.io/state-of-open-source-ai/sdk/

Reinforcement Learning

Type

new chapter

Chapter/Page

Something else

Description

No response

Full Homomorphic Encryption

Type

new chapter

Chapter/Page

Something else

Description

No response

link to SOTA

https://github.com/premAI-io/state-of-open-source-ai/issues

it's difficult to keep track of all the innovations. There's been enormous progress in the field in {term}the last year .

this links to sota? why that it doesn't make sense

local build fails when following instruction from index on editing the book

while following state-of-open-source-ai/#editing-the-book, it breaks on line jupyter-book build -b dirhtml --all . with the following error:

➜ jupyter-book build -b dirhtml --all .
Usage: jupyter-book build [OPTIONS] PATH_SOURCE
Try 'jupyter-book build -h' for help.

Error: No such option: -b

New chapter dedicated to cybersecurity

Have you considered adding a chapter dedicated to security of open source AI? Can help contribute.

chapter: eval-datasets

Use markdown-MyST syntax (e.g. source ➡️ rendered)
Please also report any jupyter-book build framework-related problems/questions in #12 :)

Fine-tuning tools and frameworks

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

fine-tuning

Description

The fine-tuning chapter should include all the tools and frameworks in order to do fine-tuning.

Federated Learning

Type

new chapter

Chapter/Page

Something else

Description

No response

chapter: licences

Possible rendering issue for links (from mobile)

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

Something else

Description

example

Unrendered trailing character

No trailing character shown

Ascetic issue, not serious

Finetuning Diffusers with DRLX

Type

new URL/reference/table row

Chapter/Page

fine-tuning

Description

Expand the Finetuning chapter in order to include the DRLX technique.

Mixture of Experts

Type

new chapter

Chapter/Page

Something else

Description

https://github.com/IBM/ModuleFormer
https://github.com/SkunkworksAI/hydra-moe
Decentralized Mixture of Experts

Different evaluation frameworks for LLMs

Type

new chapter

Chapter/Page

eval-datasets

Description

The evaluation page is really good, however, it would be awesome if we could add some information on the following evaluation frameworks.

HELM by Stanford.
LM Evaluation Harness by Eluther AI.
Code Evaluation Harness by BigCode.

The content should be mainly regarding how they are trying to do evaluation and how to get started with each.

Add re-captcha to the form to avoid bot

v1 framework

Follow-up to #11

v1.1

rename repo?
fix all TODO notes
fix all {{ wip_chapter }}/% TODO/Work in Progress notes
CI: speed up builds
- maybe parallel linkcheck + html + pdf #69
- smaller pdf docker container? (easier since https://github.com/xu-cheng/latex-action/releases/tag/3.0.0) (alternatively, use different/quicker pdf action?)
- add minimal pdf LaTeX to DevContainer (and use https://github.com/devcontainers/ci)?

chapter: model-formats

create model-formats.md (see e.g. 70892bd for adding a chapter)

Use markdown-MyST syntax (e.g. source ➡️ rendered)

Potential subheadings:

GGML
ONNX
TVM

Please also report any jupyter-book build framework-related problems/questions in #12 :)

state-of-open-source-ai/eval-datasets/

Evaluation & Datasets — State of Open Source AI Book

https://book.premai.io/state-of-open-source-ai/eval-datasets/

Update logo and guidelines

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

Something else

Description

Use new Prem logo and guidelines.

Adding Parameter efficient Fine-tuning techniques for LLM

Type

new chapter

Chapter/Page

fine-tuning

Description

The contents on this chapter should be the following:

What is Parameter efficient fine-tuning methods and why we do it.
Different PEFT methods:
a. Prompt Tuning
b. LoRA
c. QLoRA
etc

WebGPU

Type

new chapter

Chapter/Page

Something else

Description

No response

Include quantization techniques

Type

new chapter

Chapter/Page

Something else

Description

I suggest that we have a separate chapter on Quantization. The chapter should include all the new methods created to handle quantization and the most famous libraries associated with them.

GPTQ
AWQ

chapter: desktop-apps

https://github.com/premAI-io/state-of-open-source-ai/blob/main/desktop-apps.md

Use markdown-MyST syntax (e.g. source ➡️ rendered)
Please also report any jupyter-book build framework-related problems/questions in #12 :)

Audiobook format (or an text-to-speech friendly format)

Type

other (e.g. typos, factual errors, etc.)

Chapter/Page

Something else

Description

For many people the most effective way to read a book is in audiobook format.
I would like to ask to have some text-to-speech transcription available, or, considering the involved costs, a text-to-speech friendly artifact available, so I can execute the TTS by myself.
I volunteer to provide the generated "audiobook" to be shared among this book audience.

Model Parallelism

Type

new chapter

Chapter/Page

Something else

Description

No response

Inference Optimization Chapter

Type

new chapter

Chapter/Page

Something else

Description

Doing training or inference models are fairly easy, when we have smaller number of parameters. But when the scale of parameters and data increases, it becomes increasingly difficult to optimize that (interms of compute and performance). So a dedicated chapter on inference optimization and arithmetics on resource calculations becomes very useful.

Some reference links:

Mixture of Experts (MoE)

Type

new chapter

Chapter/Page

models

Description

Although MoE has been there in general Deep learning, just now Mistral Released their second models with MoE, so now it is very important to release those also.

comments: Sign in not working

When signing in with GitHub to add a comment in a chapter I've got 404.

state-of-open-source-ai/models/

Models — State of Open Source AI Book

https://book.premai.io/state-of-open-source-ai/models/

chapter: mlops-engines

create mlops-engines.md (see e.g. 70892bd for adding a chapter)

Use markdown-MyST syntax (e.g. source ➡️ rendered)

Potential subheadings:

Difficulties of Working with OpenSource MLOps
Python Bindings and More
PyTorch Toolchain - From C/C++ to Python
llama.cpp
ONNX Runtime
Apache TVM

Please also report any jupyter-book build framework-related problems/questions in #12 :)

Cannot place static html page in Sphinx template

Hey Team,

As you know, we've been trying to incorporate a custom index.html page in our [Jupyter Book/Sphinx] book that does not adhere to the standard theme and I've hit a roadblock.

Challenge

After exploring various options, it appears that Jupyter Book and Sphinx are not designed to easily allow a completely custom index.html that diverges from the set themes so the https://premai-book.netlify.app seems unusable at this point.

Proposed Solution

The fastest way to implement our custom index.html seems to be post-build where we'd:

Run the standard Jupyter Book or Sphinx build command to generate the documentation.
Execute a post-build script to replace the default index.html with our custom version.

Next Steps

I wouldn't want to break anything with the current site so would need someone to help with this script

premai-io / state-of-open-source-ai Goto Github PK

state-of-open-source-ai's People

Contributors

Stargazers

Watchers

Forkers

state-of-open-source-ai's Issues

Type

Chapter/Page

Description

Licences — State of Open Source AI Book

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Description

Software Development toolKits — State of Open Source AI Book

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Description

Type

Chapter/Page

Description

v1.1

Evaluation & Datasets — State of Open Source AI Book

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Type

Chapter/Page

Description

Models — State of Open Source AI Book

Challenge

Proposed Solution

Next Steps

Recommend Projects

Recommend Topics

Recommend Org