GithubHelp home page GithubHelp logo

freedomfromfiat / embedchain Goto Github PK

View Code? Open in Web Editor NEW

This project forked from embedchain/embedchain

0.0 0.0 0.0 2.8 MB

Data platform for LLMs - Load, index, retrieve and sync any unstructured data

Home Page: https://embedchain.ai

License: Apache License 2.0

Shell 0.04% JavaScript 0.10% Python 83.25% TypeScript 3.91% Makefile 0.13% Jupyter Notebook 12.58%

embedchain's Introduction

embedchain

ROSS Index - Fastest Growing Open-Source Startups in Q3 2023 | Runa Capital

PyPI Slack Discord Twitter Substack Open in Colab codecov

Embedchain is a Data Platform for LLMs - load, index, retrieve, and sync any unstructured data. Using embedchain, you can easily create LLM powered apps over any data. If you want a javascript version, check out embedchain-js

Community

  • Join embedchain community on slack by accepting this invite

๐Ÿค Schedule a 1-on-1 Session

Book a 1-on-1 Session with Taranjeet, the founder, to discuss any issues, provide feedback, or explore how we can improve Embedchain for you.

๐Ÿ”ง Quick install

pip install --upgrade embedchain

๐Ÿ” Demo

Try out embedchain in your browser:

Open in Colab

๐Ÿ“– Documentation

The documentation for embedchain can be found at docs.embedchain.ai.

๐Ÿ’ป Usage

Embedchain empowers you to create ChatGPT like apps, on your own dynamic dataset.

Data types supported

  • Youtube video
  • PDF file
  • CSV file
  • Web page
  • MDX file
  • XML file
  • Sitemap
  • Doc file
  • Notion
  • JSON file
  • OpenAPI specs
  • Code docs website
  • Unstructured file loader and many more

You can find the full list of data types on our documentation.

Queries

For example, you can use Embedchain to create an Elon Musk bot using the following code:

import os
from embedchain import Pipeline as App

# Create a bot instance
os.environ["OPENAI_API_KEY"] = "YOUR API KEY"
elon_bot = App()

# Embed online resources
elon_bot.add("https://en.wikipedia.org/wiki/Elon_Musk")
elon_bot.add("https://www.forbes.com/profile/elon-musk")
elon_bot.add("https://www.youtube.com/watch?v=RcYjXbSJBN8")

# Query the bot
elon_bot.query("How many companies does Elon Musk run and name those?")
# Answer: Elon Musk currently runs several companies. As of my knowledge, he is the CEO and lead designer of SpaceX, the CEO and product architect of Tesla, Inc., the CEO and founder of Neuralink, and the CEO and founder of The Boring Company. However, please note that this information may change over time, so it's always good to verify the latest updates.

# (Optional): Deploy app to Embedchain Platform
app.deploy()
# ๐Ÿ”‘ Enter your Embedchain API key. You can find the API key at https://app.embedchain.ai/settings/keys/
# ec-xxxxxx

# ๐Ÿ› ๏ธ Creating pipeline on the platform...
# ๐ŸŽ‰๐ŸŽ‰๐ŸŽ‰ Pipeline created successfully! View your pipeline: https://app.embedchain.ai/pipelines/xxxxx

# ๐Ÿ› ๏ธ Adding data to your pipeline...
# โœ… Data of type: web_page, value: https://www.forbes.com/profile/elon-musk added successfully.

Examples

LLM Google Colab Replit
OpenAI Open In Colab Try with Replit Badge
Anthropic Open In Colab Try with Replit Badge
Azure OpenAI Open In Colab Try with Replit Badge
VertexAI Open In Colab Try with Replit Badge
Cohere Open In Colab Try with Replit Badge
Hugging Face Open In Colab Try with Replit Badge
JinaChat Open In Colab Try with Replit Badge
GPT4All Open In Colab Try with Replit Badge
Llama2 Open In Colab Try with Replit Badge
Embedding model Google Colab Replit
OpenAI Open In Colab Try with Replit Badge
VertexAI Open In Colab Try with Replit Badge
GPT4All Open In Colab Try with Replit Badge
Hugging Face Open In Colab Try with Replit Badge
Vector DB Google Colab Replit
ChromaDB Open In Colab Try with Replit Badge
Elasticsearch Open In Colab Try with Replit Badge
Opensearch Open In Colab Try with Replit Badge
Pinecone Open In Colab Try with Replit Badge

๐Ÿค Contributing

Contributions are welcome! Please check out the issues on the repository, and feel free to open a pull request. For more information, please see the contributing guidelines.

For more reference, please go through Development Guide and Documentation Guide.

Telemetry

We collect anonymous usage metrics to enhance our package's quality and user experience. This includes data like feature usage frequency and system info, but never personal details. The data helps us prioritize improvements and ensure compatibility. If you wish to opt-out, set the app.config.collect_metrics = False in the code. We prioritize data security and don't share this data externally.

Citation

If you utilize this repository, please consider citing it with:

@misc{embedchain,
  author = {Taranjeet Singh, Deshraj Yadav},
  title = {Embedchain: Data platform for LLMs - load, index, retrieve, and sync any unstructured data},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/embedchain/embedchain}},
}

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.