chainlink-assistant's Introduction

Chainlink AI Assistant

This is a collection of LLM programs for a personalized AI assistant that is driven by Chainlink’s publicly-available developer resources:

Our goal is to improve the productivity of developers that are building with Chainlink infrastructure. Many developers already use ChatGPT, but this is a general model that (i) often outputs instructions that are out of date, (ii) isn’t specialized towards developing on top of Chainlink.

We use a recent approach to personalizing AI assistants, called in-context retrieval-augmented language models (see research overview here), which has the advantage of citing sources and reducing hallucination (making stuff up).

You can find further details of the project in this doc.

What's in this Repo?

1. Data

We have scraped various data sources relevent to Chainlink development such as the Chainlink Developer Docs, Chainlink Tags on Stack Overflow and Chainlink Academy. We run this text through the OpenAI embedding model and store it in a vector db. This data can be found in the algovate/data folder e.g. documents.pkl.

2. LLM Assistants

We have experimented with a variety of LLM assistants using LLM programming approaches like LLM workflows/chains and agents, with frameworks such as LangChain and Llama Index. These LLM assistants use a variety of retrieval methods (e.g. vector-based retrieval), logic and models (e.g. the new 16k token context window model from OpenAI). These can be found in the algovate/langchain and algovate/llama dirs, and notebooks.

How to ingest data

How to run chat/qanda

chainlink-assistant's People

Contributors

Watchers

chainlink-assistant's Issues

Only use verified answers for Stack Overflow data source

From Daniel:

There's a scenario where an attacker could plant unsafe information in one of the sources and the widget would serve it.
If a bad actor answers questions with potentially harmful content, we should not serve it through the widget.

Update repo to be main repo for chainlink consultancy

Basic keyword search for deliverable 2

work on getting data from new source

New source
https://data.chain.link/
eg: https://data.chain.link/ethereum/mainnet/crypto-usd/eth-usd

read data
convert to simple English

Potential questions asked by devs

check if a feed is verified, ex: Is eth/usd a verified feed?
is eth/usd feed backed by staking?
under what asset class does eth/usd fall?
what is the tier of the eth/usd feed on binance?
what is the deviation threshold of eth/usd on binance?
how many oracles carry eth/usd on binance?
(replace asset and blockchain name by any combination)