Comments (4)
We can use Mistral 7B to test this. It would also be useful to add a LangChain example that leverages this API.
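As a starting point for such an example, here is a minimal sketch of the OpenAI-style request body that a LangChain client would send to the serve endpoint. The endpoint URL, model id, and `get_weather` tool are all placeholder assumptions, not actual llm-on-ray names:

```python
import json

# Hypothetical endpoint and model id -- adjust for your deployment.
BASE_URL = "http://localhost:8000/v1/chat/completions"
MODEL_ID = "mistral-7b-instruct"

# OpenAI-compatible request body carrying `tools` and `tool_choice`.
payload = {
    "model": MODEL_ID,
    "messages": [{"role": "user", "content": "What is the weather in Paris?"}],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Get the current weather for a city.",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    "tool_choice": "auto",
}

print(json.dumps(payload, indent=2))
```

A LangChain example would produce an equivalent body via its OpenAI-compatible chat model wrapper pointed at `BASE_URL`.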
from llm-on-ray.
@xuechendi is working on it.
@carsonwang, I've been working on this issue but am a little lost on how to enable it.
Observations:
- HuggingFace support for `tools` / `tool_choice`: the `generate` function has no native parameters for `tools` or `tool_choice`. Hugging Face seems to have its own way of supporting tools: https://huggingface.co/docs/transformers/main/en/custom_tools
- RayLLM `tools` / `tool_choice` support: RayLLM accepts these two keywords in its client API, but after digging into the inference code, I can't find how they are passed to the model's inference function in either vLLMEngine or TRTEngine. Even though the Anyscale blog mentions full support for function calls, it has not landed in the RayLLM code yet. (Anyscale Endpoints: JSON Mode and Function calling Features)
- llama_cpp `tools` / `tool_choice` support: I found a function_call implementation in llama_cpp_python for two models, functionary and chatML. Only functionary accepts these two keywords natively; chatML still converts them to plain text, the same way Langchain.initialize_agent does. https://github.com/abetlen/llama-cpp-python/blob/main/llama_cpp/llama_chat_format.py#L2032
- Research into vLLM / Triton TensorRT / HuggingFace issues on `tools` / `tool_choice` support: no consistent conclusion found.
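To illustrate the plain-text fallback mentioned for chatML (and Langchain.initialize_agent), here is a sketch of flattening tool specs into the prompt and parsing a JSON tool call back out of the model's text reply. The prompt format and function names here are illustrative assumptions, not the actual chatML or LangChain implementation:

```python
import json
import re

def render_tools_prompt(tools, user_msg):
    """Flatten tool specs into plain prompt text (format is illustrative)."""
    lines = ['You can call these tools by replying with JSON '
             '{"name": ..., "arguments": {...}}:']
    for t in tools:
        fn = t["function"]
        lines.append(f"- {fn['name']}: {fn['description']}")
    lines.append(f"User: {user_msg}")
    return "\n".join(lines)

def parse_tool_call(model_output):
    """Extract the first JSON object from the model's text, if any."""
    match = re.search(r"\{.*\}", model_output, re.DOTALL)
    return json.loads(match.group(0)) if match else None

tools = [{"type": "function",
          "function": {"name": "get_weather",
                       "description": "Get the current weather for a city."}}]
prompt = render_tools_prompt(tools, "What is the weather in Paris?")

# A hypothetical model reply in the agreed plain-text format:
reply = '{"name": "get_weather", "arguments": {"city": "Paris"}}'
call = parse_tool_call(reply)
print(call["name"])  # get_weather
```

This is why keywords like `tools` / `tool_choice` can "work" with such backends without any native parameter support: they are consumed by the prompt template, not by the inference engine.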
Below is my branch duplicating RayLLM's `tools` / `tool_choice` support:
xuechendi@42d9613
pr: #134
Related Issues (20)
- Add serve command line options to list all supported model-ids (configured in *yaml)
- Issue about using ipex on cpu
- Migrate OpenAI API to 1.0
- Consolidate deepspeed workers for DeepSpeedPredictor and HPUPredictor
- Docker files for both CI and User
- Define simple_protocol.py and define pydantic SimpleRequest and SimpleModelResponse classes to encapsulate current json format
- Output some debug info in CI when Internal Server Error
- Calculate correct input length for every prompt in a single batch
- expected scalar type BFloat16 but found Float
- Revise README.md in examples directory
- Build docker files for both CI and User
- Migrate CI to miniforge instead of miniconda
- A question about finetune dataset processing
- Finetuning on Ray and CPU causes Runtime error
- [Finetune] Fix fault-tolerant training
- [Finetune] WandB integration
- Inference Qwen1.5-110b on Gaudi2
- Inference Mixtral on Gaudi
- Finetune on Ray cluster with trial task failed
- AssertionError: BF16 weight prepack needs the cpu support avx512bw, avx512vl and avx512dq