sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-24 16:05:07 +01:00

Debanjum Singh Solanky a5c16ad600 Move Web client config page to /configure from /config url path

Update docs, clients and error messages to point to /configure
instead of /config

2024-07-16 16:13:27 +05:30

2.2 KiB

Raw Blame History

sidebar_position
1

Use OpenAI Proxy

:::info This is only helpful for self-hosted users. If you're using Khoj Cloud, you're limited to our first-party models. :::

:::info Khoj natively supports local LLMs available on HuggingFace in GGUF format. Using an OpenAI API proxy with Khoj maybe useful for ease of setup, trying new models or using commercial LLMs via API. :::

Khoj can use any OpenAI API compatible server including Ollama, LMStudio and LiteLLM. Configuring this allows you to use non-standard, open or commercial, local or hosted LLM models for Khoj

Combine them with Khoj can turn your favorite LLM into an AI agent. Allowing you to chat with your docs, find answers from the internet, build custom agents and run automations.

For specific integrations, see our Ollama, LMStudio and LiteLLM setup docs. For general instructions to setup Khoj with an OpenAI API proxy see below.

General Setup

Start your preferred OpenAI API compatible app
Create a new OpenAI Processor Conversation Config on your Khoj admin panel
- Name: proxy-name
- Api Key: any string
- Api Base Url: URL of your Openai Proxy API
Create a new Chat Model Option on your Khoj admin panel.
- Name: llama3 (replace with the name of your local model)
- Model Type: Openai
- Openai Config: <the proxy config you created in step 3>
- Max prompt size: 2000 (replace with the max prompt size of your model)
- Tokenizer: Do not set for OpenAI, mistral, llama3 based models
Create a new Server Chat Setting on your Khoj admin panel
- Default model: <name of chat model option you created in step 4>
- Summarizer model: <name of chat model option you created in step 4>
Go to your config and select the model you just created in the chat model dropdown.

2.2 KiB Raw Blame History

Use OpenAI Proxy

General Setup

2.2 KiB

Raw Blame History