khoj/documentation/docs/features/voice_chat.md

1.6 KiB

Voice

You can talk to Khoj using your voice. Khoj will respond to your queries using the same models as the chat feature. You can use voice chat on the web, Desktop, and Obsidian apps.

Voice Chat

Click on the little mic icon to send your voice message to Khoj. It will send back what it heard via text. You'll have some time to edit it before sending it, if required. Try it at https://app.khoj.dev/.

Voice Response

When you get a response from Khoj, you can click on the speaker icon to hear the response. This feature is available only on the web view right now.

Speaker Icon

Setup (Self-Hosting)

Voice chat will automatically be configured when you initialize the application. The default configuration will run locally. If you want to use the OpenAI whisper API for voice chat, you can set it up by following these steps:

  1. Setup your OpenAI API key. See instructions here.
  2. Create a new configuration at http://localhost:42110/server/admin/database/speechtotextmodeloptions/. We recommend the value whisper-1 and model type Openai.

If you want to use the Text to Speech feature, you can set it up by following these steps:

  1. Setup your account on ElevenLabs.io.
  2. Configure your API key in your environment variables with the key ELEVEN_LABS_API_KEY.
  3. (Optional) Create a new Voice model option with a specific voice ID from whichever voice you want to use. You can explore the options here.