khoj/documentation/docs/features/image_generation.md

1.7 KiB

Image Generation

You can use Khoj to generate images from text prompts. You can get deeper into the details of our image generation flow in this blog post: https://blog.khoj.dev/posts/how-khoj-generates-images/.

To generate images, you just need to provide a prompt to Khoj in which the image generation is in the instructions. Khoj will automatically detect the image generation intent, augment your generation prompt, and then create the image. Here are some examples:

Prompt Image
Paint a picture of the plants I got last month, pixar-animation plants
Create a picture of my dream house, based on my interests house

Setup (Self-Hosting)

You have a couple of image generation options.

Image Generation Models

We support most state of the art image generation models, including Ideogram, Flux, and Stable Diffusion. These will run using Replicate. Here's how to set them up:

  1. Get a Replicate API key here.
  2. Create a new Text to Image Model. Set the type to Replicate. Use any of the model names you see on this list.

OpenAI

  1. Get an OpenAI API key.
  2. Setup your OpenAI API key, if you haven't already. See instructions here
  3. Create a text to image config at http://localhost:42110/server/admin/database/texttoimagemodelconfig/. We recommend the model name dall-e-3. Make sure to associate it with the OpenAI API chat configuration you setup in step 2 with Openai config field.