diff --git a/documentation/docs/features/image_generation.md b/documentation/docs/features/image_generation.md index da6af1ac..df8d743f 100644 --- a/documentation/docs/features/image_generation.md +++ b/documentation/docs/features/image_generation.md @@ -10,6 +10,20 @@ To generate images, you just need to provide a prompt to Khoj in which the image ## Setup (Self-Hosting) -Right now, we only support integration with OpenAI's DALL-E. You need to have an OpenAI API key to use this feature. Here's how you can set it up: -1. Setup your OpenAI API key. See instructions [here](/get-started/setup#2-configure) -2. Create a text to image config at http://localhost:42110/server/admin/database/texttoimagemodelconfig/. We recommend the value `dall-e-3`. +You have a couple of image generation options. + +### OpenAI + +1. Get [an OpenAI API key](https://platform.openai.com/settings/organization/api-keys). +2. Setup your OpenAI API key, if you haven't already. See instructions [here](/get-started/setup#2-configure) +3. Create a text to image config at http://localhost:42110/server/admin/database/texttoimagemodelconfig/. We recommend the `model name` `dall-e-3`. Make sure to associate it with the OpenAI API chat configuration you setup in step 2 with `Openai config` field. + +### Flux + +1. You need a Replicate API key. You can find one [here](https://replicate.com/account/api-tokens). +1. Create a new [Text to Image Model](https://app.khoj.dev/server/admin/database/texttoimagemodelconfig/). Set the `type` to `Replicate`. We recommend the `model name` `black-forest-labs/flux-1.1-pro`. + +### Stable Diffusion + +1. Get an API key from [Stable Diffusion](https://www.stablediffusion.com/). +2. Create a new [Text to Image Model](https://app.khoj.dev/server/admin/database/texttoimagemodelconfig/). Set the `type` to `Stabilityai`. We recommend the `model name` `sd3-large`.