sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-12-18 18:47:11 +00:00

Author	SHA1	Message	Date
Debanjum	5723a3778e	Speed up Docker image builds using multi-stage parallel pipelines (#987) ## Objective Improve build speed and size of khoj docker images ## Changes ### Improve docker image build speeds - Decouple web app and server build steps - Build the web app and server in parallel - Cache docker layers for reuse across dockerize github workflow runs - Split Docker build layers for improved cacheability (e.g. separate `yarn install` and `yarn build` steps) ### Reduce size of khoj docker images - Use an up-to-date `.dockerignore` to exclude unnecessary directories - Do not install cuda python packages for cpu builds ### Improve web app builds - Use consistent mechanism to get fonts for web app - Make tailwind extensions production instead of dev dependencies - Make next.js create production builds for the web app (via `NODE_ENV=production` env var)	2024-11-24 21:49:46 -08:00
Debanjum	4a5646c8da	Cache docker layers, nextjs builds in dockerize github workflow	2024-11-24 21:06:22 -08:00
Debanjum	6a39651ad3	Standardize loading fonts locally across pages on web app	2024-11-24 20:41:15 -08:00
sabaimran	9368699b2c	Migrate the pre-commit config	2024-11-24 14:54:26 -08:00
sabaimran	6eb59464da	Add additional reinforcement to coax gemini into giving a minimum helpful response	2024-11-24 14:53:53 -08:00
sabaimran	15f062b34a	Remove print statement for agent style map	2024-11-24 14:53:53 -08:00
sabaimran	d7e68a2d1b	Wait for iplcodata to load before first message - Fix the console khoj ai ascii art - Remove some not so good suggested prompt	2024-11-24 14:53:53 -08:00
Debanjum	f51e0f7859	Make Next.js create production builds of web app for Docker images	2024-11-24 13:59:40 -08:00
Debanjum	710e00ad9e	Make tailwind extensions prod, instead of dev, deps of web app	2024-11-24 13:59:40 -08:00
Debanjum	4b486ea5f6	Exclude unnecessary directories from final docker builds	2024-11-24 13:59:40 -08:00
Debanjum	78d8ca49ec	Skip Nvidia GPU python packages during Server install in Dockerfiles	2024-11-24 13:59:39 -08:00
Debanjum	37887a175a	Speed up Docker image builds using multi-stage parallel pipelines Decouple web app, server builds in parallel to speed up Docker builds	2024-11-24 12:48:30 -08:00
Debanjum	7c77d65d35	Improve logic to disable telemetry via KHOJ_TELEMETRY_DISABLE env var The newly added KHOJ_TELEMETRY_DISABLE env var knob to disable telemetry should override old config mechanism when set	2024-11-24 00:54:16 -08:00
sabaimran	2d683898c2	Release Khoj version 1.30.7	2024-11-23 22:51:10 -08:00
sabaimran	914ff994f7	Fix cost addition to chat_metadata	2024-11-23 22:50:45 -08:00
Debanjum	caaa127dcf	Release Khoj version 1.30.6	2024-11-23 21:07:00 -08:00
Debanjum	57b8273002	Fix apt install for musl-dev in prod.Dockerfile	2024-11-23 21:06:09 -08:00
Debanjum	8f966b11ec	Release Khoj version 1.30.5	2024-11-23 20:49:05 -08:00
Debanjum	498895a47d	Fix libmusl error using pre-built llama-cpp-python wheel in prod Docker	2024-11-23 20:47:41 -08:00
Debanjum	e5b211a743	Release Khoj version 1.30.4	2024-11-23 19:48:21 -08:00
Debanjum	9848d89d03	Try build docker images with github high cpu, ram runner	2024-11-23 19:09:36 -08:00
Debanjum	04bb3d6f15	Fix libmusl error using pre-built llama-cpp-python wheel via Docker Seems like llama-cpp-python pre-built wheels need libmusl. Otherwise you run into runtime errors on Khoj startup via Docker.	2024-11-23 18:46:44 -08:00
Debanjum	8dd2122817	Set sample size to 200 for automated eval runs as well	2024-11-23 14:48:38 -08:00
Debanjum	c4ef31d86f	Release Khoj version 1.30.3	2024-11-23 14:40:06 -08:00
Debanjum	15ae22bdcf	Use pre-built llama-cpp-python wheel in Khoj docker images Reduces build time and resolves FileNotFoundError 'ninja' during llama-cpp-python local build.	2024-11-23 14:38:07 -08:00
sabaimran	4ac49ca90f	Release Khoj version 1.30.2	2024-11-23 12:00:28 -08:00
Debanjum	5aa5cb1941	Add "New" section with latest updates to Readme	2024-11-23 01:36:50 -08:00
sabaimran	7f5bf35806	Disambiguate renewal_date type. Previously, being used as None, False, and Datetime in different places.	2024-11-22 12:06:20 -08:00
sabaimran	5e8c824ecc	Improve the experience for finding past conversation - add a conversation title search filter, and an agents filter, for finding conversations - in the chat session api, return relevant agent style data	2024-11-22 12:03:01 -08:00
sabaimran	a761865724	Fix handling of customer.subscription.updated event to process new renewal end date	2024-11-22 12:03:01 -08:00
sabaimran	6a054d884b	Add quicker/easier filtering on auth	2024-11-22 12:03:01 -08:00
Debanjum	b9a889ab69	Fix Khoj responses when code generated charts in response context The current fix should improve Khoj responses when charts are in the response context. It truncates code context before sharing it with response chat actors. Previously Khoj would respond that it was not able to create a chart but then have a generated chart in its response in default mode. The truncate code context was added to the research chat actor for decision making but it wasn't added to the conversation response generation chat actors. When khoj generated charts with code for its response, the images in the context would exceed context window limits. So the truncation logic would drop all past context, including chat history and context gathered for the current response. This would result in the chat response generator 'forgetting' everything for the current response when code generated images or charts were in the response context.	2024-11-21 14:43:52 -08:00
Debanjum	5475a262d4	Move truncate code context func for reusability across modules It needs to be used across routers and processors. It being in run_code tool makes it hard to be used in other chat provider contexts due to circular dependency issues created by send_message_to_model_wrapper func	2024-11-21 14:27:39 -08:00
Debanjum	f434c3fab2	Fix toggling prompt tracer on/off in Khoj via PROMPTRACE_DIR env var Previous changes to depend on just the PROMPTRACE_DIR env var instead of KHOJ_DEBUG or verbosity flag was partial/incomplete. This fix adds all the changes required to only depend on the PROMPTRACE_DIR env var to enable/disable prompt tracing in Khoj.	2024-11-21 14:06:00 -08:00
Debanjum	4a40cf79c3	Add docs on how to cross-device access self-hosted khoj using tailscale	2024-11-21 11:07:18 -08:00
Debanjum	1f96c13f72	Enable starting khoj uvicorn server with ssl cert file, key for https Pass your domain cert files via the --sslcert, --sslkey cli args. For example, to start khoj at https://example.com, you'd run command: KHOJ_DOMAIN=example.com khoj --sslcert example.com.crt --sslkey example.com.key --host example.com This sets up ssl certs directly with khoj without requiring a reverse proxy like nginx to serve khoj behind https endpoint for simple setups. More complex setups should, of course, still use a reverse proxy for efficient request processing	2024-11-21 11:07:18 -08:00
sabaimran	9fea02f20f	In telemetry, differentiate create_user google and email	2024-11-21 11:01:37 -08:00
sabaimran	9db885b5f7	Limit access to chat models to futurist users	2024-11-21 07:53:24 -08:00
sabaimran	7a00a07398	Add trailing slash to Ollama url in docs	2024-11-21 07:48:18 -08:00
sabaimran	3519dd76f0	Fix type of excalidraw image response	2024-11-20 19:01:13 -08:00
sabaimran	467de76fc1	Improve the image diagramming prompts and response parsing	2024-11-20 18:59:40 -08:00
Debanjum	50d8405981	Enable khoj to use terrarium code sandbox as tool in eval workflow	2024-11-20 14:19:27 -08:00
Debanjum	2203236e4c	Update desktop app dependencies	2024-11-20 13:05:55 -08:00
Debanjum	409204917e	Update documentation website dependencies	2024-11-20 13:05:32 -08:00
Debanjum	6f1adcfe67	Track Usage Metrics in Chat API. Track Running Cost, Accuracy in Evals (#985) - Track, return cost and usage metrics in chat api response Track input, output token usage and cost of interactions with openai, anthropic and google chat models for each call to the khoj chat api - Collect, display and store costs & accuracy of eval run currently in progress This provides more insight into eval runs during execution instead of having to wait until the eval run completes.	2024-11-20 12:59:44 -08:00
Debanjum	ffbd0ae3a5	Fix eval github workflow to run on releases, i.e on tags push	2024-11-20 12:57:42 -08:00
Debanjum	ed364fa90e	Track running costs & accuracy of eval runs in progress Collect, display and store running costs & accuracy of eval run. This provides more insight into eval runs during execution instead of having to wait until the eval run completes.	2024-11-20 12:40:51 -08:00
Debanjum	bbd24f1e98	Improve dropdown menus on web app setting page with scroll & min-width - Previously when the settings list became long the dropdown height would overflow the screen height. Now its max height is clamped and it scrolls vertically - Previously the dropdown content would take the width of its content. This would mean the menu could sometimes be less wide than the button. It felt strange. Now dropdown content is at least the width of the parent button	2024-11-20 12:27:13 -08:00
Debanjum	c53c3db96b	Track, return cost and usage metrics in chat api response - Track input, output token usage and cost for interactions via chat api with openai, anthropic and google chat models - Get usage metadata from OpenAI using stream_options - Handle openai proxies that do not support passing usage in response - Add new usage, end response events returned by chat api. - This can be optionally consumed by clients at a later point - Update streaming clients to mark message as completed after new end response event, not after end llm response event - Ensure usage data from final response generation step is included - Pass usage data after llm response complete. This allows gathering token usage and cost for the final response generation step across streaming and non-streaming modes	2024-11-20 12:17:58 -08:00
Debanjum	80df3bb8c4	Enable prompt tracing only when PROMPTRACE_DIR env var set Better to decouple prompt tracing from debug mode or verbosity level and require explicit, independent config to enable prompt tracing	2024-11-20 11:54:02 -08:00
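
The PROMPTRACE_DIR commits above (80df3bb8c4, f434c3fab2) describe gating prompt tracing on a single env var rather than on debug mode or verbosity. A minimal sketch of that kind of check follows. The function names are hypothetical and not taken from the khoj codebase, and treating a blank value as "unset" is an assumption.

```python
import os
from typing import Mapping, Optional


def prompt_trace_dir(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the trace directory if PROMPTRACE_DIR is set and non-blank, else None.

    Hypothetical helper: only the PROMPTRACE_DIR name comes from the commit log.
    """
    value = env.get("PROMPTRACE_DIR", "").strip()
    return value or None


def is_prompt_tracing_enabled(env: Mapping[str, str] = os.environ) -> bool:
    """Tracing depends only on PROMPTRACE_DIR, not on KHOJ_DEBUG or verbosity."""
    return prompt_trace_dir(env) is not None
```

Passing the environment in as a mapping keeps the check easy to exercise without mutating `os.environ`, for example `is_prompt_tracing_enabled({"PROMPTRACE_DIR": "/tmp/traces"})`.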

4080 commits