sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-23 15:38:55 +01:00

Author	SHA1	Message	Date
sabaimran	6a054d884b	Add quicker/easier filtering on auth	2024-11-22 12:03:01 -08:00
Debanjum	b9a889ab69	Fix Khoj responses when code generated charts in response context The current fix should improve Khoj responses when charts in response context. It truncates code context before sharing with response chat actors. Previously Khoj would respond with it not being able to create chart but than have a generated chart in it's response in default mode. The truncate code context was added to research chat actor for decision making but it wasn't added to conversation response generation chat actors. When khoj generated charts with code for its response, the images in the context would exceed context window limits. So the truncation logic to drop all past context, including chat history, context gathered for current response. This would result in chat response generator 'forgetting' all for the current response when code generated images, charts in response context.	2024-11-21 14:43:52 -08:00
Debanjum	5475a262d4	Move truncate code context func for reusability across modules It needs to be used across routers and processors. It being in run_code tool makes it hard to be used in other chat provider contexts due to circular dependency issues created by send_message_to_model_wrapper func	2024-11-21 14:27:39 -08:00
Debanjum	f434c3fab2	Fix toggling prompt tracer on/off in Khoj via PROMPTRACE_DIR env var Previous changes to depend on just the PROMPTRACE_DIR env var instead of KHOJ_DEBUG or verbosity flag was partial/incomplete. This fix adds all the changes required to only depend on the PROMPTRACE_DIR env var to enable/disable prompt tracing in Khoj.	2024-11-21 14:06:00 -08:00
Debanjum	4a40cf79c3	Add docs on how to cross-device access self-hosted khoj using tailscale	2024-11-21 11:07:18 -08:00
Debanjum	1f96c13f72	Enable starting khoj uvicorn server with ssl cert file, key for https Pass your domain cert files via the --sslcert, --sslkey cli args. For example, to start khoj at https://example.com, you'd run command: KHOJ_DOMAIN=example.com khoj --sslcert example.com.crt --sslkey example.com.key --host example.com This sets up ssl certs directly with khoj without requiring a reverse proxy like nginx to serve khoj behind https endpoint for simple setups. More complex setups should, of course, still use a reverse proxy for efficient request processing	2024-11-21 11:07:18 -08:00
sabaimran	9fea02f20f	In telemetry, differentiate create_user google and email	2024-11-21 11:01:37 -08:00
sabaimran	9db885b5f7	Limit access to chat models to futurist users	2024-11-21 07:53:24 -08:00
sabaimran	7a00a07398	Add trailing slash to Ollama url in docs	2024-11-21 07:48:18 -08:00
sabaimran	3519dd76f0	Fix type of excalidraw image response	2024-11-20 19:01:13 -08:00
sabaimran	467de76fc1	Improve the image diagramming prompts and response parsing	2024-11-20 18:59:40 -08:00
Debanjum	50d8405981	Enable khoj to use terrarium code sandbox as tool in eval workflow	2024-11-20 14:19:27 -08:00
Debanjum	2203236e4c	Update desktop app dependencies	2024-11-20 13:05:55 -08:00
Debanjum	409204917e	Update documentation website dependencies	2024-11-20 13:05:32 -08:00
Debanjum	6f1adcfe67	Track Usage Metrics in Chat API. Track Running Cost, Accuracy in Evals (#985 ) - Track, return cost and usage metrics in chat api response Track input, output token usage and cost of interactions with openai, anthropic and google chat models for each call to the khoj chat api - Collect, display and store costs & accuracy of eval run currently in progress This provides more insight into eval runs during execution instead of having to wait until the eval run completes.	2024-11-20 12:59:44 -08:00
Debanjum	ffbd0ae3a5	Fix eval github workflow to run on releases, i.e on tags push	2024-11-20 12:57:42 -08:00
Debanjum	ed364fa90e	Track running costs & accuracy of eval runs in progress Collect, display and store running costs & accuracy of eval run. This provides more insight into eval runs during execution instead of having to wait until the eval run completes.	2024-11-20 12:40:51 -08:00
Debanjum	bbd24f1e98	Improve dropdown menus on web app setting page with scroll & min-width - Previously when settings list became long the dropdown height would overflow screen height. Now it's max height is clamped and y-scroll - Previously the dropdown content would take width of content. This would mean the menu could sometimes be less wide than the button. It felt strange. Now dropdown content is at least width of parent button	2024-11-20 12:27:13 -08:00
Debanjum	c53c3db96b	Track, return cost and usage metrics in chat api response - Track input, output token usage and cost for interactions via chat api with openai, anthropic and google chat models - Get usage metadata from OpenAI using stream_options - Handle openai proxies that do not support passing usage in response - Add new usage, end response events returned by chat api. - This can be optionally consumed by clients at a later point - Update streaming clients to mark message as completed after new end response event, not after end llm response event - Ensure usage data from final response generation step is included - Pass usage data after llm response complete. This allows gathering token usage and cost for the final response generation step across streaming and non-streaming modes	2024-11-20 12:17:58 -08:00
Debanjum	80df3bb8c4	Enable prompt tracing only when PROMPTRACE_DIR env var set Better to decouple prompt tracing from debug mode or verbosity level and require explicit, independent config to enable prompt tracing	2024-11-20 11:54:02 -08:00
Debanjum	9ab76ccaf1	Skip adding agent to chat metadata when chat unset to avoids null ref	2024-11-19 21:10:23 -08:00
Debanjum	4da0499cd7	Stream responses by openai's o1 model series, as api now supports it Previously o1 models did not support streaming responses via API. Now they seem to do	2024-11-19 21:10:23 -08:00
sabaimran	e5347dac8c	Fix base image used for prod in docs	2024-11-19 15:51:27 -08:00
sabaimran	b943069577	Fix button text, and login url in self-hosted auth docs	2024-11-19 15:50:13 -08:00
sabaimran	3b5e6a9f4d	Update authentication documentation	2024-11-19 15:45:47 -08:00
Debanjum	7bdc9590dd	Fix handling sources, output in chat actor when is automated task Remove unnecessary ```python prefix removal. It isn't being triggered in json deserialize path.	2024-11-19 13:49:27 -08:00
Debanjum	0e7d611a80	Remove ```python codeblock prefix from raw json before deserialize	2024-11-19 12:53:52 -08:00
Debanjum	001c13ef43	Upgrade web app package dependencies	2024-11-19 12:53:52 -08:00
sabaimran	4f5c1eeded	Update some of the open graph data for the documentation website	2024-11-19 11:14:46 -08:00
sabaimran	5134d49d71	Release Khoj version 1.30.1	2024-11-18 17:30:33 -08:00
sabaimran	8bdd0b26d3	And a connections clean up decorator to all scheduled tasks	2024-11-18 17:19:36 -08:00
Debanjum	817601872f	Update default offline models enabled	2024-11-18 16:38:17 -08:00
Debanjum	45c623f95c	Dedupe, organize chat actor, director tests - Move Chat actor tests that were previously in chat director tests file - Dedupe online, offline io selector chat actor tests	2024-11-18 16:10:50 -08:00
Debanjum	2a76c69d0d	Run online, offine chat actor, director tests for any supported provider - Previously online chat actors, director tests only worked with openai. This change allows running them for any supported onlnie provider including Google, Anthropic and Openai. - Enable online/offline chat actor, director in two ways: 1. Explicitly setting KHOJ_TEST_CHAT_PROVIDER environment variable to google, anthropic, openai, offline 2. Implicitly by the first API key found from openai, google or anthropic. - Default offline chat provider to use Llama 3.1 3B for faster, lower compute test runs	2024-11-18 15:11:37 -08:00
Debanjum	653127bf1d	Improve data source, output mode selection - Set output mode to single string. Specify output schema in prompt - Both thesee should encourage model to select only 1 output mode instead of encouraging it in prompt too many times - Output schema should also improve schema following in general - Standardize variable, func name of io selector for readability - Fix chat actors to test the io selector chat actor - Make chat actor return sources, output separately for better disambiguation, at least during tests, for now	2024-11-18 15:11:37 -08:00
Debanjum	e3fd51d14b	Pass user arg to create title from query in new automation flow	2024-11-18 12:58:10 -08:00
Debanjum	9e74de9b4f	Improve serializing conversation JSON to print messages on console - Handle chatml message.content with non-json serializable data like WebP image binary data used by Gemini models	2024-11-18 12:57:05 -08:00
sabaimran	3f70d2f685	Add more graceful exception handling when tool selection doesn't work	2024-11-18 09:34:49 -08:00
Debanjum	a2ccf6f59f	Fix github workflow to start Khoj, connect to PG and upload results - Do not trigger tests to run in ci on update to evals	2024-11-18 04:25:15 -08:00
Debanjum	7c0fd71bfd	Add GitHub workflow to quiz Khoj across modes and specified evals (#982 ) - Evaluate khoj on random 200 questions from each of google frames and openai simpleqa benchmarks across general, default and research modes - Run eval with Gemini 1.5 Flash as test giver and Gemini 1.5 Pro as test evaluator models - Trigger eval workflow on release or manually - Make dataset, khoj mode and sample size configurable when triggered via manual workflow - Enable Web search, webpage read tools during evaluation	2024-11-18 02:19:30 -08:00
sabaimran	f75085dc7a	Release Khoj version 1.30.0	2024-11-17 21:36:22 -08:00
sabaimran	c72813ba67	Merge pull request #981 from rznzippy/bugfix/980/database-connections-leakage Fix database connections leakage (#980)	2024-11-17 21:01:06 -08:00
sabaimran	7d50c6590d	Merge pull request #977 from khoj-ai/features/improve-tool-selection - JSON extract from LLMs is pretty decent now, so get the input tools and output modes all in one go. It'll help the model think through the full cycle of what it wants to do to handle the request holistically. - Make slight improvements to tool selection indicators	2024-11-17 20:08:19 -08:00
sabaimran	282f47e0d6	Add Jina documentation to readme for self-hosting	2024-11-17 17:20:28 -08:00
Debanjum	48567fd468	Do not erase partial message when generation stopped via button on web app Previously, we'd replace the generated message with an error message when message generation stopped via stop button on chat page of web app. So the partially generated message (which could be useful) gets lost. This change just stops generation, while keeping the generated response so any useful information from the partially generated message can be retrieved.	2024-11-17 16:29:18 -08:00
Debanjum	285006d6c9	Sync chat models in Khoj with OpenAI proxies (e.g Ollama) on startup - Allows managing chat models in the OpenAI proxy service like Ollama. - Removes need to manually add, remove chat models from Khoj Admin Panel for these OpenAI compatible API services when enabled. - Khoj still mantains the chat models configs within Khoj, so they can be configured via the Khoj admin panel as usual.	2024-11-17 15:34:36 -08:00
Debanjum	4a7f5d1abe	Set API keys in docker-compose.yml to enable web search, scrape tools	2024-11-17 15:34:36 -08:00
Debanjum	d6eece63f4	Use Jina API Key of Jina web scraper if configured in DB Previously Jina search didn't API key. Now that it does need API key, we should re-use the API key set in the Jina web scraper config, otherwise fallback to using JINA_API_KEY from environment variable, if either is present. Resolves #978	2024-11-17 15:34:14 -08:00
sabaimran	6531f24ca0	Further improvements for descriptions to LLM on modes, code, diagram, image.	2024-11-17 13:23:57 -08:00
sabaimran	0eba6ce315	When diagram generation fails, save to conversation log - Update tool name when choosing tools to execute	2024-11-17 13:23:12 -08:00

1 2 3 4 5 ...

3950 commits