Collect, display and store the running cost & accuracy of an eval run.
This provides more insight into eval runs during execution instead of
having to wait until the eval run completes.
- Track input/output token usage and cost of interactions
  via the chat API with OpenAI, Anthropic and Google chat models
- Get usage metadata from OpenAI using stream_options (see the sketch after this list)
- Handle OpenAI proxies that do not support passing usage in the response
- Add new usage, end-response events returned by the chat API.
  - These can optionally be consumed by clients at a later point
- Update streaming clients to mark a message as completed after the new
  end-response event, not after the end-LLM-response event
- Ensure usage data from final response generation step is included
- Pass usage data after the LLM response completes. This allows gathering
token usage and cost for the final response generation step across
streaming and non-streaming modes
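
A minimal sketch of the `stream_options` flow, assuming the official `openai` Python client (v1.26+); the model name and print are illustrative:

```python
import openai

client = openai.OpenAI()

def chat_with_usage(messages: list[dict], model: str = "gpt-4o-mini"):
    """Stream a chat response and collect token usage from the final chunk."""
    usage = None
    text = ""
    stream = client.chat.completions.create(
        model=model,
        messages=messages,
        stream=True,
        stream_options={"include_usage": True},  # ask for a trailing usage chunk
    )
    for chunk in stream:
        if chunk.choices and chunk.choices[0].delta.content:
            text += chunk.choices[0].delta.content
        # The usage chunk arrives last, with an empty choices list.
        if chunk.usage:
            usage = chunk.usage
    # Proxies that don't support stream_options simply never send usage,
    # so treat it as optional.
    if usage:
        print(f"input={usage.prompt_tokens} output={usage.completion_tokens}")
    return text, usage
```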
- Previously, online chat actor and director tests only worked with OpenAI.
  This change allows running them against any supported online provider,
  including Google, Anthropic and OpenAI.
- Select the chat provider for online/offline chat actor, director tests
  in two ways (sketched below):
  1. Explicitly, by setting the KHOJ_TEST_CHAT_PROVIDER environment variable to
     google, anthropic, openai or offline
  2. Implicitly, via the first API key found among OpenAI, Google or Anthropic.
- Default the offline chat provider to Llama 3.2 3B for faster, lower
  compute test runs
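
A hedged sketch of that selection order; `KHOJ_TEST_CHAT_PROVIDER` is from the change above, while the per-provider key names (e.g. `GEMINI_API_KEY`) are assumptions following each provider's convention:

```python
import os

def get_test_chat_provider() -> str:
    """Pick the chat provider for actor/director tests."""
    provider = os.getenv("KHOJ_TEST_CHAT_PROVIDER")
    if provider in ("google", "anthropic", "openai", "offline"):
        return provider
    # Otherwise fall back to the first configured API key.
    if os.getenv("OPENAI_API_KEY"):
        return "openai"
    if os.getenv("GEMINI_API_KEY"):
        return "google"
    if os.getenv("ANTHROPIC_API_KEY"):
        return "anthropic"
    return "offline"
```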
- Set output mode to a single string. Specify the output schema in the prompt
- Both of these should encourage the model to select only one output mode,
  instead of repeating that instruction in the prompt too many times
- The output schema should also improve schema following in general (see
  the sketch after this list)
- Standardize the variable and function names of the io selector for readability
- Fix chat actor tests to exercise the io selector chat actor
- Make the chat actor return sources and output separately for better
  disambiguation, at least during tests, for now
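
Roughly, the idea looks like this; the schema fields and prompt wording are illustrative, not the actual ones used:

```python
# Hypothetical schema block appended to the io selector prompt. Requesting a
# single string for "output" nudges the model to pick exactly one output mode
# without repeating that instruction throughout the prompt.
OUTPUT_SCHEMA = (
    "Respond with JSON matching this schema:\n"
    '{"source": ["<information source>", ...], "output": "<single output mode>"}'
)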
- Evaluate Khoj on 200 random questions from each of the Google FRAMES and
  OpenAI SimpleQA benchmarks across *general*, *default* and *research* modes
- Run evals with Gemini 1.5 Flash as the test giver and Gemini 1.5 Pro as the
  test evaluator model
- Trigger eval workflow on release or manually
- Make the dataset, Khoj mode and sample size configurable when triggered via
  the manual workflow (sketched below)
- Enable Web search, webpage read tools during evaluation
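
The manual-workflow knobs might look something like this in the eval script; the variable names here are assumptions, not the real ones:

```python
import os

# Hypothetical env-driven configuration for manual eval workflow runs.
DATASET = os.getenv("KHOJ_EVAL_DATASET", "frames")      # "frames" or "simpleqa"
KHOJ_MODE = os.getenv("KHOJ_EVAL_MODE", "default")      # "general", "default" or "research"
SAMPLE_SIZE = int(os.getenv("KHOJ_EVAL_SAMPLE_SIZE", "200"))
```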
- JSON extraction from LLMs is pretty decent now, so get the input tools and output modes all in one go (sketched below). It helps the model think through the full cycle of what it wants to do to handle the request holistically.
- Make slight improvements to tool selection indicators
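
A sketch of the single-pass extraction; the JSON keys and defaults are illustrative:

```python
import json

def pick_tools_and_output(llm_json: str) -> tuple[list[str], str]:
    """Parse input tools and output mode from one LLM JSON response."""
    plan = json.loads(llm_json)
    input_tools = plan.get("source", [])      # e.g. ["online", "notes"]
    output_mode = plan.get("output", "text")  # e.g. "text" or "image"
    return input_tools, output_mode
```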
Previously, we'd replace the generated message with an error message
when message generation was stopped via the stop button on the web app's
chat page. So the partially generated message (which could be useful) was lost.
This change just stops generation, while keeping the generated
response so any useful information from the partially generated
message can be retrieved.
- Allow managing chat models in OpenAI proxy services like Ollama.
- Remove the need to manually add or remove chat models from the Khoj admin
  panel for these OpenAI-compatible API services when enabled.
- Khoj still maintains the chat model configs within Khoj, so they can
  be configured via the Khoj admin panel as usual (see sketch below).
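
For illustration, the sync could look roughly like this against Ollama's OpenAI-compatible endpoint; `update_khoj_chat_models` is a hypothetical helper:

```python
import openai

# List whatever models the proxy currently serves and mirror them into Khoj's
# chat model configs (still editable in the admin panel afterwards).
client = openai.OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")
available = [model.id for model in client.models.list().data]
update_khoj_chat_models(available)  # hypothetical helper
```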
Previously, Jina search didn't need an API key. Now that it does, re-use
the API key set in the Jina web scraper config, otherwise fall back to
the JINA_API_KEY environment variable, if either is present (sketched
below).
Resolves #978
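
A sketch of that resolution order; the config attribute name is an assumption:

```python
import os

def get_jina_api_key(scraper_config) -> str | None:
    """Prefer the Jina web scraper config's key, else the env variable."""
    return getattr(scraper_config, "api_key", None) or os.getenv("JINA_API_KEY")
```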
- Integrate with Ollama or other OpenAI-compatible APIs by simply
  setting the `OPENAI_API_BASE` environment variable in docker-compose etc.
  (see the sketch after this list)
- Update docs on integrating with Ollama and OpenAI proxies on first run
- Auto-populate all chat models supported by OpenAI-compatible APIs
- Auto-set vision enabled for all commercial models
- Minor
- Add huggingface cache to khoj_models volume. This is where chat
models and (now) sentence transformer models are stored by default
- Reduce verbosity of the web app's yarn install. Otherwise we hit the
  Docker log size limit & remaining logs after the web app install stop showing
- Suggest `ollama pull <model_name>` to kick off the model download in the background
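
On the client side, pointing at a compatible server reduces to something like this (assuming the `openai` Python client; the Ollama URL is an example):

```python
import os
import openai

# OPENAI_API_BASE=http://localhost:11434/v1 targets a local Ollama server;
# leaving it unset falls back to the official OpenAI endpoint.
client = openai.OpenAI(
    base_url=os.getenv("OPENAI_API_BASE"),
    api_key=os.getenv("OPENAI_API_KEY", "not-needed-for-local-servers"),
)
```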
- Update model initialization to use the latest Claude 3.5 Sonnet and Haiku models
- Set vision enabled for Google and Anthropic models by default.
  Previously we didn't mark this as supported, though we've supported it for a
  month or two now
- Just load the raw CSV from the OpenAI bucket and normalize it into the
  FRAMES format (see sketch below)
- Improve the docstrings for the FRAMES datasets as well
- Log the dataset load perf timer at info level
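
A sketch of that load-and-normalize step. The bucket URL is the published simple-evals SimpleQA CSV; the FRAMES-side column names are assumptions:

```python
import logging
from datetime import datetime

import pandas as pd

logger = logging.getLogger(__name__)

SIMPLEQA_CSV = "https://openaipublic.blob.core.windows.net/simple-evals/simple_qa_test_set.csv"

def load_simpleqa_as_frames(sample_size: int = 200) -> pd.DataFrame:
    """Load the raw SimpleQA CSV and normalize it into the FRAMES layout."""
    start = datetime.now()
    raw = pd.read_csv(SIMPLEQA_CSV)
    # Assumed mapping: SimpleQA's "problem"/"answer" -> FRAMES "Prompt"/"Answer".
    frames_like = raw.rename(columns={"problem": "Prompt", "answer": "Answer"})
    logger.info(f"Loaded SimpleQA dataset in {datetime.now() - start}")
    return frames_like.sample(n=sample_size)
```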
- Explicitly adding a slash command is a higher priority intent than
  research mode being enabled in the background. Respect that for a
  more intuitive UX flow (sketched below).
- Explicit slash commands do not currently work in research mode.
  You have to turn research mode off to use other slash commands. This
  is strange and unnecessary, given the intent priority is clear.
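
A sketch of the priority order; the function and mode names are illustrative:

```python
def resolve_intent(query: str, research_mode_enabled: bool) -> str:
    """An explicit slash command outranks the background research toggle."""
    if query.startswith("/"):
        return query.split()[0].lstrip("/")  # e.g. "/summarize ..." -> "summarize"
    if research_mode_enabled:
        return "research"
    return "default"
```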
Previously errors would get eaten up and the model wouldn't see
anything. The model also wasn't allowed to re-run the same query-tool
combination in the next iteration.
This update should give it insight into why it didn't get a result, so
it can make an informed (hopefully better) decision on what to do next
and re-run the previous query if appropriate.
Previously, when a call to the online search API etc. failed, it'd error
out of the response to the query in research mode. Khoj should skip tool
use for that iteration but continue trying to respond (see sketch below).
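
A sketch covering both notes above: record the failure so the model sees it in the next iteration, and keep the research loop going instead of aborting; the names are illustrative:

```python
def run_tool_safely(tool, query: str, context: list[str]) -> None:
    """Run one tool call; surface failures to the model instead of aborting."""
    try:
        context.append(f"{tool.name}({query!r}) -> {tool.run(query)}")
    except Exception as error:
        # Let the model see why it got no result so it can adjust or retry.
        context.append(f"{tool.name}({query!r}) failed: {error}")
```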
Previously chatml messages were just strings; now they can be lists of
strings or lists of dicts as well.
- Use JSON serialization to manage these variations and truncate them
  before printing for context (sketched below)
- Put the logic in a single function for use across chat models
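
The shared helper could look roughly like this; the truncation length is arbitrary:

```python
import json

def truncate_for_print(content, max_chars: int = 500) -> str:
    """Normalize str | list[str] | list[dict] content, then truncate."""
    if not isinstance(content, str):
        content = json.dumps(content)  # one uniform representation of the variants
    return content if len(content) <= max_chars else content[:max_chars] + "..."
```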
- Default to an evaluation decision of None when either the agent or the
  evaluator LLM fails. This fixes accuracy calculations on errors (see
  sketch below)
- Fix the color shown for a True decision
- Enable arg flags to specify output results file paths
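
Assuming None decisions are dropped from the denominator, the accuracy fix reads roughly as:

```python
def accuracy(decisions: list[bool | None]) -> float:
    """Ignore runs where the agent or evaluator LLM failed (decision is None)."""
    scored = [d for d in decisions if d is not None]
    return sum(scored) / len(scored) if scored else 0.0
```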
Previously chatml messages were just strings.
Since Gemini and Anthropic models always have message content as a list
of strings, truncate those strings instead of the list of message content.