sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2025-02-17 08:04:21 +00:00

Author	SHA1	Message	Date
Debanjum	22f3ed3f5d	Research Mode: Give Khoj the ability to perform more advanced reasoning (#952 ) ## Overview Khoj can now go into research mode and use a python code interpreter. These are experimental features that are being released early for feedback and testing. - Research mode allows Khoj to dynamically select the tools it needs to best answer the question. It is also allowed more iterations to get to a satisfactory answer. Its more dynamic train of thought is shown to improve visibility into its thinking. - Adding ability for Khoj to use a python code interpreter is an adjacent capability. It can help Khoj do some data analysis and generate charts for you. A sandboxed python to run code is provided using [cohere-terrarium](https://github.com/cohere-ai/cohere-terrarium?tab=readme-ov-file), [pyodide](https://pyodide.org/). ## Analysis Research mode (significantly?) improves Khoj's information retrieval for more complex queries requiring multi-step lookups but takes longer to run. It can research for longer, requiring less back-n-forth with the user to find an answer. Research mode gives most gains when used with more advanced chat models (like o1, 4o, new claude sonnet and gemini-pro-002). Smaller models improve their response quality but tend to get into repetitive loops more often. ## Next Steps - Get community feedback on research mode. What works, what fails, what is confusing, what'd be cool to have. - Tune Khoj's capabilities for longer autonomous runs and to generalize across a larger range of model sizes ## Miscellaneous Improvements - Khoj's train of thought is saved and shown for all messages, not just the latest one - Render charts generated by Khoj and code running using the code tool on the web app - Align chat input color to currently selected agent color	2024-11-01 14:46:29 -07:00
sabaimran	baa939f4ce	When running code, strip any code delimiters. Disable application json type specification in Gemini request.	2024-11-01 13:47:39 -07:00
sabaimran	8fd2fe162f	Determine if research mode is enabled by checking the conversation commands and 'linting' them in the selection phase	2024-11-01 13:12:34 -07:00
sabaimran	cead1598b9	Don't reset research mode after completing research execution	2024-11-01 13:00:11 -07:00
Debanjum	c1c779a7ef	Do not yaml format raw code results in context for LLM. It's confusing	2024-11-01 12:45:26 -07:00
Debanjum	cd75151431	Do not allow auto selecting research mode as tool for now. You are required to manually turning it on. This takes longer and should be a high intent activity initiated by user	2024-11-01 12:07:52 -07:00
Debanjum	0b0cfb35e6	Simplify in research mode check in api_chat. - Dedent code for readability - Use better name for in research mode check - Continue to remove inferred summarize command when multiple files in file filter even when not in research mode - Continue to show select information source train of thought. It was removed by mistake earlier	2024-11-01 12:07:08 -07:00
Debanjum	73750ef286	Merge branch 'master' into features/advanced-reasoning	2024-11-01 11:42:01 -07:00
Debanjum	1c920273dd	Add Prompt Tracer to Visualize, Analyze and Debug Khoj's Train of Thought (#951 ) ## Overview Use git to capture prompt traces of khoj's train of thought. View, analyze and debug them using your favorite git client (e.g vscode, magit). - Each commit captures an interaction with an LLM The commit writes the query, response and system message each to a separate file in the repo. The commit message captures the chat model, Khoj version and other metadata - Each conversation turn can have multiple interactions with an LLM (e.g Khoj's train of thought) - Each new conversation turn forks from and merges back into its conversation branch - Each new conversation branches from the user branch - Each new user branches from root commit on the main branch ## Usage 1. Set `KHOJ_DEBUG=true` or start khoj in very verbose mode with `khoj -vv` to turn on prompt tracing 2. Chat with Khoj as usual 3. Open the promptrace git repo to view the generated prompt traces using your favorite git porcelain. The Khoj prompt trace git repo is created at `/tmp/khoj_promptrace` by default. You can configure the prompt trace directory by setting the `PROMPTRACE_DIR`environment variable. ## Implementation - Add utility functions to capture prompt traces using git (via `gitpython`) - Make each model provider in Khoj commit their LLM interactions with promptrace - Weave chat metadata from chat API through all chat actors and commit it to the prompt trace	2024-11-01 11:33:54 -07:00
sabaimran	7ebf999688	Merge branch 'master' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-31 18:15:13 -07:00
sabaimran	159ea44883	Remove frame references in the diagramming prompts	2024-10-31 18:14:51 -07:00
sabaimran	8d1ecb9bd8	Add optional brew steps for docker install	2024-10-31 12:41:53 -07:00
Debanjum	adca6cbe9d	Merge branch 'master' into add-prompt-tracer-for-observability	2024-10-31 02:28:34 -07:00
Debanjum	1448b8b3fc	Use 3rd person for user in research prompt to reduce person confusion Models were getting a bit confused about who is search for who's information. Using third person to explicitly call out on who's behalf these searches are running seems to perform better across models (gemini's, gpt etc.), even if the role of the message is user.	2024-10-30 13:49:48 -07:00
Debanjum	b8c6989677	Separate example from actual question in extract question prompt	2024-10-30 13:49:48 -07:00
Debanjum	86ffd7a7a2	Handle \n, dedupe json cleaning into single function for reusability Use placeholder for newline in json object values until json parsed and values extracted. This is useful when research mode models outputs multi-line codeblocks in queries etc.	2024-10-30 13:49:48 -07:00
Debanjum	83ca820abe	Encourage Anthropic models to output json object using { prefill Anthropic API doesn't have ability to enforce response with valid json object, unlike all the other model types. While the model will usually adhere to json output instructions. This step is meant to more strongly encourage it to just output json object when response_type of json_object is requested.	2024-10-30 13:49:48 -07:00
Debanjum	dc8e89b5de	Pass tool AIs iteration history as chat history for better context Separate conversation history with user from the conversation history between the tool AIs and the researcher AI. Tools AIs don't need top level conversation history, that context is meant for the researcher AI. The invoked tool AIs need previous attempts at using the tool in this research runs iteration history to better tune their next run. Or at least that is the hypothesis to break the models looping.	2024-10-30 13:49:48 -07:00
Debanjum	d865994062	Rename code tool arg `previous_iteration_history' to` context'	2024-10-30 13:49:48 -07:00
Debanjum	06aeca2670	Make researcher, docs search AIs ask more diverse retrieval questions Models weren't generating a diverse enough set of questions. They'd do minor variations on the original query. What is required is asking queries from a bunch of different lenses to retrieve the requisite information. This prompt updates shows the AIs the breadth of questions to by example and instruction. Seem like performance improved based on vibes	2024-10-30 13:49:48 -07:00
Debanjum	01881dc7a2	Revert "Make extract question prompt in 1st person wrt user as its a user message" This reverts commit 6d3602798aa1b95a30c557576fd4f93ddef2ae76.	2024-10-30 13:49:48 -07:00
Debanjum	3e695df198	Make extract question prompt in 1st person wrt user as its a user message Divide Example from Actual chat history section in prompt	2024-10-30 13:49:48 -07:00
Debanjum	a3751d6a04	Make extract relevant information system prompt work for any document Previously it was too strongly tuned for extracting information from only webpages. This shouldn't be necessary	2024-10-30 13:49:48 -07:00
Debanjum	a39e747d07	Improve passing user name in pick next research tool prompt	2024-10-30 13:49:48 -07:00
Debanjum	deff512baa	Improve research mode prompts to reduce looping, increase webpage reads	2024-10-30 13:49:48 -07:00
Debanjum	d3184ae39a	Simplify storing and displaying document results in research mode - Mention count of notes and files disovered - Store query associated with each compiled reference retrieved for easier referencing	2024-10-30 13:49:48 -07:00
Debanjum	8bd94bf855	Do not use a message branch if no msg id provided to prompt tracer	2024-10-30 13:49:48 -07:00
sabaimran	b63fbc5345	Add a simple badget to the dropdown menu that shows subscription status	2024-10-30 13:00:16 -07:00
sabaimran	82f3d79064	Merge branch 'master' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-30 11:32:10 -07:00
sabaimran	2b2564257e	Handle subscription case where it's set to trial, but renewal_date is not set. set the renewal_date for LENGTH_OF_FREE_TRIAL days from subscription creation.	2024-10-30 11:05:31 -07:00
Debanjum	9935d4db0b	Do not use a message branch if no msg id provided to prompt tracer	2024-10-28 17:50:27 -07:00
Debanjum	d184498038	Pass context in separate message from user query to research chat actor	2024-10-28 15:37:28 -07:00
Debanjum	d75ce4a9e3	Format online, notes, code context with YAML to be legibile for LLM	2024-10-28 15:37:28 -07:00
sabaimran	5bea0c705b	Use break-words in the train of thought for better formatting	2024-10-28 15:36:06 -07:00
sabaimran	1f1b182461	Automatically carry over research mode from home page to chat - Improve mobile friendliness with new research mode toggle, since chat input area is now taking up more space - Remove clunky title from the suggestion card - Fix fk lookup error for agent.creator	2024-10-28 15:29:24 -07:00
sabaimran	ebaed53069	Merge branch 'master' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-28 12:39:00 -07:00
sabaimran	889dbd738a	Add keyword diagram to diagram output mode description	2024-10-28 12:20:46 -07:00
Debanjum	50ffd7f199	Merge branch 'master' into features/advanced-reasoning	2024-10-28 04:10:59 -07:00
Debanjum	a5d0ca6e1c	Use selected agent color to theme the chat input area on home page	2024-10-28 03:47:40 -07:00
Debanjum	aad7528d1b	Render slash commands popup below chat input text area on home page	2024-10-28 02:06:04 -07:00
Debanjum	3e17ab438a	Separate notes, online context from user message sent to chat models (#950 ) Overview --- - Put context into separate user message before sending to chat model. This should improve model response quality and truncation logic in code - Pass online context from chat history to chat model for response. This should improve response speed when previous online context can be reused - Improve format of notes, online context passed to chat models in prompt. This should improve model response quality Details --- The document, online search context are now passed as separate user messages to chat model, instead of being added to the final user message. This will improve - Models ability to differentiate data from user query. That should improve response quality and reduce prompt injection probability - Make truncation logic simpler and more robust When context window hit, can simply pop messages to auto truncate context in order of context, user, assistant message for each conversation turn in history until reach current user query The complex, brittle logic to extract user query from context in last user message isn't required.	2024-10-28 02:03:18 -07:00
Debanjum	8ddd70f3a9	Put context into separate message before sending to offline chat model Align context passed to offline chat model with other chat models - Pass context in separate message for better separation between user query and the shared context - Pass filename in context - Add online results for webpage conversation command	2024-10-28 00:22:21 -07:00
Debanjum	ee0789eb3d	Mark context messages with user role as context role isn't being used Context role was added to allow change message truncation order based on context role as well. Revert it for now since currently this is not currently being done.	2024-10-28 00:04:14 -07:00
Debanjum	4e39088f5b	Make agent name in home page carousel not text wrap on mobile	2024-10-27 23:03:53 -07:00
Debanjum	94074b7007	Focus chat input on toggle research mode. v-align it with send button	2024-10-27 22:54:55 -07:00
sabaimran	a691ce4aa6	Batch entries into smaller groups to process	2024-10-27 20:43:41 -07:00
sabaimran	2924909692	Add a research mode toggle to the chat input area	2024-10-27 16:37:40 -07:00
sabaimran	68499e253b	Auto-collapse train of thought, show after chat response in history	2024-10-27 15:48:13 -07:00
sabaimran	101ea6efb1	Add research mode as a slash command, remove from default path	2024-10-27 15:47:44 -07:00
sabaimran	0bd78791ca	Let user exit from command mode with esc, click out, etc.	2024-10-27 15:01:49 -07:00

1 2 3 4 5 ...

3731 commits