sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-23 23:48:56 +01:00

Author	SHA1	Message	Date
Debanjum	f64f5b3b6e	Handle add/delete file filter operation on non-existent conversation	2024-10-30 14:00:21 -07:00
Debanjum	b3a63017b5	Support setting seed for reproducible LLM response generation Anthropic models do not support seed. But offline, gemini and openai models do. Use these to debug and test Khoj via KHOJ_LLM_SEED env var	2024-10-30 14:00:21 -07:00
Debanjum	d44e68ba01	Improve handling embedding model config from admin interface - Allow server to start if loading embedding model fails with an error. This allows fixing the embedding model config via admin panel. Previously server failed to start if embedding model was configured incorrectly. This prevented fixing the model config via admin panel. - Convert boolean string in config json to actual booleans when passed via admin panel as json before passing to model, query configs - Only create default model if no search model configured by admin. Return first created search model if its been configured by admin.	2024-10-30 14:00:21 -07:00
Debanjum	358a6ce95d	Defer turning cursor color to selected agents color for later Capability exists but idea needs to be investigated further	2024-10-30 14:00:21 -07:00
Debanjum	2ac840e3f2	Make cursor in chat input take on selected agent color	2024-10-30 14:00:21 -07:00
Debanjum	1448b8b3fc	Use 3rd person for user in research prompt to reduce person confusion Models were getting a bit confused about who is search for who's information. Using third person to explicitly call out on who's behalf these searches are running seems to perform better across models (gemini's, gpt etc.), even if the role of the message is user.	2024-10-30 13:49:48 -07:00
Debanjum	b8c6989677	Separate example from actual question in extract question prompt	2024-10-30 13:49:48 -07:00
Debanjum	86ffd7a7a2	Handle \n, dedupe json cleaning into single function for reusability Use placeholder for newline in json object values until json parsed and values extracted. This is useful when research mode models outputs multi-line codeblocks in queries etc.	2024-10-30 13:49:48 -07:00
Debanjum	83ca820abe	Encourage Anthropic models to output json object using { prefill Anthropic API doesn't have ability to enforce response with valid json object, unlike all the other model types. While the model will usually adhere to json output instructions. This step is meant to more strongly encourage it to just output json object when response_type of json_object is requested.	2024-10-30 13:49:48 -07:00
Debanjum	dc8e89b5de	Pass tool AIs iteration history as chat history for better context Separate conversation history with user from the conversation history between the tool AIs and the researcher AI. Tools AIs don't need top level conversation history, that context is meant for the researcher AI. The invoked tool AIs need previous attempts at using the tool in this research runs iteration history to better tune their next run. Or at least that is the hypothesis to break the models looping.	2024-10-30 13:49:48 -07:00
Debanjum	d865994062	Rename code tool arg `previous_iteration_history' to` context'	2024-10-30 13:49:48 -07:00
Debanjum	06aeca2670	Make researcher, docs search AIs ask more diverse retrieval questions Models weren't generating a diverse enough set of questions. They'd do minor variations on the original query. What is required is asking queries from a bunch of different lenses to retrieve the requisite information. This prompt updates shows the AIs the breadth of questions to by example and instruction. Seem like performance improved based on vibes	2024-10-30 13:49:48 -07:00
Debanjum	01881dc7a2	Revert "Make extract question prompt in 1st person wrt user as its a user message" This reverts commit 6d3602798aa1b95a30c557576fd4f93ddef2ae76.	2024-10-30 13:49:48 -07:00
Debanjum	3e695df198	Make extract question prompt in 1st person wrt user as its a user message Divide Example from Actual chat history section in prompt	2024-10-30 13:49:48 -07:00
Debanjum	a3751d6a04	Make extract relevant information system prompt work for any document Previously it was too strongly tuned for extracting information from only webpages. This shouldn't be necessary	2024-10-30 13:49:48 -07:00
Debanjum	a39e747d07	Improve passing user name in pick next research tool prompt	2024-10-30 13:49:48 -07:00
Debanjum	deff512baa	Improve research mode prompts to reduce looping, increase webpage reads	2024-10-30 13:49:48 -07:00
Debanjum	d3184ae39a	Simplify storing and displaying document results in research mode - Mention count of notes and files disovered - Store query associated with each compiled reference retrieved for easier referencing	2024-10-30 13:49:48 -07:00
Debanjum	8bd94bf855	Do not use a message branch if no msg id provided to prompt tracer	2024-10-30 13:49:48 -07:00
sabaimran	b63fbc5345	Add a simple badget to the dropdown menu that shows subscription status	2024-10-30 13:00:16 -07:00
sabaimran	82f3d79064	Merge branch 'master' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-30 11:32:10 -07:00
sabaimran	2b2564257e	Handle subscription case where it's set to trial, but renewal_date is not set. set the renewal_date for LENGTH_OF_FREE_TRIAL days from subscription creation.	2024-10-30 11:05:31 -07:00
Debanjum	9935d4db0b	Do not use a message branch if no msg id provided to prompt tracer	2024-10-28 17:50:27 -07:00
Debanjum	d184498038	Pass context in separate message from user query to research chat actor	2024-10-28 15:37:28 -07:00
Debanjum	d75ce4a9e3	Format online, notes, code context with YAML to be legibile for LLM	2024-10-28 15:37:28 -07:00
sabaimran	5bea0c705b	Use break-words in the train of thought for better formatting	2024-10-28 15:36:06 -07:00
sabaimran	1f1b182461	Automatically carry over research mode from home page to chat - Improve mobile friendliness with new research mode toggle, since chat input area is now taking up more space - Remove clunky title from the suggestion card - Fix fk lookup error for agent.creator	2024-10-28 15:29:24 -07:00
sabaimran	ebaed53069	Merge branch 'master' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-28 12:39:00 -07:00
sabaimran	889dbd738a	Add keyword diagram to diagram output mode description	2024-10-28 12:20:46 -07:00
Debanjum	50ffd7f199	Merge branch 'master' into features/advanced-reasoning	2024-10-28 04:10:59 -07:00
Debanjum	a5d0ca6e1c	Use selected agent color to theme the chat input area on home page	2024-10-28 03:47:40 -07:00
Debanjum	aad7528d1b	Render slash commands popup below chat input text area on home page	2024-10-28 02:06:04 -07:00
Debanjum	3e17ab438a	Separate notes, online context from user message sent to chat models (#950 ) Overview --- - Put context into separate user message before sending to chat model. This should improve model response quality and truncation logic in code - Pass online context from chat history to chat model for response. This should improve response speed when previous online context can be reused - Improve format of notes, online context passed to chat models in prompt. This should improve model response quality Details --- The document, online search context are now passed as separate user messages to chat model, instead of being added to the final user message. This will improve - Models ability to differentiate data from user query. That should improve response quality and reduce prompt injection probability - Make truncation logic simpler and more robust When context window hit, can simply pop messages to auto truncate context in order of context, user, assistant message for each conversation turn in history until reach current user query The complex, brittle logic to extract user query from context in last user message isn't required.	2024-10-28 02:03:18 -07:00
Debanjum	8ddd70f3a9	Put context into separate message before sending to offline chat model Align context passed to offline chat model with other chat models - Pass context in separate message for better separation between user query and the shared context - Pass filename in context - Add online results for webpage conversation command	2024-10-28 00:22:21 -07:00
Debanjum	ee0789eb3d	Mark context messages with user role as context role isn't being used Context role was added to allow change message truncation order based on context role as well. Revert it for now since currently this is not currently being done.	2024-10-28 00:04:14 -07:00
Debanjum	4e39088f5b	Make agent name in home page carousel not text wrap on mobile	2024-10-27 23:03:53 -07:00
Debanjum	94074b7007	Focus chat input on toggle research mode. v-align it with send button	2024-10-27 22:54:55 -07:00
sabaimran	a691ce4aa6	Batch entries into smaller groups to process	2024-10-27 20:43:41 -07:00
sabaimran	2924909692	Add a research mode toggle to the chat input area	2024-10-27 16:37:40 -07:00
sabaimran	68499e253b	Auto-collapse train of thought, show after chat response in history	2024-10-27 15:48:13 -07:00
sabaimran	101ea6efb1	Add research mode as a slash command, remove from default path	2024-10-27 15:47:44 -07:00
sabaimran	0bd78791ca	Let user exit from command mode with esc, click out, etc.	2024-10-27 15:01:49 -07:00
sabaimran	a121d67b10	Persist the train of thought in the conversation history	2024-10-26 23:46:15 -07:00
sabaimran	9e8ac7f89e	Fix input/output mismatches in the /summarize command	2024-10-26 16:37:58 -07:00
sabaimran	e4285941d1	Use the advanced chat model if the user is subscribed	2024-10-26 16:00:54 -07:00
sabaimran	33e48aa27e	Merge branch 'add-prompt-tracer-for-observability' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-26 14:09:00 -07:00
sabaimran	fd71a4b086	Add better exception handling in the prompt trace logic, use default value from parameters	2024-10-26 14:08:00 -07:00
Debanjum	3e5b5ec122	Encourage model to read webpages more often after online search Previously model would rarely read webpages after webpage search. Need the model to webpages more regularly for deeper research and to stop getting stuck in repetitive online search loops	2024-10-26 10:49:09 -07:00
Debanjum	bf96d81943	Format online results as YAML to pass it in more readable form to model Previous passing of online results as json dump in prompts was less readable for humans, and I'm guessing less readable for models (trained on human data) as well?	2024-10-26 10:49:09 -07:00
Debanjum	3e97ebf0c7	Unescape special characters in prompt traces for better readability	2024-10-26 10:49:09 -07:00
Debanjum	8af9dc3ee1	Unescape special characters in prompt traces for better readability	2024-10-26 10:45:42 -07:00
Debanjum Singh Solanky	0f3927e810	Send gathered references to client after code results calculated	2024-10-26 05:59:10 -07:00
Debanjum Singh Solanky	f04f871a72	Merge branch 'add-prompt-tracer-for-observability' of github.com:khoj-ai/khoj into features/advanced-reasoning - Start from this branches src/khoj/routers/api_chat.py Add tracer to all old and new chat actors that don't have it set when they are called. - Update the new chat actors like apick next tool etc to use tracer too	2024-10-26 05:56:13 -07:00
Debanjum Singh Solanky	ddc6ccde2d	Merge branch 'master' into features/advanced-reasoning - Conflicts: Combine both sides of the conflict in all 3 files below - src/khoj/processor/conversation/utils.py - src/khoj/routers/helpers.py - src/khoj/utils/helpers.py	2024-10-26 05:15:51 -07:00
Debanjum Singh Solanky	ea0712424b	Commit conversation traces using user, chat, message branch hierarchy - Message train of thought forks and merges from its conversation branch - Conversation branches from user branch - User branches from root commit on the main branch - Weave chat tracer metadata from api endpoint through all chat actors and commit it to the prompt trace	2024-10-26 05:08:47 -07:00
Debanjum Singh Solanky	a3022b7556	Allow Offline Chat model calling functions to save conversation traces	2024-10-26 05:08:47 -07:00
Debanjum Singh Solanky	eb6424f14d	Allow Anthropic API calling functions to save conversation traces	2024-10-26 05:08:47 -07:00
Debanjum Singh Solanky	6fcd6a5659	Allow Gemini API calling functions to save conversation traces	2024-10-26 05:08:47 -07:00
Debanjum Singh Solanky	384f394336	Allow OpenAI API calling functions to save conversation traces	2024-10-26 04:59:21 -07:00
Debanjum Singh Solanky	10c8fd3b2a	Save conversation traces to git for visualization	2024-10-26 04:59:19 -07:00
sabaimran	7e0a692d16	Release Khoj version 1.27.1	2024-10-25 15:23:07 -07:00
sabaimran	b257fa1884	Add a None check before doing a DT comparison when getting subscription type	2024-10-25 15:22:48 -07:00
sabaimran	0f6f282c30	Release Khoj version 1.27.0	2024-10-25 14:11:14 -07:00
sabaimran	479e156168	Add to the ConversationCommand.Image description to LLM	2024-10-25 09:14:32 -07:00
sabaimran	a11b5293fb	Add uploaded images to research mode, code slash command, include code references	2024-10-24 23:56:24 -07:00
sabaimran	5acf40c440	Clean up summarization code paths Use assumption of summarization response being a str	2024-10-24 23:56:24 -07:00
sabaimran	12b32a3d04	Resolve merge conflicts	2024-10-24 23:43:55 -07:00
Debanjum	adee5a3e20	Give Vision to Anthropic models in Khoj (#948 ) ### Major - Give Vision to Anthropic models in Khoj ### Minor - Reuse logic to format messages for chat with anthropic models - Make the get image from url function more versatile and reusable - Encourage output mode chat actor to output only json and nothing else	2024-10-24 18:02:38 -07:00
Debanjum Singh Solanky	01d740debd	Return typed image from image_with_url function for readability	2024-10-24 17:58:46 -07:00
Debanjum Singh Solanky	37317e321d	Dedupe user location passed in image, diagram generation prompts	2024-10-24 01:03:29 -07:00
Debanjum Singh Solanky	2a32836d1a	Log more descriptive error when image gen fails with Replicate	2024-10-24 01:03:29 -07:00
sabaimran	30f9225021	Merge branch 'master' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-23 19:15:51 -07:00
sabaimran	5120597d4e	Remove user customized search model (#946 ) - Use a single standard search model across the server. There's diminishing benefits for having multiple user-customizable search models. - We may want to add server-level customization for specific tasks - Store the search model used to generate a given entry on the `Entry` object - Remove user-facing APIs and view - Add a management command for migrating the default search model on the server In a future PR (after running the migration), we'll also remove the `UserSearchModelConfig`	2024-10-23 17:38:37 -07:00
Debanjum Singh Solanky	8d588e0765	Encourage output mode chat actor to output only json and nothing else Latest claude model wanted to say more than just give the json output. The updated prompt encourages the model to ouput just json. This is similar to what is already being done for other prompts	2024-10-23 17:19:21 -07:00
Debanjum Singh Solanky	abad5348a0	Give Vision to Anthropic models in Khoj	2024-10-23 17:19:21 -07:00
Debanjum Singh Solanky	6fd50a5956	Reuse logic to format messages for chat with anthropic models	2024-10-23 17:19:21 -07:00
Debanjum Singh Solanky	82eac5a043	Make the get image from url function more versatile and reusable It was previously added under the google utils. Now it can be used by other conversation processors as well. The updated function - can get both base64 encoded and PIL formatted images from url - will return the media type of the image as well in response	2024-10-23 17:19:20 -07:00
sabaimran	f3ce47b445	Create explicit flow to enable the free trial (#944 ) * Create explicit flow to enable the free trial The current design is confusing. It obfuscates the fact that the user is on a free trial. This design will make the opt-in explicit and more intuitive. * Use the Subscription Type enum instead of hardcoded strings everywhere * Use length of free trial in the frontend code as well	2024-10-23 15:29:23 -07:00
Debanjum Singh Solanky	bc059eeb0b	Merge branch 'master' into put-retrieved-context-in-separate-chatml-message	2024-10-23 12:55:18 -07:00
Debanjum Singh Solanky	3b978b9b67	Fix chat history construction when generating chatml msgs with context	2024-10-23 12:55:12 -07:00
Debanjum Singh Solanky	9f2c02d9f7	Chat with the default agent by default from web app home Had temporarily updated the default selected agent to last used. Revert for now as 1. The previous logic was buggy. It didn't select the default agent even when the last used agent was the default agent. Which would require more work. 2. It maybe too early anyway to set the default agent to last used.	2024-10-23 03:43:57 -07:00
Debanjum Singh Solanky	218946edda	Fix copying message with user images on web app Adding div elements to message to render degraded text copied to clipboard for messages with user uploaded images. This change fixes that by separating message to render from message for clipboard. It ensures differently formatted forms of the user images are added to the two to allow proper rendering while still having decently formatted text copied to clipboard	2024-10-23 03:41:25 -07:00
Debanjum Singh Solanky	7d9a06c8ab	Merge branch 'master' into put-retrieved-context-in-separate-chatml-message	2024-10-23 00:13:38 -07:00
Debanjum Singh Solanky	2a50694089	Allow typing multi-line queries from a phone with Enter key Add newline instead of sending message when hit Enter key on mobile displays. As on phones shift key doesn't exist and send button is easily clickable. Limit hitting Enter key to send message to computers = larger display = expected to have full fledged keyboards.	2024-10-22 21:20:22 -07:00
Debanjum Singh Solanky	a134cd835c	Focus on chat input area to enter text after file uploads on web app	2024-10-22 21:19:17 -07:00
Debanjum Singh Solanky	750fbce0c2	Merge branch 'master' into improve-agent-pane-on-home-screen	2024-10-22 20:05:29 -07:00
Debanjum Singh Solanky	3be505db48	Only show type of error when image generation fails to clients Rather than showing raw error message from the underlying service as it could contain sensitive information	2024-10-22 20:03:20 -07:00
Debanjum Singh Solanky	b3fff43542	Sanitize user attached images. Constrain chat input width on home page Set max combined images size to 20mb to allow multiple photos to be shared	2024-10-22 19:42:40 -07:00
Debanjum Singh Solanky	6c393800cc	Merge branch 'master' into multi-image-chat-and-vision-for-gemini	2024-10-22 18:38:49 -07:00
Debanjum Singh Solanky	91bbd19333	Close the agent detail hover card when scroll on agent pane	2024-10-22 18:03:17 -07:00
Debanjum Singh Solanky	110c67f083	Improve agent pill, detail card styling. Handle null chatInputRef - Remove border from agent detail hover card on home page - Do not wrap long agent names in agent pills on home page - Handle scenario where chatInputRef is null	2024-10-22 18:03:17 -07:00
Debanjum Singh Solanky	aca8bef024	Only use recent chat sessions for agent MRU. Handle null agent chats	2024-10-22 17:46:45 -07:00
sabaimran	0dad4212fa	Generate dynamic diagrams (via Excalidraw) (#940 ) Add support for generating dynamic diagrams in flow with Excalidraw (https://github.com/excalidraw/excalidraw). This happens in three steps: 1. Default information collection & intent determination step. 2. Improving the overall guidance of the prompt for generating a JSON, Excalidraw-compatible declaration. 3. Generation of the diagram to output to the final UI. Add support in the web UI.	2024-10-22 16:13:46 -07:00
sabaimran	1e993d561b	Release Khoj version 1.26.4	2024-10-22 13:50:08 -07:00
Debanjum Singh Solanky	e8fb79a369	Rate limit the count and total size of images shared via API	2024-10-22 04:37:54 -07:00
Debanjum Singh Solanky	0847fb0102	Pass online context from chat history to chat model for response Previously only notes context from chat history was included. This change includes online context from chat history for model to use for response generation. This can reduce need for online lookups by reusing previous online context for faster responses. But will increase overall response time when not reusing past online context, as faster context buildup per conversation. Unsure if inclusion of context is preferrable. If not, both notes and online context should be removed.	2024-10-22 03:09:36 -07:00
Debanjum Singh Solanky	0c52a1169a	Put context into separate user message before sending to chat model The document, online search context are now passed as separate user messages to chat model, instead of being added to the final user message. This will improve - Models ability to differentiate data from user query. That should improve response quality and reduce prompt injection probability - Make truncation logic simpler and more robust When context window hit, can simply pop messages to auto truncate context in order of context, user, assistant message for each conversation turn in history until reach current user query The complex, brittle logic to extract user query from context in last user message isn't required. Marking the context message with assistant role doesn't translate well across chat models. E.g - Gemini can't handle consecutive messages by role = model well - Claude will merge consecutive messages by same role. In current message ordering the context message will result get merged into the previous assistant response. And if move context message after user query. The truncation logic will have to hop and skip while doing deletions - GPT seems to handle consecutive roles of any type fine Using context role = user generalizes better across chat models for now and aligns with previous behavior.	2024-10-22 03:09:36 -07:00
Debanjum Singh Solanky	7ac241b766	Improve format of notes, online context passed to chat models in prompt Improve separation of note snippets and show its origin file in notes prompt to have more readable, contextualized text shared with model. Previously the references dict was being directly passed as a string. The documents don't look well formatted and are less intelligible. - Passing file path along with notes snippets will help contextualize the notes better. - Better formatting should help with making notes more readable by the chat model.	2024-10-22 03:09:36 -07:00
sabaimran	892040972f	Replace user_id with server_id in telemetry	2024-10-21 20:47:52 -07:00
sabaimran	21e69b506d	Release Khoj version 1.26.3	2024-10-21 08:19:05 -07:00

1 2 3 4 5 ...

2921 commits