sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-12-20 19:37:45 +00:00

Author	SHA1	Message	Date
sabaimran	8abc8ded82	Part 1: Server-side changes to support agents integrated with Conversations (#671 ) * Initial pass at backend changes to support agents - Add a db model for Agents, attaching them to conversations - When an agent is added to a conversation, override the system prompt to tweak the instructions - Agents can be configured with prompt modification, model specification, a profile picture, and other things - Admin-configured models will not be editable by individual users - Add unit tests to verify agent behavior. Unit tests demonstrate imperfect adherence to prompt specifications * Customize default behaviors for conversations without agents or with default agents * Use agent_id for getting correct agent * Merge migrations * Simplify some variable definitions, add additional security checks for agents * Rename agent.tuning -> agent.personality	2024-03-23 22:09:38 +05:30
sabaimran	4deb849fb1	Merge branch 'features/add-agents-ui' of github.com:khoj-ai/khoj into features/chat-socket-streaming	2024-03-23 14:04:25 +05:30
sabaimran	8edbd7094f	Let the name, slug of the default agent be Khoj, khoj	2024-03-23 14:03:58 +05:30
sabaimran	6b4c4f10b5	Merge branch 'features/add-agents-ui' of github.com:khoj-ai/khoj into features/chat-socket-streaming	2024-03-23 11:22:00 +05:30
sabaimran	20617614ae	Merge branch 'features/customize-chat-with-agents' of github.com:khoj-ai/khoj into features/add-agents-ui	2024-03-23 11:20:57 +05:30
sabaimran	2399d91f61	Merge migrations	2024-03-22 10:05:33 +05:30
sabaimran	d38089ab57	Merge with origin	2024-03-22 09:55:33 +05:30
Debanjum Singh Solanky	aed4313cfc	Fix updating specific conversation by id from the chat API endpoint - Use the conversation id of the retrieved conversation rather than the potentially unset conversation id passed via API - await creating new chat when no chat id provided and no existing conversations exist	2024-03-21 02:46:52 +05:30
sabaimran	6ba0d8e379	Add a connected notification if the websocket is connected	2024-03-20 20:53:28 +05:30
sabaimran	255b69dc58	Add a comma delimeter between outputted search queries	2024-03-20 19:43:35 +05:30
sabaimran	d84188b221	Scroll down when a message is added in the chat interface's handle stream response method	2024-03-20 15:04:41 +05:30
sabaimran	70ad78990a	Use a common method for sending a generic message to the client from the server in the ws connection	2024-03-20 15:04:14 +05:30
sabaimran	d4e83b060a	Update the web UI for the chat interface to establish a connection via a socket to the server - Move some common methods into separate functions to make the UI components more efficient - The normal HTTP-based chat connection will still work and serves as a fallback if the websocket is unavailable	2024-03-20 14:34:47 +05:30
sabaimran	a346f79b39	Add support for chatting via the web socket connection - Convert to a model of calling the search API directly with a function call (rather than using the API method) - Gracefully handle websocket connection disconnects - Ensure that the rest of the response is still saved, as it is currently, if the user disconects from the client - Setup unchangeable context at the beginning of the session when the connection is established (like location, username, etc)	2024-03-20 14:33:33 +05:30
Debanjum Singh Solanky	62a83dc9bb	Fix online search actor to use natural dates not after: operator The recently added after: operator to online search actor was too restrictive, gave worse results than when just use natural language dates in search query	2024-03-15 21:50:14 +05:30
Debanjum Singh Solanky	4a1e6a2275	Convert deleted old user requests log line to debug from info	2024-03-15 20:50:10 +05:30
Debanjum Singh Solanky	9a068dadbf	Fix extract questions prompt to use YYYY-MM-DD date filter format	2024-03-15 18:43:18 +05:30
Debanjum Singh Solanky	ecddf98430	Handle truncation when single long non-system chat message Previously was assuming the system prompt is being always passed as the first message. So expected there to be at least 2 messages in logs. This broke chat actors querying with single long non system message. A more robust way to extract system prompt is via the message role instead	2024-03-15 15:58:39 +05:30
Debanjum Singh Solanky	ec0c35b7ed	Improve delete, rename chat session UX in Desktop, Web app - Ask for Confirmation before deleting chat session in Desktop, Web app - Save chat session rename on hitting enter in title edit input box - No need to flash previous conversation cleared status message - Move chat session delete button after rename button in Desktop app	2024-03-15 15:58:19 +05:30
Debanjum Singh Solanky	924b1215ce	Allow unset locale for Google authenticated user	2024-03-15 15:35:20 +05:30
Debanjum Singh Solanky	c792fa819f	Fix setting chat session title from Desktop app Pass auth headers to not have the chat session title update request fail	2024-03-15 15:19:20 +05:30
Debanjum Singh Solanky	c9e05dc184	Get conversation by title when requested via chat API	2024-03-15 12:31:50 +05:30
sabaimran	724557fc7b	Merge branch 'master' of github.com:khoj-ai/khoj into features/add-agents-ui	2024-03-15 12:14:34 +05:30
sabaimran	7fc484ba7a	Merge branch 'master' of github.com:khoj-ai/khoj into features/customize-chat-with-agents	2024-03-15 12:13:28 +05:30
Debanjum Singh Solanky	cac26dafe3	Only create new chat on get if a specific chat id, slug isn't requested	2024-03-15 11:58:39 +05:30
sabaimran	416feb13ef	Fix layout of agent, agents pages	2024-03-15 11:17:40 +05:30
sabaimran	d734be61cf	Rename agents_page -> agent_page	2024-03-15 10:17:51 +05:30
Debanjum Singh Solanky	08993ff109	Add new, remove old known chat models from model to prompt size map	2024-03-15 04:02:25 +05:30
Debanjum Singh Solanky	fba0338787	Release Khoj version 1.7.0	2024-03-15 00:08:32 +05:30
Debanjum Singh Solanky	6118d1ff57	Create chat actor for directly reading webpages based on user message - Add prompt for the read webpages chat actor to extract, infer webpage links - Make chat actor infer or extract webpage to read directly from user message - Rename previous read_webpage function to more narrow read_webpage_at_url function	2024-03-14 14:58:37 +05:30
Debanjum	e549824fe2	Improve OpenAI Chat Actors and their prompts (#673 ) ### Major - Enforce json mode response from OpenAI chat actors prev using string lists - Use `gpt-4-turbo-preview' as default chat model, extract questions actor - Make Khoj read khoj website to respond with accurate, up-to-date information about itself - Dedupe query in notes prompt. Improve OAI chat actor, director tests ### Minor - Test data source, output mode selector, web search query chat actors - Improve notes search actor to always create a non-empty list of queries - Construct available data sources, output modes as a bullet list in prompts - Use consistent agent name across static and dynamic examples in prompts - Add actor's name to extract questions prompt to improve context for guidance	2024-03-14 12:44:40 +05:30
sabaimran	3caf0a79d8	Spruce up the 404 page and improve the overall layout for agents pages	2024-03-14 11:26:49 +05:30
sabaimran	c45030af44	Fix agent view	2024-03-14 11:13:19 +05:30
Debanjum Singh Solanky	a1ce12296f	Fix rendering online with note references post streaming chat response Previously only the notes references would get rendered post response streaming when when both online and notes references were used to respond to the user's message	2024-03-14 03:40:40 +05:30
Debanjum Singh Solanky	1aeea3d854	Fix opening external links from confirmation dialog box on desktop app	2024-03-14 02:29:22 +05:30
Debanjum Singh Solanky	2e5cc49cb3	Enforce json response from OpenAI chat actors prev using string lists - Allow passing response format type to OpenAI API via chat actors - Convert in-context examples to use json objects instead of str lists - Update actors outputting str list to request output to be json_object - OpenAI's json mode enforces the model to output valid json object	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	7211eb9cf5	Default to gpt-4-turbo-preview for chat model, extract questions actor GPT-4 is more expensive and generally less capable than gpt-4-turbo-preview	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	dd883dc53a	Dedupe query in notes prompt. Improve OAI chat actor, director tests - Remove stale tests - Improve tests to pass across gpt-3.5 and gpt-4-turbo - The haiku creation director was failing because of duplicate query in instantiated prompt	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	14682d5354	Improve notes search actor to always create a non-empty list of queries - Remove the option for Notes search query generation actor to return no queries. Whether search should be performed is decided before, this step doesn't need to decide that - But do not throw warning if the response is a list with no elements	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	f5734826cb	Improve pick data source prompt to look online for info about Khoj - Add examples where user queries requesting information about Khoj results in the "online" data source being selected - Add an example for "general" to select chat command prompt	2024-03-14 01:21:13 +05:30
Debanjum Singh Solanky	9a516bed47	Construct available data sources, output modes as a bullet list in prompts	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	f28fb89af8	Use consistent agent name across static and dynamic examples in prompts Previously the examples constructed from chat history used "Khoj" as the agent's name but all 3 prompts using the func used static examples with "AI:" as the pertinent agent's name	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	f5793149a9	Add actor's name to extract questions prompt to improve context for guidance	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	73ad444086	Make online search Actor read khoj.dev for docs, info about Khoj - Add example to read khoj.dev website for up-to-date info to setup, use khoj, discover khoj features etc. - Online search should use site: and after: google search operators - Show example of adding the after: date filter to google search - Give local event lookup example using user's current location in query - Remove unused select search content type prompt	2024-03-14 00:34:57 +05:30
sabaimran	290712c3fe	Add web UI views for agents - Add a page to view all agents - Add slugs to manage agents - Add a view to view single agent - Display active agent when in chat window - Fix post-login redirect issue	2024-03-14 00:07:36 +05:30
Debanjum	3abe7ccb26	Improve Online Search Speed and Context (#670 ) ### Major - Read web pages in parallel to improve chat response time - Read web pages directly when Olostep proxy not setup - Include search results & web page content in online context for chat response ### Minor - Simplify, modularize and add type hints to online search functions	2024-03-11 22:16:30 +05:30
Debanjum Singh Solanky	dc86e44a07	Include search results & webpage content in online context for chat response Previously if a web page was read for a sub-query, only the extracted web page content was provided as context for the given sub-query. But the google results themselves have relevant snippets. So include them	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	d136a6be44	Simplify, modularize and add type hints to online search functions - Simplify content arg to `extract_relevant_info' function. Validate, clean the content arg inside the `extract_relevant_info' function - Extract `search_with_google' function outside the parent function - Call the parent function a more appropriate `search_online' instead of `search_with_google' - Simplify the `search_with_google' function using list comprehension. Drop empty search result fields from chat model context for response to reduce cost and response latency - No need to show stacktrace when unable to read webpage, basic error is enough - Add type hints to online search functions to catch issues with mypy	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	88f096977b	Read webpages directly when Olostep proxy not setup This is useful for self-hosted, individual user, low traffic setups where a proxy service is not required	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	ca2f962e95	Read, extract information from web pages in parallel to lower response time - Time reading webpage, extract info from webpage steps for perf analysis - Deduplicate webpages to read gathered across separate google searches - Use aiohttp to make API requests non-blocking, pair with asyncio to parallelize all the online search webpage read and extract calls	2024-03-11 18:41:02 +05:30
sabaimran	8e1445b15b	Use agent_id for getting correct agent	2024-03-11 14:44:46 +05:30
sabaimran	6ab649312f	Add a new web client route for viewing all agents	2024-03-11 14:40:40 +05:30
sabaimran	352168d6c2	Customize default behaviors for conversations without agents or with default agents	2024-03-11 14:20:28 +05:30
sabaimran	9b88976f36	Initial pass at backend changes to support agents - Add a db model for Agents, attaching them to conversations - When an agent is added to a conversation, override the system prompt to tweak the instructions - Agents can be configured with prompt modification, model specification, a profile picture, and other things - Admin-configured models will not be editable by individual users - Add unit tests to verify agent behavior. Unit tests demonstrate imperfect adherence to prompt specifications	2024-03-11 12:45:24 +05:30
Debanjum	18fa3e2384	Rerank Search Results by Default on GPU machines (#668 ) - Trigger SentenceTransformer Cross Encoder models now run fast on GPU enabled machines, including Mac ARM devices since UKPLab/sentence-transformers#2463 - Details - Use cross-encoder to rerank search results by default on GPU machines and when using an inference server - Only call search API when pause in typing search query on web, desktop apps	2024-03-10 15:15:25 +05:30
Debanjum Singh Solanky	53d402480c	Rerank search results with cross-encoder when using an inference server If an inference server is being used, we can expect the cross encoder to be running fast enough to rerank search results by default	2024-03-10 15:09:46 +05:30
Debanjum Singh Solanky	44c8d09342	Only call search API when pause in typing search query on web, desktop apps Wait for 300ms since stop typing before calling search API. This smooths out UI jitter when rendering search results, especially now that we're reranking for every search query on GPU enabled devices Emacs already has 300ms debounce time. More convoluted to add debounce time to Obsidian search modal, so not updating that yet	2024-03-10 14:29:24 +05:30
Debanjum Singh Solanky	1105d8814f	Use cross-encoder to rerank search results by default on GPU machines Latest sentence-transformer package uses GPU for cross-encoder. This makes it fast enough to enable reranking on machines with GPU. Enabling search reranking by default allows (at least) users with GPUs to side-step learning the UI affordance to rerank results (i.e hitting Cmd/Ctrl-Enter or ENTER).	2024-03-10 14:29:21 +05:30
Debanjum Singh Solanky	fd81446ba3	Do not create new chat session when an old chat session is deleted - Fix `get_conversation_by_user' shouldn't return new conversation if conversation with requested id not found. It should only return new conversation if no specific conversation is requested and no conversations found for user at all - Repro - Delete a new chat, this calls loadChat via window.onload which calls server /chat/history API endpoint with conversationId set to that of just deleted conversation sporadically The call to GET chat/history API with conversationId set occurs when window.onload triggers before the conversationId is deleted by the delete button after the DELETE /chat/history API call (via race) - In such a scenario, get_conversation_by_user called by chat/history API with conversationId of deleted conversation returns a new conversation - Miscellaneous - Chat history load should be logged as call to that chat_history api, not the "chat" api - Show status updates of clearing conversation history in chat input - Simplify web, desktop client code by removing unnecessary new variables	2024-03-10 02:17:23 +05:30
Debanjum Singh Solanky	b7fad04870	Use consistent field name for queries in chat history & better image prompt	2024-03-09 19:11:03 +05:30
sabaimran	6aae9864d3	Fix Notion indexing and add an admin view for Entry objects	2024-03-09 16:25:23 +05:30
sabaimran	12d6c4da7d	Only include inferred queries in the conversation history for images, not links. Overflow the side panel when too long	2024-03-09 11:59:35 +05:30
sabaimran	e5cd0237e3	Release Khoj version 1.6.2	2024-03-08 17:04:03 +05:30
Debanjum Singh Solanky	446ac7649d	Remove unused js method in web chat client, add newline to web data in prompt	2024-03-08 16:40:39 +05:30
Debanjum Singh Solanky	12d32ac99c	Increase user visibility into more errors during image generation Catch OpenAI connection error and errors during better image prompt generation	2024-03-08 16:40:39 +05:30
sabaimran	ff31759423	Fix target determination in the copy programmatic output button	2024-03-08 16:33:12 +05:30
sabaimran	9f934929c6	Infer mime type from file ending when not available in browser. Don't output image in conversation turns	2024-03-08 12:34:26 +05:30
sabaimran	81beb7940c	Upload generated images to s3, if AWS credentials and bucket is available (#667 ) * Upload generated images to s3, if AWS credentials and bucket is available. - In clients, render the images via the URL if it's returned with a text-to-image2 intent type * Make the loading screen more intuitve, less jerky and update the programmatic copy button * Update the loading icon when waiting for a chat response	2024-03-08 10:54:13 +05:30
sabaimran	13894e1fd5	add instructions for drag/drop files in sys prompt	2024-03-07 17:57:42 +05:30
sabaimran	7357b6eff1	Revert white-space preline and add more detailed help text when selecting file	2024-03-06 16:47:27 +05:30
sabaimran	b615c0719e	Support upload for files via drag/drop in the web UI (#666 ) * Add additional styling changes for showing UI changes when dragging file to the main screen * Add a loading spinner when file upload is in progress, and don't index github/notion when indexing files * Add an explicit icon for file uploading in the chat button menu * Add appropriate dragover styling when picking a file from the file picker/browser * Add a loading screen when retrieving chat history. Fix width of the chat window. Put attachment icon to the left of chat input	2024-03-06 16:43:05 +05:30
sabaimran	e323a6d69b	Include additional user context in the image generation flow (#660 ) * Make major improvements to the image generation flow - Include user context from online references and personal notes for generating images - Dynamically select the modality that the LLM should respond with - Retun the inferred context in the query response for the dekstop, web chat views to read * Add unit tests for retrieving response modes via LLM * Move output mode unit tests to the actor suite, rather than director * Only show the references button if there is at least one available * Rename aget_relevant_modes to aget_relevant_output_modes * Use a shared method for generating reference sections, simplify some of the prompting logic * Make out of space errors in the desktop client more obvious	2024-03-06 13:48:41 +05:30
Debanjum Singh Solanky	2d61591c22	Improve user visibility into errors during image generation	2024-02-29 13:19:13 +05:30
sabaimran	0bbb5cff85	Release Khoj version 1.6.1	2024-02-26 13:27:20 -08:00
sabaimran	c8194a7364	Make out of space errors in the desktop client more obvious	2024-02-26 11:53:36 -08:00
Debanjum Singh Solanky	956dd71d91	Clean entry before adding to DB and log when it fails Remove \0 null characters from entry fields as this is causing indexing errors	2024-02-27 01:19:34 +05:30
Debanjum Singh Solanky	bb613a8e1d	Make indentation styling more compact on Obsidian client	2024-02-25 14:41:45 +05:30
Debanjum Singh Solanky	682b70011f	Set chat body height to remove UX jitter on chat history load in Web, Desktop	2024-02-25 14:40:47 +05:30
Debanjum Singh Solanky	efe86ce159	Fix saved conversation logger to handle image responses	2024-02-25 13:46:32 +05:30
Debanjum Singh Solanky	4839f2901a	Open external links in Desktop app with default app for url on OS - Open external links using the default link handler registered on OS for the link type, e.g http:// -> firefox, mailto: thunderbird etc - Confirm before opening non-http URL using an external app	2024-02-25 13:21:52 +05:30
Debanjum	170bce2c02	Fix, Improve rendering images in Obsidian, Desktop, Web clients (#659 ) - Improve render of inferred query in image chat messages in Web, Desktop apps - Add inferred queries to image chat responses in Obsidian client - Fix rendering images from Khoj response in Obsidian client	2024-02-25 00:56:26 +05:30
Debanjum Singh Solanky	f84606325c	Improve render of inferred query in image chat messages in Web, Desktop apps	2024-02-25 00:47:06 +05:30
Debanjum Singh Solanky	a2e53d5e41	Add inferred queries to image chat responses in Obsidian client	2024-02-25 00:24:58 +05:30
Debanjum Singh Solanky	9b61f0b5f7	Fix rendering images from Khoj response in Obsidian client	2024-02-25 00:11:11 +05:30
sabaimran	b9d0533d92	Misc. fixes to prompting, admin, and others (#658 ) * Simplify and clarify prompt for selecting toolset dynamically * Add error handling around call to OLOSTEP api * Fix conversation admin page * Skip adding none or empty entries in the chunking method	2024-02-24 10:25:42 -08:00
Debanjum Singh Solanky	0e0e751ef7	Improve docstring of entrypoint function to the emacs client	2024-02-24 21:09:41 +05:30
Debanjum	8855529637	Improve Syncing Obsidian Vault, Invalidate Static Assets in Browser Cache in Web Client (#657 ) - Improve - Only send files modified since their last sync for indexing on server from the Obsidian client - Fix - Invalidate static asset browser cache in Web client when Khoj version changes	2024-02-24 20:20:30 +05:30
Debanjum Singh Solanky	a46f70c4b0	Remove deprecated lastSyncedFiles settings field from Obsidian client	2024-02-24 20:18:22 +05:30
Debanjum Singh Solanky	03a6b491b2	Warn when can't identify mimeType of files in Desktop, Obsidian clients	2024-02-24 19:59:03 +05:30
Debanjum Singh Solanky	3675ab4864	Only sync modified files from the Obsidian client Previously we'd send all files in vault and let the server deduplicate. This changes takes inspiration from the desktop app, and only pushes files which were modified after their previous sync with the server. This should reduce the processing load on the server	2024-02-24 07:48:40 +05:30
Debanjum Singh Solanky	ddfbf31bc8	Append version query param to web asset URLs to bypass browser cache Ensure latest assets are loaded when khoj version is updated	2024-02-24 06:49:25 +05:30
sabaimran	42773e808c	Retrieve, create, and save conversations differently for ClientApplications (#656 ) * Retrieve, create, and save conversations differently if they're coming from a client application - Not all of our client apps will necessarily maintain state over the conversation IDs available to a user. For some (single-threaded conversations), it should just use a single conversation. Fix the code to do so * Simplify conversation retrieval logic * Keep 0 padding below chat response * Add order_by sorting to retrieving the conversation without id	2024-02-23 11:32:00 -08:00
Debanjum	9afb2a14ef	Fix and Improve Chat UI in Web, Desktop apps (#655 ) ### Improvements to Chat UI on Web, Desktop apps - Improve styling of chat session side panel - Improve styling of chat message bubble in Desktop, Web app - Add frosted, minimal chat UI to background of Login screen - Improve PWA install experience of Khoj ### Fixes to Chat UI on Web, Desktop apps - Fix creating new chat sessions from the Desktop app - Only show 3 starter questions even when consecutive chat sessions created ### Other Improvements - Update Khoj cloud trial period to a fortnight instead of a week - Document using venv to handle dependency conflict on khoj pip install Resolves #276	2024-02-23 19:27:02 +05:30
Debanjum Singh Solanky	c70ca78cdc	Improve PWA install experience for Khoj on Desktop, Mobile - Resolve PWA issues thrown by Chrome/Edge - Add screenshot samples showcasing remember, browse and draw features - This can provide a richer app store like experience when installing Khoj PWA on Mobile or Desktop - Add wide and narrow screenshots to show Mobile vs Desktop UX - Add higher resolution favicon for PWA - Use single web manifest instead of separate ones for Chat, Search - Update manifest description with more details about Khoj features	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	e10b260988	Update web login screen to show frosted minimal chat UI in background	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	1b0318564e	Log when conversation turn is saved to DB	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	4c39960917	Make number of conversation starters to get from DB configurable	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	50617594fd	Only show 3 starter questions even when consecutive chat sessions created Reset starter question suggestions before appending in web, desktop app Otherwise previously it'd keep adding to existing starter question suggestions on each new session creation if multiple consecutive new chat sessions created. This would result in more than the 3 expected starter questions being displayed at a time	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	102f5c3f53	Improve styling of chat session side panel - Make collapse, expand toggle arrow point in the direction the action will expand the side panel in - Make the collapsed side panel reduce to a 1px sliver	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	6283d9fe83	Update Khoj cloud trial period to a fortnight instead of a week - Improve rate limit error message wording - Make the "too many requests" error message more robust. Should throw that exception fix self.request >= self.subscribed_requests because upgrading wouldn't fix this rate limiting	2024-02-23 18:33:56 +05:30

1 2 3 4 5 ...

1770 commits