sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-27 17:35:07 +01:00

Author	SHA1	Message	Date
Debanjum Singh Solanky	b22a7dae5d	Tweak prompts to extract information from webpages, online results - Show more of the truncated messages for debugging context - Update Khoj personality prompt to encourage it to remember it's capabilities	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	85c62efca1	Test select webpage as data source and extract web urls chat actors	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	ad6f6bb0ed	Support webpage command in chat API - Fallback to use webpage when SERPER not setup and online command was attempted - Do not stop responding if can't retrieve online results. Try to respond without the online context	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	a6b7432837	Add webpage chat command for read web pages requested by user Update auto chat command inference prompt to show example of when to use webpage chat command (i.e when url is directly provided in link)	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	6118d1ff57	Create chat actor for directly reading webpages based on user message - Add prompt for the read webpages chat actor to extract, infer webpage links - Make chat actor infer or extract webpage to read directly from user message - Rename previous read_webpage function to more narrow read_webpage_at_url function	2024-03-14 14:58:37 +05:30
Debanjum	e549824fe2	Improve OpenAI Chat Actors and their prompts (#673 ) ### Major - Enforce json mode response from OpenAI chat actors prev using string lists - Use `gpt-4-turbo-preview' as default chat model, extract questions actor - Make Khoj read khoj website to respond with accurate, up-to-date information about itself - Dedupe query in notes prompt. Improve OAI chat actor, director tests ### Minor - Test data source, output mode selector, web search query chat actors - Improve notes search actor to always create a non-empty list of queries - Construct available data sources, output modes as a bullet list in prompts - Use consistent agent name across static and dynamic examples in prompts - Add actor's name to extract questions prompt to improve context for guidance	2024-03-14 12:44:40 +05:30
Debanjum Singh Solanky	a1ce12296f	Fix rendering online with note references post streaming chat response Previously only the notes references would get rendered post response streaming when when both online and notes references were used to respond to the user's message	2024-03-14 03:40:40 +05:30
Debanjum Singh Solanky	1aeea3d854	Fix opening external links from confirmation dialog box on desktop app	2024-03-14 02:29:22 +05:30
Debanjum Singh Solanky	2e5cc49cb3	Enforce json response from OpenAI chat actors prev using string lists - Allow passing response format type to OpenAI API via chat actors - Convert in-context examples to use json objects instead of str lists - Update actors outputting str list to request output to be json_object - OpenAI's json mode enforces the model to output valid json object	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	7211eb9cf5	Default to gpt-4-turbo-preview for chat model, extract questions actor GPT-4 is more expensive and generally less capable than gpt-4-turbo-preview	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	dd883dc53a	Dedupe query in notes prompt. Improve OAI chat actor, director tests - Remove stale tests - Improve tests to pass across gpt-3.5 and gpt-4-turbo - The haiku creation director was failing because of duplicate query in instantiated prompt	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	70b04d16c0	Test data source, output mode selector, web search query chat actors	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	14682d5354	Improve notes search actor to always create a non-empty list of queries - Remove the option for Notes search query generation actor to return no queries. Whether search should be performed is decided before, this step doesn't need to decide that - But do not throw warning if the response is a list with no elements	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	f5734826cb	Improve pick data source prompt to look online for info about Khoj - Add examples where user queries requesting information about Khoj results in the "online" data source being selected - Add an example for "general" to select chat command prompt	2024-03-14 01:21:13 +05:30
Debanjum Singh Solanky	9a516bed47	Construct available data sources, output modes as a bullet list in prompts	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	f28fb89af8	Use consistent agent name across static and dynamic examples in prompts Previously the examples constructed from chat history used "Khoj" as the agent's name but all 3 prompts using the func used static examples with "AI:" as the pertinent agent's name	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	f5793149a9	Add actor's name to extract questions prompt to improve context for guidance	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	73ad444086	Make online search Actor read khoj.dev for docs, info about Khoj - Add example to read khoj.dev website for up-to-date info to setup, use khoj, discover khoj features etc. - Online search should use site: and after: google search operators - Show example of adding the after: date filter to google search - Give local event lookup example using user's current location in query - Remove unused select search content type prompt	2024-03-14 00:34:57 +05:30
Debanjum	3abe7ccb26	Improve Online Search Speed and Context (#670 ) ### Major - Read web pages in parallel to improve chat response time - Read web pages directly when Olostep proxy not setup - Include search results & web page content in online context for chat response ### Minor - Simplify, modularize and add type hints to online search functions	2024-03-11 22:16:30 +05:30
Debanjum Singh Solanky	dc86e44a07	Include search results & webpage content in online context for chat response Previously if a web page was read for a sub-query, only the extracted web page content was provided as context for the given sub-query. But the google results themselves have relevant snippets. So include them	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	d136a6be44	Simplify, modularize and add type hints to online search functions - Simplify content arg to `extract_relevant_info' function. Validate, clean the content arg inside the `extract_relevant_info' function - Extract `search_with_google' function outside the parent function - Call the parent function a more appropriate `search_online' instead of `search_with_google' - Simplify the `search_with_google' function using list comprehension. Drop empty search result fields from chat model context for response to reduce cost and response latency - No need to show stacktrace when unable to read webpage, basic error is enough - Add type hints to online search functions to catch issues with mypy	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	88f096977b	Read webpages directly when Olostep proxy not setup This is useful for self-hosted, individual user, low traffic setups where a proxy service is not required	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	ca2f962e95	Read, extract information from web pages in parallel to lower response time - Time reading webpage, extract info from webpage steps for perf analysis - Deduplicate webpages to read gathered across separate google searches - Use aiohttp to make API requests non-blocking, pair with asyncio to parallelize all the online search webpage read and extract calls	2024-03-11 18:41:02 +05:30
sabaimran	1da453306e	Add num online for Discord badge	2024-03-10 17:48:30 +05:30
Debanjum	18fa3e2384	Rerank Search Results by Default on GPU machines (#668 ) - Trigger SentenceTransformer Cross Encoder models now run fast on GPU enabled machines, including Mac ARM devices since UKPLab/sentence-transformers#2463 - Details - Use cross-encoder to rerank search results by default on GPU machines and when using an inference server - Only call search API when pause in typing search query on web, desktop apps	2024-03-10 15:15:25 +05:30
Debanjum Singh Solanky	53d402480c	Rerank search results with cross-encoder when using an inference server If an inference server is being used, we can expect the cross encoder to be running fast enough to rerank search results by default	2024-03-10 15:09:46 +05:30
Debanjum Singh Solanky	44c8d09342	Only call search API when pause in typing search query on web, desktop apps Wait for 300ms since stop typing before calling search API. This smooths out UI jitter when rendering search results, especially now that we're reranking for every search query on GPU enabled devices Emacs already has 300ms debounce time. More convoluted to add debounce time to Obsidian search modal, so not updating that yet	2024-03-10 14:29:24 +05:30
Debanjum Singh Solanky	1105d8814f	Use cross-encoder to rerank search results by default on GPU machines Latest sentence-transformer package uses GPU for cross-encoder. This makes it fast enough to enable reranking on machines with GPU. Enabling search reranking by default allows (at least) users with GPUs to side-step learning the UI affordance to rerank results (i.e hitting Cmd/Ctrl-Enter or ENTER).	2024-03-10 14:29:21 +05:30
Debanjum	8eb3c441ec	Do not create new chat session when an old chat session is deleted (#669 ) ### Issue Previously deleting a chat session from the side panel on desktop, web app would sometimes result in also creating a new chat session ### Fix `get_conversation_by_user' shouldn't return new conversation if conversation with requested id not found. It should only return new conversation if no specific conversation is requested and no conversations found for user at all ### Miscellaneous Improvements - Chat history load should be logged as call to that chat_history api, not the "chat" api - Show status updates of clearing conversation history in chat input - Simplify web, desktop client code by removing unnecessary new variables ### Repro - Delete a new chat, this calls loadChat via window.onload which calls server /chat/history API endpoint with conversationId set to that of just deleted conversation sporadically The call to GET chat/history API with conversationId set occurs when window.onload triggers before the conversationId is deleted by the delete button after the DELETE /chat/history API call (via race) - In such a scenario, get_conversation_by_user called by chat/history API with conversationId of deleted conversation returns a new conversation	2024-03-10 14:14:43 +05:30
Debanjum Singh Solanky	fd81446ba3	Do not create new chat session when an old chat session is deleted - Fix `get_conversation_by_user' shouldn't return new conversation if conversation with requested id not found. It should only return new conversation if no specific conversation is requested and no conversations found for user at all - Repro - Delete a new chat, this calls loadChat via window.onload which calls server /chat/history API endpoint with conversationId set to that of just deleted conversation sporadically The call to GET chat/history API with conversationId set occurs when window.onload triggers before the conversationId is deleted by the delete button after the DELETE /chat/history API call (via race) - In such a scenario, get_conversation_by_user called by chat/history API with conversationId of deleted conversation returns a new conversation - Miscellaneous - Chat history load should be logged as call to that chat_history api, not the "chat" api - Show status updates of clearing conversation history in chat input - Simplify web, desktop client code by removing unnecessary new variables	2024-03-10 02:17:23 +05:30
Debanjum Singh Solanky	b7fad04870	Use consistent field name for queries in chat history & better image prompt	2024-03-09 19:11:03 +05:30
sabaimran	086d5f8324	Add link to drag drop pdf demo video	2024-03-09 17:02:23 +05:30
sabaimran	6aae9864d3	Fix Notion indexing and add an admin view for Entry objects	2024-03-09 16:25:23 +05:30
sabaimran	b3b6278af2	Update documentation to show how you can upload files	2024-03-09 15:58:13 +05:30
sabaimran	12d6c4da7d	Only include inferred queries in the conversation history for images, not links. Overflow the side panel when too long	2024-03-09 11:59:35 +05:30
Debanjum Singh Solanky	42d4bc6b14	Document installing Khoj on Phone as a Progressive Web App (PWA)	2024-03-08 21:18:06 +05:30
sabaimran	e5cd0237e3	Release Khoj version 1.6.2	2024-03-08 17:04:03 +05:30
Debanjum Singh Solanky	446ac7649d	Remove unused js method in web chat client, add newline to web data in prompt	2024-03-08 16:40:39 +05:30
Debanjum Singh Solanky	12d32ac99c	Increase user visibility into more errors during image generation Catch OpenAI connection error and errors during better image prompt generation	2024-03-08 16:40:39 +05:30
sabaimran	ff31759423	Fix target determination in the copy programmatic output button	2024-03-08 16:33:12 +05:30
sabaimran	9f934929c6	Infer mime type from file ending when not available in browser. Don't output image in conversation turns	2024-03-08 12:34:26 +05:30
sabaimran	81beb7940c	Upload generated images to s3, if AWS credentials and bucket is available (#667 ) * Upload generated images to s3, if AWS credentials and bucket is available. - In clients, render the images via the URL if it's returned with a text-to-image2 intent type * Make the loading screen more intuitve, less jerky and update the programmatic copy button * Update the loading icon when waiting for a chat response	2024-03-08 10:54:13 +05:30
sabaimran	13894e1fd5	add instructions for drag/drop files in sys prompt	2024-03-07 17:57:42 +05:30
sabaimran	7357b6eff1	Revert white-space preline and add more detailed help text when selecting file	2024-03-06 16:47:27 +05:30
sabaimran	b615c0719e	Support upload for files via drag/drop in the web UI (#666 ) * Add additional styling changes for showing UI changes when dragging file to the main screen * Add a loading spinner when file upload is in progress, and don't index github/notion when indexing files * Add an explicit icon for file uploading in the chat button menu * Add appropriate dragover styling when picking a file from the file picker/browser * Add a loading screen when retrieving chat history. Fix width of the chat window. Put attachment icon to the left of chat input	2024-03-06 16:43:05 +05:30
sabaimran	e323a6d69b	Include additional user context in the image generation flow (#660 ) * Make major improvements to the image generation flow - Include user context from online references and personal notes for generating images - Dynamically select the modality that the LLM should respond with - Retun the inferred context in the query response for the dekstop, web chat views to read * Add unit tests for retrieving response modes via LLM * Move output mode unit tests to the actor suite, rather than director * Only show the references button if there is at least one available * Rename aget_relevant_modes to aget_relevant_output_modes * Use a shared method for generating reference sections, simplify some of the prompting logic * Make out of space errors in the desktop client more obvious	2024-03-06 13:48:41 +05:30
sabaimran	3cbc5b0d52	Add links to blog in docs	2024-03-02 17:37:18 +05:30
sabaimran	880368635e	Set default value of KHOJ_DEBUG to False in the docker-compose file	2024-03-01 21:51:13 +05:30
Debanjum Singh Solanky	2d61591c22	Improve user visibility into errors during image generation	2024-02-29 13:19:13 +05:30
sabaimran	0bbb5cff85	Release Khoj version 1.6.1	2024-02-26 13:27:20 -08:00

1 2 3 4 5 ...

2353 commits