sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-23 23:48:56 +01:00

Author	SHA1	Message	Date
sabaimran	377f7668c5	Merge pull request #858 from khoj-ai/use-sse-instead-of-websocket Use Single HTTP API for Robust, Generalizable Chat Streaming	2024-07-26 07:11:54 -07:00
sabaimran	6607e666dc	Increase rate limit for data upload packet size in indexer.py	2024-07-26 19:35:32 +05:30
Debanjum Singh Solanky	778c571288	Use enum to track chat stream event types in chat api router	2024-07-26 00:19:43 +05:30
sabaimran	7482797605	Add some better default states for no files found, prompt to search. Add link to search in the file search compnoent in side panel	2024-07-25 13:00:28 +05:30
sabaimran	662dffea3b	Press enter to search	2024-07-24 19:28:38 +05:30
sabaimran	19cd607c96	Style the see content button correctly	2024-07-24 18:28:23 +05:30
sabaimran	75a370cc06	Implement focus mode to click into full text of the note	2024-07-24 18:00:33 +05:30
sabaimran	5adbfe14ab	Add a search page that just renders truncated results when you click search	2024-07-24 17:43:19 +05:30
sabaimran	52db15706d	Remove unused styling	2024-07-24 17:42:36 +05:30
sabaimran	cfe7a1068e	Update the navmenu title if prop is updated and undefined	2024-07-24 17:41:31 +05:30
Debanjum Singh Solanky	ebe92ef16d	Do not send references twice in streamed image response Remove unused image content to reduce response payload size. References are collated, sent separately	2024-07-24 17:18:14 +05:30
Debanjum Singh Solanky	37b8fc5577	Extract events even when http chunk contains partial or mutiple events Previous logic was more brittle to break with simple unbalanced '{' or '}' string present in the event data. This method of trying to identify valid json obj was fairly brittle. It only allowed json objects or processed event as raw strings. Now we buffer chunk until we see our unicode magic delimiter and only then process it. This is much less likely to break based on event data and the delimiter is more tunable if we want to reduce rendering breakage likelihood further	2024-07-24 17:17:39 +05:30
sabaimran	4d30e5b158	Fix indexing error for notion, expecting image and docx in dict	2024-07-24 16:58:31 +05:30
sabaimran	694bedc25b	Add support for text to speech and speech to text (#863 ) - Add support for text to speech, speech to text. Add loading and responsive indicators to reflect state. - When streaming for speech to text, show incremental transcription in the message input field - When streaming text to speech, and a pause button in the chat message to allow user to stop playback	2024-07-24 14:36:40 +05:30
Raghav Tirumale	3e4325edab	Upgrade: New Home Screen for Khoj (#860 ) * V1 of the new automations page Implemented: - Shareable - Editable - Suggested Cards - Create new cards - added side panel new conversation button - Implement mobile-friendly view for homepage - Fix issue of new conversations being created when selected agent is changed - Improve center of the homepage experience - Fix showing agent during first chat experience - dark mode gradient updates --------- Co-authored-by: sabaimran <narmiabas@gmail.com>	2024-07-24 13:16:19 +05:30
Debanjum Singh Solanky	70201e8db8	Log total, ttft chat response time on start, end llm_response events - Deduplicate code to collect chat telemetry by relying on end_llm_response event - Log time to first token and total chat response time for latency analysis of Khoj as an agent. Not just the latency of the LLM - Remove duplicate timer in the image generation path	2024-07-23 23:21:12 +05:30
Debanjum Singh Solanky	b36a7833a6	Remove the old mechanism of streaming compiled references Do not need response generator to stuff compiled references in chat stream using "### compiled references:" separator. References are now sent to clients as structured json while streaming	2024-07-23 19:53:51 +05:30
Debanjum Singh Solanky	eb4e12d3c5	s/online_context/onlineContext chat API response field for consistency This will align the name of the online context field returned by current chat message and chat history	2024-07-23 19:50:43 +05:30
Debanjum	498fe2458c	Support Gemma 2 Model Family for Offline Chat (#855 ) ## Overview - Gemma 2 is a new open model family by Google. They've released a 9B, 29B param model. A 2B model is also expected. - It performs really well on the Chatbot arena and shows good performance when testing within Khoj as well. - Llama.cpp support for Gemma 2 architecture seems to have stabilized - If Gemma 2 performs well in further testing, it can be made the default offline chat model for Khoj - Once the 2B param model is released, the model size to download can be automatically chosen based on (V)RAM available ## Major - Support Gemma 2 for Offline Chat - Improve and fix chat model prompts for better, consistent context ## Minor - Fix and improve offline chat actor, director tests - Improve offline chat truncation to consider chat message delimiter tokens	2024-07-23 06:57:02 -07:00
Debanjum Singh Solanky	0277d16daf	Share desktop chat streaming utility funcs across chat, shortcut views Null check menu, menuContainer to avoid errors on Khoj mini	2024-07-23 19:16:33 +05:30
Debanjum Singh Solanky	e439a6ddac	Use async/await in web client chat stream instead of promises Align streaming logic across web, desktop and obsidian clients	2024-07-23 18:17:47 +05:30
Debanjum Singh Solanky	fafc467173	Put loading spinner at bottom of chat message in web client	2024-07-23 18:17:47 +05:30
Debanjum Singh Solanky	fc33162ec6	Use new chat streaming API to show Khoj train of thought in Desktop app Show loading spinner at end of current message	2024-07-23 18:17:47 +05:30
Debanjum Singh Solanky	c5ad172616	Keep loading animation at message end & reduce lists padding in Obsidian Previously loading animation would be at top of message. Moving it to bottom is more intuitve and easier to track. Remove white-space: pre from list elements. It was adding too much y axis padding to chat messages (and train of thought)	2024-07-23 17:56:03 +05:30
Debanjum Singh Solanky	54b4203683	Update chat API client tests to mix testing of batch and streaming mode	2024-07-23 17:56:03 +05:30
Debanjum Singh Solanky	3f5f418d0e	Use new chat streaming API to show Khoj train of thought in Obsidian client	2024-07-23 17:56:03 +05:30
Debanjum Singh Solanky	8303b09129	Convert snake case to camel case in chat view of obsidian plugin	2024-07-23 15:29:12 +05:30
Debanjum Singh Solanky	b224d7ffad	Simplify get_conversation_by_user DB adapter code	2024-07-23 14:51:11 +05:30
Debanjum Singh Solanky	daec439d52	Replace old chat router with new chat router with advanced streaming - Details Only return notes refs, online refs, inferred queries and generated response in non-streaming mode. Do not return train of throught and other status messages Incorporate missing logic from old chat API router into new one. - Motivation So we can halve chat API code by getting rid of the duplicate logic for the websocket router The deduplicated code: - Avoids inadvertant logic drift between the 2 routers - Improves dev velocity	2024-07-23 14:51:11 +05:30
Debanjum Singh Solanky	2d4b284218	Simplify streaming chat function in web client	2024-07-23 14:38:55 +05:30
Debanjum Singh Solanky	6b9550238f	Simplify advanced streaming chat API, align params with normal chat API	2024-07-22 22:51:24 +05:30
Debanjum Singh Solanky	b8d3e3669a	Stream Status Messages via Streaming Response from server to web client - Overview Use simpler HTTP Streaming Response to send status messages, alongside response and references from server to clients via API. Update web client to use the streamed response to show train of thought, stream response and render references. - Motivation This should allow other Khoj clients to pass auth headers and recieve Khoj's train of thought messages from server over simple HTTP streaming API. It'll also eventually deduplicate chat logic across /websocket and /chat API endpoints and help maintainability and dev velocity - Details - Pass references as a separate streaming message type for simpler parsing. Remove passing "### compiled references" altogether once the original /api/chat API is deprecated/merged with the new one and clients have been updated to consume the references using this new mechanism - Save message to conversation even if client disconnects. This is done by not breaking out of the async iterator that is sending the llm response. As the save conversation is called at the end of the iteration - Handle parsing chunked json responses as a valid json on client. This requires additional logic on client side but makes the client more robust to server chunking json response such that each chunk isn't itself necessarily a valid json.	2024-07-22 15:41:21 +05:30
Debanjum Singh Solanky	91fe41106e	Convert Websocket into Server Side Event (SSE) API endpoint - Convert functions in SSE API path into async generators using yields - Validate image generation, online, notes lookup and general paths of chat request are handled fine by the web client and server API	2024-07-21 14:20:22 +05:30
sabaimran	9cf52bb7e4	Update automations UX for more consistency (#856 ) * Update the automations UI to be a more suitable color distribution based on new designs * Use accented colors for the metadata, update dark mode colors * Update form to use icons as well and render more pretty inline form labels	2024-07-21 12:22:23 +05:30
sabaimran	e694c82343	Fix Docker build issues with yarn / next /node (#859 ) * Rollback node version being installed from nodesource to node 20	2024-07-19 19:11:29 +05:30
sabaimran	1af9dbb083	Switch node/yarn install steps to use more native installation patterns	2024-07-19 17:10:08 +05:30
sabaimran	6d5ca5a3e1	yarn clean cache before build	2024-07-19 16:06:38 +05:30
sabaimran	7f0d1bd414	Add verbose logs when outputing yarn install steps	2024-07-19 15:48:43 +05:30
sabaimran	7426a4f819	Prefetch related agent when retrieving the conversation for performance improvements	2024-07-19 14:43:30 +05:30
Debanjum Singh Solanky	07f36fa95a	Update new web interface with update calls to /content, /model APIs	2024-07-19 12:23:22 +05:30
Debanjum Singh Solanky	f03525f431	Add back /api/configure as /api/settings API endpoint It had been removed during the /api/configure/content to /api/content API migration before	2024-07-19 05:40:34 +05:30
Debanjum Singh Solanky	3832ef0236	Move API endpoints under /api/configure/phone/ to /api/phone/ Pull out /api/configure/phone API endpoints into /api/phone for more concise and sufficiently explanatory API path Refactor Flow 1. Rename /api/configure/phone -> /api/phone	2024-07-19 05:40:34 +05:30
Debanjum Singh Solanky	1197266912	Move API endpoints under /configure/<type>/model to /api/model/<type> Now the API to configure all the AI models is under /api/models. This provides better organization and API hierarchy. The /configure url segment was redundant. - Rename POST /api/phone to PATCH /api/phone - Rename GET /api/configure to GET /api/settings Refactor Flow 1. Move out POST /user/name to main api.py 2. Rename /api/configure/<type>/model -> /api/model/<type> 3. Rename @api_configure to @api_mode 4. Rename file api_config.py to api_model.py	2024-07-19 05:40:34 +05:30
Debanjum Singh Solanky	469a1cb6a2	Move API endpoints under /api/configure/content/ to /api/content/ Pull out /api/configure/content API endpoints into /api/content to allow for more logical organization of API path hierarchy This should make the url more succinct and API request intent more understandable by using existing HTTP method semantics along with the path. The /configure URL path segment was either - redundant (e.g POST /configure/notion) or - incorrect (e.g GET /configure/files) Some example of naming improvements: - GET /configure/types -> GET /content/types - GET /configure/files -> GET /content/files - DELETE /configure/files -> DELETE /content/files This should also align, merge better the the content indexing API triggered via PUT, PATCH /content Refactor Flow 1. Rename /api/configure/types -> /api/content/types 2. Rename /api/configure -> /api 3. Move /api/content to api_content from under api_config	2024-07-19 05:40:34 +05:30
Debanjum Singh Solanky	bba4e0b529	Accept file deletion requests by clients during sync - Remove unused full_corpus boolean. The full_corpus=False code path wasn't being used (accept for in a test) - The full_corpus=True code path used was ignoring file deletion requests sent by clients during sync. Unclear why this was done - Added unit test to prevent regression and show file deletion by clients during sync not ignored now	2024-07-19 04:53:01 +05:30
Debanjum Singh Solanky	5923b6d89e	Split /api/v1/index/update into /api/content PUT, PATCH API endpoints - This utilizes PUT, PATCH HTTP method semantics to remove need for the "regenerate" query param and "/update" url suffix - This should make the url more succinct and API request intent more understandable by using existing HTTP method semantics	2024-07-19 01:45:53 +05:30
Debanjum Singh Solanky	e9f86e320b	Fix and improve offline chat actor, director tests - Use updated references schema with compiled key - Enable director tests that are now expected to pass and that do pass (with Gemma 2 at least)	2024-07-18 03:43:09 +05:30
Debanjum Singh Solanky	b0ee78586c	Improve offline chat truncation to consider message separator tokens	2024-07-18 03:43:09 +05:30
Debanjum Singh Solanky	6f46e6afc6	Improve and fix chat model prompts for better, consistent context - Add day of week to system prompt of openai, anthropic, offline chat models - Pass more context to offline chat system prompt to - ask follow-up questions - know where to find information about khoj (itself) - Fix output mode selection prompt. Log error if model does not select valid option from list of valid output modes provided - Use consistent names for question, answers passed to extract_questions_offline prompt - Log which model extracts question, what the offline chat model sees as context. Similar to debug log shown for openai models	2024-07-18 03:43:09 +05:30
Debanjum Singh Solanky	53eabe0c06	Support Gemma 2 for Offline Chat - Pass system message as the first user chat message as Gemma 2 doesn't support system messages - Use gemma-2 chat format - Pass chat model name to generic, extract questions chat actors Used to figure out chat template to use for model For generic chat actor argument was anyway available but not being passed, which is confusing	2024-07-18 03:09:38 +05:30

... 2 3 4 5 6 ...

3249 commits