sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-27 17:35:07 +01:00

Author	SHA1	Message	Date
sabaimran	413255ddc7	Add closing tag to whatsapp qr code image	2024-07-28 13:50:38 +05:30
sabaimran	41eb85c933	Update the docs for whatsapp to include the QR code	2024-07-28 13:43:50 +05:30
sabaimran	eb5af38f33	Release Khoj version 1.17.0	2024-07-26 20:14:45 +05:30
sabaimran	44d34f9090	Update the unit test for the subscribed user	2024-07-26 19:59:01 +05:30
sabaimran	377f7668c5	Merge pull request #858 from khoj-ai/use-sse-instead-of-websocket Use Single HTTP API for Robust, Generalizable Chat Streaming	2024-07-26 07:11:54 -07:00
sabaimran	6607e666dc	Increase rate limit for data upload packet size in indexer.py	2024-07-26 19:35:32 +05:30
Debanjum Singh Solanky	778c571288	Use enum to track chat stream event types in chat api router	2024-07-26 00:19:43 +05:30
Debanjum Singh Solanky	ebe92ef16d	Do not send references twice in streamed image response Remove unused image content to reduce response payload size. References are collated, sent separately	2024-07-24 17:18:14 +05:30
Debanjum Singh Solanky	37b8fc5577	Extract events even when http chunk contains partial or mutiple events Previous logic was more brittle to break with simple unbalanced '{' or '}' string present in the event data. This method of trying to identify valid json obj was fairly brittle. It only allowed json objects or processed event as raw strings. Now we buffer chunk until we see our unicode magic delimiter and only then process it. This is much less likely to break based on event data and the delimiter is more tunable if we want to reduce rendering breakage likelihood further	2024-07-24 17:17:39 +05:30
Debanjum Singh Solanky	70201e8db8	Log total, ttft chat response time on start, end llm_response events - Deduplicate code to collect chat telemetry by relying on end_llm_response event - Log time to first token and total chat response time for latency analysis of Khoj as an agent. Not just the latency of the LLM - Remove duplicate timer in the image generation path	2024-07-23 23:21:12 +05:30
Debanjum Singh Solanky	b36a7833a6	Remove the old mechanism of streaming compiled references Do not need response generator to stuff compiled references in chat stream using "### compiled references:" separator. References are now sent to clients as structured json while streaming	2024-07-23 19:53:51 +05:30
Debanjum Singh Solanky	eb4e12d3c5	s/online_context/onlineContext chat API response field for consistency This will align the name of the online context field returned by current chat message and chat history	2024-07-23 19:50:43 +05:30
Debanjum	498fe2458c	Support Gemma 2 Model Family for Offline Chat (#855 ) ## Overview - Gemma 2 is a new open model family by Google. They've released a 9B, 29B param model. A 2B model is also expected. - It performs really well on the Chatbot arena and shows good performance when testing within Khoj as well. - Llama.cpp support for Gemma 2 architecture seems to have stabilized - If Gemma 2 performs well in further testing, it can be made the default offline chat model for Khoj - Once the 2B param model is released, the model size to download can be automatically chosen based on (V)RAM available ## Major - Support Gemma 2 for Offline Chat - Improve and fix chat model prompts for better, consistent context ## Minor - Fix and improve offline chat actor, director tests - Improve offline chat truncation to consider chat message delimiter tokens	2024-07-23 06:57:02 -07:00
Debanjum Singh Solanky	0277d16daf	Share desktop chat streaming utility funcs across chat, shortcut views Null check menu, menuContainer to avoid errors on Khoj mini	2024-07-23 19:16:33 +05:30
Debanjum Singh Solanky	e439a6ddac	Use async/await in web client chat stream instead of promises Align streaming logic across web, desktop and obsidian clients	2024-07-23 18:17:47 +05:30
Debanjum Singh Solanky	fafc467173	Put loading spinner at bottom of chat message in web client	2024-07-23 18:17:47 +05:30
Debanjum Singh Solanky	fc33162ec6	Use new chat streaming API to show Khoj train of thought in Desktop app Show loading spinner at end of current message	2024-07-23 18:17:47 +05:30
Debanjum Singh Solanky	c5ad172616	Keep loading animation at message end & reduce lists padding in Obsidian Previously loading animation would be at top of message. Moving it to bottom is more intuitve and easier to track. Remove white-space: pre from list elements. It was adding too much y axis padding to chat messages (and train of thought)	2024-07-23 17:56:03 +05:30
Debanjum Singh Solanky	54b4203683	Update chat API client tests to mix testing of batch and streaming mode	2024-07-23 17:56:03 +05:30
Debanjum Singh Solanky	3f5f418d0e	Use new chat streaming API to show Khoj train of thought in Obsidian client	2024-07-23 17:56:03 +05:30
Debanjum Singh Solanky	8303b09129	Convert snake case to camel case in chat view of obsidian plugin	2024-07-23 15:29:12 +05:30
Debanjum Singh Solanky	b224d7ffad	Simplify get_conversation_by_user DB adapter code	2024-07-23 14:51:11 +05:30
Debanjum Singh Solanky	daec439d52	Replace old chat router with new chat router with advanced streaming - Details Only return notes refs, online refs, inferred queries and generated response in non-streaming mode. Do not return train of throught and other status messages Incorporate missing logic from old chat API router into new one. - Motivation So we can halve chat API code by getting rid of the duplicate logic for the websocket router The deduplicated code: - Avoids inadvertant logic drift between the 2 routers - Improves dev velocity	2024-07-23 14:51:11 +05:30
Debanjum Singh Solanky	2d4b284218	Simplify streaming chat function in web client	2024-07-23 14:38:55 +05:30
Debanjum Singh Solanky	6b9550238f	Simplify advanced streaming chat API, align params with normal chat API	2024-07-22 22:51:24 +05:30
Debanjum Singh Solanky	b8d3e3669a	Stream Status Messages via Streaming Response from server to web client - Overview Use simpler HTTP Streaming Response to send status messages, alongside response and references from server to clients via API. Update web client to use the streamed response to show train of thought, stream response and render references. - Motivation This should allow other Khoj clients to pass auth headers and recieve Khoj's train of thought messages from server over simple HTTP streaming API. It'll also eventually deduplicate chat logic across /websocket and /chat API endpoints and help maintainability and dev velocity - Details - Pass references as a separate streaming message type for simpler parsing. Remove passing "### compiled references" altogether once the original /api/chat API is deprecated/merged with the new one and clients have been updated to consume the references using this new mechanism - Save message to conversation even if client disconnects. This is done by not breaking out of the async iterator that is sending the llm response. As the save conversation is called at the end of the iteration - Handle parsing chunked json responses as a valid json on client. This requires additional logic on client side but makes the client more robust to server chunking json response such that each chunk isn't itself necessarily a valid json.	2024-07-22 15:41:21 +05:30
Debanjum Singh Solanky	91fe41106e	Convert Websocket into Server Side Event (SSE) API endpoint - Convert functions in SSE API path into async generators using yields - Validate image generation, online, notes lookup and general paths of chat request are handled fine by the web client and server API	2024-07-21 14:20:22 +05:30
sabaimran	e694c82343	Fix Docker build issues with yarn / next /node (#859 ) * Rollback node version being installed from nodesource to node 20	2024-07-19 19:11:29 +05:30
sabaimran	1af9dbb083	Switch node/yarn install steps to use more native installation patterns	2024-07-19 17:10:08 +05:30
sabaimran	6d5ca5a3e1	yarn clean cache before build	2024-07-19 16:06:38 +05:30
sabaimran	7f0d1bd414	Add verbose logs when outputing yarn install steps	2024-07-19 15:48:43 +05:30
sabaimran	7426a4f819	Prefetch related agent when retrieving the conversation for performance improvements	2024-07-19 14:43:30 +05:30
Debanjum Singh Solanky	e9f86e320b	Fix and improve offline chat actor, director tests - Use updated references schema with compiled key - Enable director tests that are now expected to pass and that do pass (with Gemma 2 at least)	2024-07-18 03:43:09 +05:30
Debanjum Singh Solanky	b0ee78586c	Improve offline chat truncation to consider message separator tokens	2024-07-18 03:43:09 +05:30
Debanjum Singh Solanky	6f46e6afc6	Improve and fix chat model prompts for better, consistent context - Add day of week to system prompt of openai, anthropic, offline chat models - Pass more context to offline chat system prompt to - ask follow-up questions - know where to find information about khoj (itself) - Fix output mode selection prompt. Log error if model does not select valid option from list of valid output modes provided - Use consistent names for question, answers passed to extract_questions_offline prompt - Log which model extracts question, what the offline chat model sees as context. Similar to debug log shown for openai models	2024-07-18 03:43:09 +05:30
Debanjum Singh Solanky	53eabe0c06	Support Gemma 2 for Offline Chat - Pass system message as the first user chat message as Gemma 2 doesn't support system messages - Use gemma-2 chat format - Pass chat model name to generic, extract questions chat actors Used to figure out chat template to use for model For generic chat actor argument was anyway available but not being passed, which is confusing	2024-07-18 03:09:38 +05:30
Debanjum	2ab8fb78b1	Migrate the PyPI package to use project name: khoj (#853 ) ### Changes - Deprecate [khoj-assistant](https://pypi.org/project/khoj-assistant) pypi package. Use more accurate and succinct pypi project name, [khoj](https://pypi.org/project/khoj) - Update references to use `khoj` pypi package in docs and code - Update pypi workflow to publish to both khoj, khoj-assistant for now - Update stale python 3.9 support mentioned in our pyproject Can't support python 3.9 as depend on [Django 5.0.7](https://pypi.org/project/Django/5.0.7/) which needs python >=3.10 ### Verify - Updated `pypi.yml` github workflow publishes to both (new) [khoj](https://pypi.org/project/khoj/1.16.1.dev16/), (old) [khoj-assistant](https://pypi.org/project/khoj-assistant/1.16.1.dev16/) pypi projects - Can install Khoj python package with `pip install khoj`	2024-07-17 01:05:51 -07:00
Debanjum Singh Solanky	30d60aaae9	Add, fix Khoj Docker container labels	2024-07-17 10:41:17 +05:30
Debanjum Singh Solanky	583fa3c188	Migrate the pypi package to khoj project name. Update references - Deprecate khoj-assistant pypi package. Use more accurate and succinct pypi project name, khoj - Update references to sye khoj pypi package in docs and code instead of the legacy khoj-assistant pypi package - Update pypi workflow to publish to both khoj, khoj-assistant for now - Update stale python 3.9 support mentioned in our pyproject. Can't support python 3.9 as depend on latest django which support >=3.10	2024-07-17 10:41:16 +05:30
Debanjum	23f61d49e0	Support syncing, searching images from Obsidian plugin (#847 ) - Sync images from Obsidian vault with Khoj server now that Khoj can OCR images - Support rendering images returned by Khoj search modal	2024-07-14 20:41:39 -07:00
Debanjum Singh Solanky	02658ad4fd	Upgrade Django version	2024-07-11 16:35:10 +05:30
Debanjum Singh Solanky	cbae8b68fb	Add DB migration from making bi_encode configs optional in #834	2024-07-11 16:33:31 +05:30
Debanjum Singh Solanky	3a75838196	Add Keyboard shortcuts to navigate in Khoj Desktop	2024-07-11 16:29:53 +05:30
Debanjum Singh Solanky	6c1861b319	Improve the prompt to generate images with DALLE3 and SD3 - Major - Ask for prompt in prose - Remove seed from SD3 image generation to improve diversity of output for a given prompt Otherwise for conversations with similar sounding prompts, the images would be almost exactly the same. This maybe another indicator of SD3's inability to capture detailed instructions - Consistently use "prompt" wording instead of "query" in improved image generation prompts. Previously a mix of those terms were being used, which could confuse the chat model - Minor - Add day of week to prompt - Remove 2-5 sentence limit on instructions to SD3. It seems to be able to follow longer instructions just with less fidelity than DALLE. And the 2-5 sentence instruction limit wasn't being adhered to - Improve ability to edit, improve the image based on follow-up instructions by the user - Align prompts for DALLE and SD3. Only difference is to wrap text to be rendered in quotes for SD3. This improves it's ability to render requested text. DALLE cannot render text as well or consistently	2024-07-11 16:29:53 +05:30
Debanjum Singh Solanky	21fe1a917b	Support syncing, searching images from Obsidian plugin	2024-07-11 16:22:31 +05:30
sabaimran	260aa61818	Remove tests for python3.9	2024-07-09 12:28:11 +05:30
sabaimran	4471c1e37f	Apply mitigations for piling up open connections - Because we're using a FastAPI api framework with a Django ORM, we're running into some interesting conditions around connection pooling and clean-up. We're ending up with a large pile-up of open, stale connections to the DB recurringly when the server has been running for a while. To mitigate this problem, given starlette and django run in different python threads, add a middleware that will go and call the connection clean up method in each of the threads.	2024-07-09 12:22:58 +05:30
Debanjum	0b1b262512	Add system dependencies required by RapidOCR to fix Khoj Docker image (#842 ) - Issue The Khoj docker build would fail with `ImportError: libGL.so.1: cannot open shared object file: No such file or directory`. This was required by the Khoj RapidOCR python package dependency. - Fix A minimal set of system packages have been added to resolve this issue.	2024-07-08 22:16:16 +05:30
kxnarak	43413cd21f	add dependencies required by the RapidOCR python package	2024-07-08 18:26:19 +05:30
sabaimran	037e157648	Fix a variety of links	2024-07-08 16:49:13 +05:30

1 2 3 4 5 ...

3017 commits