sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-12-18 02:27:10 +00:00

Author	SHA1	Message	Date
Debanjum Singh Solanky	750fbce0c2	Merge branch 'master' into improve-agent-pane-on-home-screen	2024-10-22 20:05:29 -07:00
Debanjum Singh Solanky	3be505db48	Only show type of error when image generation fails to clients Rather than showing raw error message from the underlying service as it could contain sensitive information	2024-10-22 20:03:20 -07:00
Debanjum	c6f3253ebd	Chat with Multiple Images. Support Vision with Gemini (#942 ) ## Overview - Add vision support for Gemini models in Khoj - Allow sharing multiple images as part of user query from the web app - Handle multiple images shared in query to chat API	2024-10-22 19:59:18 -07:00
Debanjum Singh Solanky	b3fff43542	Sanitize user attached images. Constrain chat input width on home page Set max combined images size to 20mb to allow multiple photos to be shared	2024-10-22 19:42:40 -07:00
Debanjum Singh Solanky	6c393800cc	Merge branch 'master' into multi-image-chat-and-vision-for-gemini	2024-10-22 18:38:49 -07:00
Debanjum Singh Solanky	91bbd19333	Close the agent detail hover card when scroll on agent pane	2024-10-22 18:03:17 -07:00
Debanjum Singh Solanky	110c67f083	Improve agent pill, detail card styling. Handle null chatInputRef - Remove border from agent detail hover card on home page - Do not wrap long agent names in agent pills on home page - Handle scenario where chatInputRef is null	2024-10-22 18:03:17 -07:00
Debanjum Singh Solanky	aca8bef024	Only use recent chat sessions for agent MRU. Handle null agent chats	2024-10-22 17:46:45 -07:00
sabaimran	0dad4212fa	Generate dynamic diagrams (via Excalidraw) (#940 ) Add support for generating dynamic diagrams in flow with Excalidraw (https://github.com/excalidraw/excalidraw). This happens in three steps: 1. Default information collection & intent determination step. 2. Improving the overall guidance of the prompt for generating a JSON, Excalidraw-compatible declaration. 3. Generation of the diagram to output to the final UI. Add support in the web UI.	2024-10-22 16:13:46 -07:00
sabaimran	1e993d561b	Release Khoj version 1.26.4	2024-10-22 13:50:08 -07:00
Debanjum Singh Solanky	e8fb79a369	Rate limit the count and total size of images shared via API	2024-10-22 04:37:54 -07:00
Debanjum Singh Solanky	39a613d3bc	Fix up openai chat actor tests	2024-10-22 03:09:36 -07:00
Debanjum Singh Solanky	0847fb0102	Pass online context from chat history to chat model for response Previously only notes context from chat history was included. This change includes online context from chat history for model to use for response generation. This can reduce need for online lookups by reusing previous online context for faster responses. But will increase overall response time when not reusing past online context, as faster context buildup per conversation. Unsure if inclusion of context is preferrable. If not, both notes and online context should be removed.	2024-10-22 03:09:36 -07:00
Debanjum Singh Solanky	0c52a1169a	Put context into separate user message before sending to chat model The document, online search context are now passed as separate user messages to chat model, instead of being added to the final user message. This will improve - Models ability to differentiate data from user query. That should improve response quality and reduce prompt injection probability - Make truncation logic simpler and more robust When context window hit, can simply pop messages to auto truncate context in order of context, user, assistant message for each conversation turn in history until reach current user query The complex, brittle logic to extract user query from context in last user message isn't required. Marking the context message with assistant role doesn't translate well across chat models. E.g - Gemini can't handle consecutive messages by role = model well - Claude will merge consecutive messages by same role. In current message ordering the context message will result get merged into the previous assistant response. And if move context message after user query. The truncation logic will have to hop and skip while doing deletions - GPT seems to handle consecutive roles of any type fine Using context role = user generalizes better across chat models for now and aligns with previous behavior.	2024-10-22 03:09:36 -07:00
Debanjum Singh Solanky	7ac241b766	Improve format of notes, online context passed to chat models in prompt Improve separation of note snippets and show its origin file in notes prompt to have more readable, contextualized text shared with model. Previously the references dict was being directly passed as a string. The documents don't look well formatted and are less intelligible. - Passing file path along with notes snippets will help contextualize the notes better. - Better formatting should help with making notes more readable by the chat model.	2024-10-22 03:09:36 -07:00
sabaimran	892040972f	Replace user_id with server_id in telemetry	2024-10-21 20:47:52 -07:00
sabaimran	db959a504d	Fix the version of pymupdf to avert build errors	2024-10-21 12:56:51 -07:00
sabaimran	21e69b506d	Release Khoj version 1.26.3	2024-10-21 08:19:05 -07:00
Debanjum Singh Solanky	9b554feb91	Show agent details card on hover on agent pill on web app home page - Double click on agent to open edit agent card - Focus on chat input pane when agent selected/clicked for quick, smooth agent switch and message flow - Hover on agent to see agent detail card on non-mobile displays - Use debounce to only show when hover on card for a bit	2024-10-21 00:08:01 -07:00
Debanjum Singh Solanky	220ff1df62	Set chatInputArea forward ref from parent components for control	2024-10-21 00:02:48 -07:00
Debanjum Singh Solanky	54b92eaf73	Extract isUserSubscribed check from Agents page to make it resusable	2024-10-20 23:31:48 -07:00
Debanjum Singh Solanky	bdbe8f003e	Move agent details and edit card out into reusable components on web app	2024-10-20 23:31:47 -07:00
sabaimran	ad197be70c	Fix PDFs unit test, skip OCR	2024-10-20 22:25:41 -07:00
sabaimran	59fec37943	Improve agents management, and limit agents view to private and official agents - Default to None for the input_tools and output_modes so that they can be managed in the admin panel - Hold off on showing off all Public Agents until we have a better experience for user profiles etc.	2024-10-20 22:24:51 -07:00
sabaimran	a979457442	Add unit tests for agents - Add permutations of testing for with, without knowledge base. Private, public, different users.	2024-10-20 20:04:50 -07:00
sabaimran	fc70f25583	Release Khoj version 1.26.2	2024-10-20 18:03:36 -07:00
sabaimran	046de57571	Improve error handling when documents not searched with stack trace - Stop extract OCR content from PDFs - Only use agent knowledge base when user not provided	2024-10-20 18:03:14 -07:00
sabaimran	2b68d61fef	Release Khoj version 1.26.1	2024-10-20 16:21:51 -07:00
Debanjum Singh Solanky	5fca41cc29	Show agents sorted by mru, Select mru agent by default on web app Have get agents API return agents ordered intelligently - Put the default agent first - Sort used agents by most recently chatted with agent for ease of access - Randomly shuffle the remaining unused agents for discoverability	2024-10-20 15:21:25 -07:00
Debanjum Singh Solanky	a6bfdbdbfe	Show all agents in carousel on home screen agent pane of web app This change wraps the agent pane in a scroll area with all agents shown. It allows selecting an agent to chat with directly from the home screen without breaking flow and having to jump to the agents page. The previous flow was not convenient to quickly and consistently start chat with one of your standard agents. This was because a random subet of agents were shown on the home page. To start chat with an agent not shown on home screen load you had to open the agents page and initiate the conversation from there.	2024-10-20 15:21:25 -07:00
Debanjum Singh Solanky	9ffd726799	Allow making sync api requests with body from khoj.el	2024-10-20 15:16:40 -07:00
Debanjum Singh Solanky	ac51920859	Start conversation with Agents from within Emacs Exposes a transient switch with available agents as selectable options in the Khoj chat sub-menu. Currently shows agent slugs instead of agent names as options. This isn't the cleanest but gets the job done for now. Only new conversations with a different agent can be started. Existing conversations will continue with the original agent it was created with. The ability to switch the conversation's agent doesn't exist on the server yet.	2024-10-20 15:16:40 -07:00
Debanjum Singh Solanky	7646ac6779	Style user attached images as carousel on chat input area of web app	2024-10-20 00:40:08 -07:00
sabaimran	5d5bea6a5f	Ensure images are reset after messages processed	2024-10-19 22:02:06 -07:00
sabaimran	1ad6e1749f	Move window redirect to after relevant data is dropped in localStorage on the homage page One limitation of this methodology is that localStorage has a limit in how much data it can take. Should add more graceful error handling here as well.	2024-10-19 20:36:13 -07:00
sabaimran	cb6b3ec1e9	Improve mode description given to LLM when determining how to respond. Currently experiencing difficulty instruction following when an image is shared. It's more likely to try and output an image. Update to make a clearer distinction.	2024-10-19 20:35:32 -07:00
sabaimran	545259e308	Remove unused icons in chatInputArea	2024-10-19 16:54:21 -07:00
Debanjum Singh Solanky	3cc1426edf	Style user attached images with fixed height, in a single row on web app	2024-10-19 16:48:36 -07:00
Debanjum Singh Solanky	58a331227d	Display the attached images inside the chat input area on the web app - Put the attached images display div inside the same parent div as the text area - Keep the attachment, microphone/send message buttons aligned with the text area. So the attached images just show up at the top of the text area but everything else stays at the same horizontal height as before. - This improves the UX by - Ensuring that the attached images do not obscure the agents pane above the chat input area - The attached images visually look like they are inside the actual input area, rather than floating above it. So the visual aligns with the semantics	2024-10-19 16:29:45 -07:00
Debanjum Singh Solanky	3e39fac455	Add vision support for Gemini models in Khoj	2024-10-19 15:47:03 -07:00
Debanjum Singh Solanky	0d6a54c10f	Allow sharing multiple images as part of user query from the web app Previously the web app only expected a single image to be shared by the user as part of their query. This change allows sharing multiple images from the web app. Closes #921	2024-10-19 15:47:03 -07:00
Debanjum Singh Solanky	e2abc1a257	Handle multiple images shared in query to chat API Previously Khoj could respond to a single shared image at a time. This changes updates the chat API to accept multiple images shared by the user and send it to the appropriate chat actors including the openai response generation chat actor for getting an image aware response	2024-10-19 14:53:33 -07:00
Debanjum Singh Solanky	d55cba8627	Pass user query for chat response when document lookup fails Recent changes made Khoj try respond even when document lookup fails. This change missed handling downstream effects of a failed document lookup, as the defiltered_query was null and so the text response didn't have the user query to respond to. This code initializes defiltered_query to original user query to handle that. Also response_type wasn't being passed via send_message_to_model_wrapper_sync unlike in the async scenario	2024-10-19 14:32:19 -07:00
Debanjum Singh Solanky	a4e6e1d5e8	Share webp images from web, desktop, obsidian app to chat with	2024-10-19 14:32:17 -07:00
sabaimran	dbd9a945b0	Re-evaluate agent private/public filtering after authenticateddata is retrieved. Update selectedAgent check logic to reflect.	2024-10-18 09:31:56 -07:00
Debanjum Singh Solanky	35015e720e	Release Khoj version 1.26.0	2024-10-17 18:25:53 -07:00
Debanjum Singh Solanky	f0dcfe4777	Explicitly ask Gemini models to format their response with markdown Otherwise it can get confused by the format of the passed context (e.g respond in org-mode if context contains org-mode notes)	2024-10-17 18:12:47 -07:00
Debanjum	7fb4c2939d	Make Chat and Online Search Resilient and Faster (#936 ) ## Overview ### New - Support using Firecrawl(https://firecrawl.dev) to read web pages - Add, switch and re-prioritize web page reader(s) to use via the admin panel ### Speed - Improve response speed by aggregating web page read, extract queries to run only once for each web page ### Response Resilience - Fallback through enabled web page readers until web page read - Enable reading web pages on the internal network for self-hosted Khoj running in anonymous mode - Try respond even if web search, web page read fails during chat - Try respond even if document search via inference endpoint fails ### Fix - Return data sources to use if exception in data source chat actor ## Details ### Configure web page readers to use - Only the web scraper set in Server Chat Settings via the Django admin panel, if set - Otherwise use the web scrapers added via the Django admin panel (in order of priority), if set - Otherwise, use all the web scrapers enabled by settings API keys via environment variables (e.g `FIRECRAWL_API_KEY', `JINA_API_KEY' env vars set), if set - Otherwise, use Jina to web scrape if no scrapers explicitly defined For self-hosted setups running in anonymous-mode, the ability to directly read webpages is also enabled by default. This is especially useful for reading webpages in your internal network that the other web page readers will not be able to access. ### Aggregate webpage extract queries to run once for each distinct web page Previously, we'd run separate webpage read and extract relevant content pipes for each distinct (query, url) pair. Now we aggregate all queries for each url to extract information from and run the webpage read and extract relevant content pipes once for each distinct URL. Even though the webpage content extraction pipes were previously being run in parallel. They increased the response time by 1. adding more ~duplicate context for the response generation step to read 2. being more susceptible to variability in web page read latency of the parallel jobs The aggregated retrieval of context for all queries for a given webpage could result in some hit to context quality. But it should improve and reduce variability in response time, quality and costs. This should especially help with speed and quality of online search for offline or low context chat models.	2024-10-17 17:57:44 -07:00
Debanjum Singh Solanky	2c20f49bc5	Return enabled scrapers as WebScraper objects for more ergonomic code	2024-10-17 17:44:09 -07:00
Debanjum Singh Solanky	0db52786ed	Make web scraper priority configurable via admin panel - Simplifies changing order in which web scrapers are invoked to read web page by just changing their priority number on the admin panel. Previously you'd have to delete/, re-add the scrapers to change their priority. - Add help text for each scraper field to ease admin setup experience - Friendlier env var to use Firecrawl's LLM to extract content - Remove use of separate friendly name for scraper types. Reuse actual name and just make actual name better	2024-10-17 17:42:42 -07:00

... 4 5 6 7 8 ...

3885 commits