sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-12-04 21:03:01 +01:00

Author	SHA1	Message	Date
Debanjum Singh Solanky	982ac1859c	Parse markdown file as single entry if it fits with max token limits These changes improve entry context available to the search model Specifically this should improve entry context from short knowledge trees, that is knowledge bases with small files Previously we split all markdown files by their headings, even if the file was small enough to fit entirely within the max token limits of the search model. This used to reduce the context available to select the appropriate entries for a given query for the search model, especially from short knowledge trees	2024-04-04 02:41:55 +05:30
Debanjum Singh Solanky	d8f01876e5	Add parent heading ancestory to extracted markdown entries for context Improve, update the markdown to entries extractor tests	2024-04-04 02:41:55 +05:30
Debanjum Singh Solanky	86575b2946	Chunk text in preference order of para, sentence, word, character - Previous simplistic chunking strategy of splitting text by space didn't capture notes with newlines, no spaces. For e.g in #620 - New strategy will try chunk the text at more natural points like paragraph, sentence, word first. If none of those work it'll split at character to fit within max token limit - Drop long words while preserving original delimiters Resolves #620	2024-04-04 02:41:55 +05:30
Debanjum Singh Solanky	a627f56a64	Remove unused Entry to Jsonl converter from text to entry class, tests This was earlier used when the index was plaintext jsonl file. Now that documents are indexed in a DB this func is not required. Simplify org,md,pdf,plaintext to entries tests by removing the entry to jsonl conversion step	2024-04-04 02:41:55 +05:30
Debanjum Singh Solanky	28105ee027	Create wrapper function to get entries from org, md, pdf & text files - Convert extract_org_entries function to actually extract org entries Previously it was extracting intermediary org-node objects instead Now it extracts the org-node objects from files and converts them into entries - Create separate, new function to extract_org_nodes from files - Similarly create wrapper funcs for md, pdf, plaintext to entries - Update org, md, pdf, plaintext to entries tests to use the new simplified wrapper function to extract org entries	2024-04-04 02:41:55 +05:30
Debanjum Singh Solanky	f01a12b1d2	Improve styling of chat sessions side panel - Move green server connected dot to the bottom. Show status when disconnected from server - Move "New conversation" button to right of the "Conversation" title - Center alignment of the new conversation and connection status buttons	2024-04-04 01:43:26 +05:30
sabaimran	dd1e5e145a	Use List[Any] for typing	2024-04-03 21:46:41 +05:30
sabaimran	b8087c4c8e	Add typing to empty list variables in github_to_entries	2024-04-03 21:41:36 +05:30
sabaimran	d036fdfc26	If tree is not in the contents, then just return empty files list	2024-04-03 17:55:25 +05:30
Debanjum Singh Solanky	f915b2bd14	Fix passing model_name param to chatml formatter for online chat	2024-04-03 17:21:43 +05:30
sabaimran	6aa88761b8	Skip creating the default agent if there's no default conversation config	2024-04-03 17:21:01 +05:30
sabaimran	b4f71e06b3	Add timeout after 10 minutes of inactivity on socket	2024-04-02 22:12:27 +05:30
sabaimran	f48426623d	resolve merge conflict in chat.html	2024-04-02 17:29:48 +05:30
sabaimran	bf1187f465	Use new online/websearch logic and add agent to chat_metadata	2024-04-02 17:20:38 +05:30
sabaimran	867e1007d1	Remove superfluous newline	2024-04-02 17:20:08 +05:30
sabaimran	228ad68042	Merge with origin/master	2024-04-02 17:02:21 +05:30
sabaimran	776550d5ce	Add a migration for updating the default chat model, update for existing users	2024-04-02 17:01:31 +05:30
sabaimran	47fc7e1ce6	Rebase with matser	2024-04-02 16:16:06 +05:30
Debanjum	215ab6e66a	Extract More Dates from entries to improve Date Filter (#683 ) - Overview - Extract more structured date variants (e.g with dot(.) & slash(/) separators, 2-digit year) - Extract some natural, partial dates as well from entries - Capability Add ability to extract the following additional date forms: - Natural Dates: 21st April 2000, February 29 2024 - Partial Natural Dates: March 24, Mar 2024 - Structured Dates: 20/12/24, 20.12.2024, 2024/12/20 Note: Previously only YYYY-MM-DD ISO-8601 structured date form was extracted for date filters - Performance Using regexes is MUCH faster than using the `dateparser' python library It's a little crude but gives acceptable performance for large datasets	2024-04-02 16:14:53 +05:30
Debanjum Singh Solanky	7afee2d55c	Let offline chat model set context window. Improve, fix prompts	2024-03-31 16:19:35 +05:30
Debanjum Singh Solanky	4228965c9b	Handle msg truncation when question is larger than max prompt size Notice and truncate the question it self at this point	2024-03-31 15:50:06 +05:30
Debanjum Singh Solanky	886d49e3a4	Merge branch 'master' into migrate-to-llama-cpp-for-offline-chat	2024-03-31 00:59:20 +05:30
Debanjum Singh Solanky	4f65dde201	Release Khoj version 1.8.0	2024-03-31 00:06:15 +05:30
Debanjum Singh Solanky	7923903d21	Improve date filter regexes to extract structured, natural, partial dates - Much faster than using dateparser - It took 2x-4x for improved regex to extracts 1-15% more dates - Whereas It took 33x to 100x for dateparser to extract 65% - 400% more dates - Improve date extractor tests to test deduping dates, natural, structured date extraction from content - Extract some natural, partial dates and more structured dates Using regex is much faster than using dateparser. It's a little crude but should pay off in performance. Supports dates of form: - (Day-of-Month) Month\|AbbreviatedMonth Year\|2DigitYear - Month\|AbbreviatedMonth (Day-of-Month) Year\|2DigitYear	2024-03-30 00:07:19 +05:30
Debanjum Singh Solanky	104eeea274	Extract natural language and locale specific dates in content Previously we just extracted dates in YYYY-MM-DD format from content for date filterings during search. Use dateparser to extract dates across locales and natural language This should improve notes returned as context when chat searches knowledge base with date filters Fallback to regex for date parsing from content if dateparser fails - Limit natural date extractor capabilities to improve performance - Assume language is english Language detection otherwise takes a REALLY long time - Do not extract unix timestamps, timezone - This isn't required, as just using date and approximating dates as UTC	2024-03-30 00:06:56 +05:30
sabaimran	1195f843a3	Remove forward slash from the root agents endpoint	2024-03-28 23:06:55 +05:30
sabaimran	a1729b9b9e	Add telemetry for agents used in conversation, increase image width in agents page	2024-03-28 22:18:11 +05:30
sabaimran	d503b3e867	Use Personality vernacular in agent page - When setting up the default agent, configure every conversation that doesn't have an agent to use the Khoj agent - Fix reverse migration for the locale removal migration	2024-03-28 15:07:02 +05:30
sabaimran	e59de8c9b1	Constrain width/size of agent image in agents view	2024-03-28 13:32:11 +05:30
sabaimran	51d0c9b8b0	Add telemetry to keep state of new agents being used	2024-03-28 11:37:24 +05:30
sabaimran	46ebc55e2b	Add a top tab for agents	2024-03-28 11:37:01 +05:30
sabaimran	8397187231	Use default agent when creating a new conversation without agent specified	2024-03-28 11:36:27 +05:30
Debanjum Singh Solanky	4912c0ee30	Use extract queries actor to improve notes search with offline chat Previously we were skipping the extract questions step for offline chat as default offline chat model wasn't good enough to output proper json given the time it took to extract questions. The new default offline chat models gives json much more regularly and with date filters, so the extract questions step becomes useful given the impact on latency	2024-03-26 22:33:01 +05:30
Debanjum Singh Solanky	1ebd5c3648	Rename GPT4AllChatProcessor* to OfflineChatProcessor Config, Model	2024-03-26 22:33:01 +05:30
Debanjum Singh Solanky	2a0b943bb4	Use Hermes-2-Pro as default offline chat model in khoj.yml	2024-03-26 22:33:01 +05:30
Debanjum Singh Solanky	8ca39a436c	Use llama.cpp for offline chat models - Benefits of moving to llama-cpp-python from gpt4all: - Support for all GGUF format chat models - Support for AMD, Nvidia, Mac, Vulcan GPU machines (instead of just Vulcan, Mac) - Supports models with more capabilities like tools, schema enforcement, speculative ddecoding, image gen etc. - Upgrade default chat model, prompt size, tokenizer for new supported chat models - Load offline chat model when present on disk without requiring internet - Load model onto GPU if not disabled and device has GPU - Load model onto CPU if loading model onto GPU fails - Create helper function to check and load model from disk, when model glob is present on disk. `Llama.from_pretrained' needs internet to get repo info from HuggingFace. This isn't required, if the model is already downloaded Didn't find any existing HF or llama.cpp method that looked for model glob on disk without internet	2024-03-26 22:33:01 +05:30
Debanjum Singh Solanky	0a7392f6ec	Only add location to image prompt generator when location known	2024-03-26 22:33:01 +05:30
sabaimran	fdf78525b4	Part 2: Add web UI updates for basic agent interactions (#675 ) * Initial pass at backend changes to support agents - Add a db model for Agents, attaching them to conversations - When an agent is added to a conversation, override the system prompt to tweak the instructions - Agents can be configured with prompt modification, model specification, a profile picture, and other things - Admin-configured models will not be editable by individual users - Add unit tests to verify agent behavior. Unit tests demonstrate imperfect adherence to prompt specifications * Customize default behaviors for conversations without agents or with default agents * Add a new web client route for viewing all agents * Use agent_id for getting correct agent * Add web UI views for agents - Add a page to view all agents - Add slugs to manage agents - Add a view to view single agent - Display active agent when in chat window - Fix post-login redirect issue * Fix agent view * Spruce up the 404 page and improve the overall layout for agents pages * Create chat actor for directly reading webpages based on user message - Add prompt for the read webpages chat actor to extract, infer webpage links - Make chat actor infer or extract webpage to read directly from user message - Rename previous read_webpage function to more narrow read_webpage_at_url function * Rename agents_page -> agent_page * Fix unit test for adding the filename to the compiled markdown entry * Fix layout of agent, agents pages * Merge migrations * Let the name, slug of the default agent be Khoj, khoj * Fix chat-related unit tests * Add webpage chat command for read web pages requested by user Update auto chat command inference prompt to show example of when to use webpage chat command (i.e when url is directly provided in link) * Support webpage command in chat API - Fallback to use webpage when SERPER not setup and online command was attempted - Do not stop responding if can't retrieve online results. Try to respond without the online context * Test select webpage as data source and extract web urls chat actors * Tweak prompts to extract information from webpages, online results - Show more of the truncated messages for debugging context - Update Khoj personality prompt to encourage it to remember it's capabilities * Rename extract_content online results field to webpages * Parallelize simple webpage read and extractor Similar to what is being done with search_online with olostep * Pass multiple webpages with their urls in online results context Previously even if MAX_WEBPAGES_TO_READ was > 1, only 1 extracted content would ever be passed. URL of the extracted webpage content wasn't passed to clients in online results context. This limited them from being rendered * Render webpage read in chat response references on Web, Desktop apps * Time chat actor responses & chat api request start for perf analysis * Increase the keep alive timeout in the main application for testing * Do not pipe access/error logs to separate files. Flow to stdout/stderr * [Temp] Reduce to 1 gunicorn worker * Change prod docker image to use jammy, rather than nvidia base image * Use Khoj icon when Khoj web is installed on iOS as a PWA * Make slug required for agents * Simplify calling logic and prevent agent access for unauthenticated users * Standardize to use personality over tuning in agent nomenclature * Make filtering logic more stringent for accessible agents and remove unused method: * Format chat message query --------- Co-authored-by: Debanjum Singh Solanky <debanjum@gmail.com>	2024-03-26 18:13:24 +05:30
Debanjum Singh Solanky	15ed208996	Use Khoj icon when Khoj web is installed on iOS as a PWA	2024-03-26 00:13:12 +05:30
Debanjum	586654e2af	Allow directly reading web pages, even when SERP not enabled (#676 ) ### Overview Khoj can now read website directly without needing to go through the search step first ### Details - Parallelize simple webpage read and extractor - Rename extract_content online results field to web pages - Tweak prompts to extract information from webpages, online results - Test select webpage as data source and extract web urls chat actors - Render webpage read in chat response references on Web, Desktop apps - Pass multiple webpages with their urls in online results context - Support webpage command in chat API - Add webpage chat command for read web pages requested by user - Create chat actor for directly reading webpages based on user message	2024-03-24 16:25:25 +05:30
Debanjum Singh Solanky	9e52ae9e98	Time chat actor responses & chat api request start for perf analysis	2024-03-24 15:47:38 +05:30
Debanjum Singh Solanky	dabf71bc3c	Render webpage read in chat response references on Web, Desktop apps	2024-03-24 15:47:38 +05:30
Debanjum Singh Solanky	a2e79c94be	Pass multiple webpages with their urls in online results context Previously even if MAX_WEBPAGES_TO_READ was > 1, only 1 extracted content would ever be passed. URL of the extracted webpage content wasn't passed to clients in online results context. This limited them from being rendered	2024-03-24 15:47:38 +05:30
Debanjum Singh Solanky	71b6905008	Parallelize simple webpage read and extractor Similar to what is being done with search_online with olostep	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	1167f6ddf9	Rename extract_content online results field to webpages	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	b22a7dae5d	Tweak prompts to extract information from webpages, online results - Show more of the truncated messages for debugging context - Update Khoj personality prompt to encourage it to remember it's capabilities	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	ad6f6bb0ed	Support webpage command in chat API - Fallback to use webpage when SERPER not setup and online command was attempted - Do not stop responding if can't retrieve online results. Try to respond without the online context	2024-03-24 15:46:29 +05:30
Debanjum Singh Solanky	a6b7432837	Add webpage chat command for read web pages requested by user Update auto chat command inference prompt to show example of when to use webpage chat command (i.e when url is directly provided in link)	2024-03-24 15:46:29 +05:30
sabaimran	8abc8ded82	Part 1: Server-side changes to support agents integrated with Conversations (#671 ) * Initial pass at backend changes to support agents - Add a db model for Agents, attaching them to conversations - When an agent is added to a conversation, override the system prompt to tweak the instructions - Agents can be configured with prompt modification, model specification, a profile picture, and other things - Admin-configured models will not be editable by individual users - Add unit tests to verify agent behavior. Unit tests demonstrate imperfect adherence to prompt specifications * Customize default behaviors for conversations without agents or with default agents * Use agent_id for getting correct agent * Merge migrations * Simplify some variable definitions, add additional security checks for agents * Rename agent.tuning -> agent.personality	2024-03-23 22:09:38 +05:30
sabaimran	4deb849fb1	Merge branch 'features/add-agents-ui' of github.com:khoj-ai/khoj into features/chat-socket-streaming	2024-03-23 14:04:25 +05:30
sabaimran	8edbd7094f	Let the name, slug of the default agent be Khoj, khoj	2024-03-23 14:03:58 +05:30
sabaimran	6b4c4f10b5	Merge branch 'features/add-agents-ui' of github.com:khoj-ai/khoj into features/chat-socket-streaming	2024-03-23 11:22:00 +05:30
sabaimran	20617614ae	Merge branch 'features/customize-chat-with-agents' of github.com:khoj-ai/khoj into features/add-agents-ui	2024-03-23 11:20:57 +05:30
sabaimran	2399d91f61	Merge migrations	2024-03-22 10:05:33 +05:30
sabaimran	d38089ab57	Merge with origin	2024-03-22 09:55:33 +05:30
Debanjum Singh Solanky	aed4313cfc	Fix updating specific conversation by id from the chat API endpoint - Use the conversation id of the retrieved conversation rather than the potentially unset conversation id passed via API - await creating new chat when no chat id provided and no existing conversations exist	2024-03-21 02:46:52 +05:30
sabaimran	6ba0d8e379	Add a connected notification if the websocket is connected	2024-03-20 20:53:28 +05:30
sabaimran	255b69dc58	Add a comma delimeter between outputted search queries	2024-03-20 19:43:35 +05:30
sabaimran	d84188b221	Scroll down when a message is added in the chat interface's handle stream response method	2024-03-20 15:04:41 +05:30
sabaimran	70ad78990a	Use a common method for sending a generic message to the client from the server in the ws connection	2024-03-20 15:04:14 +05:30
sabaimran	d4e83b060a	Update the web UI for the chat interface to establish a connection via a socket to the server - Move some common methods into separate functions to make the UI components more efficient - The normal HTTP-based chat connection will still work and serves as a fallback if the websocket is unavailable	2024-03-20 14:34:47 +05:30
sabaimran	a346f79b39	Add support for chatting via the web socket connection - Convert to a model of calling the search API directly with a function call (rather than using the API method) - Gracefully handle websocket connection disconnects - Ensure that the rest of the response is still saved, as it is currently, if the user disconects from the client - Setup unchangeable context at the beginning of the session when the connection is established (like location, username, etc)	2024-03-20 14:33:33 +05:30
Debanjum Singh Solanky	62a83dc9bb	Fix online search actor to use natural dates not after: operator The recently added after: operator to online search actor was too restrictive, gave worse results than when just use natural language dates in search query	2024-03-15 21:50:14 +05:30
Debanjum Singh Solanky	4a1e6a2275	Convert deleted old user requests log line to debug from info	2024-03-15 20:50:10 +05:30
Debanjum Singh Solanky	9a068dadbf	Fix extract questions prompt to use YYYY-MM-DD date filter format	2024-03-15 18:43:18 +05:30
Debanjum Singh Solanky	ecddf98430	Handle truncation when single long non-system chat message Previously was assuming the system prompt is being always passed as the first message. So expected there to be at least 2 messages in logs. This broke chat actors querying with single long non system message. A more robust way to extract system prompt is via the message role instead	2024-03-15 15:58:39 +05:30
Debanjum Singh Solanky	ec0c35b7ed	Improve delete, rename chat session UX in Desktop, Web app - Ask for Confirmation before deleting chat session in Desktop, Web app - Save chat session rename on hitting enter in title edit input box - No need to flash previous conversation cleared status message - Move chat session delete button after rename button in Desktop app	2024-03-15 15:58:19 +05:30
Debanjum Singh Solanky	924b1215ce	Allow unset locale for Google authenticated user	2024-03-15 15:35:20 +05:30
Debanjum Singh Solanky	c792fa819f	Fix setting chat session title from Desktop app Pass auth headers to not have the chat session title update request fail	2024-03-15 15:19:20 +05:30
Debanjum Singh Solanky	c9e05dc184	Get conversation by title when requested via chat API	2024-03-15 12:31:50 +05:30
sabaimran	724557fc7b	Merge branch 'master' of github.com:khoj-ai/khoj into features/add-agents-ui	2024-03-15 12:14:34 +05:30
sabaimran	7fc484ba7a	Merge branch 'master' of github.com:khoj-ai/khoj into features/customize-chat-with-agents	2024-03-15 12:13:28 +05:30
Debanjum Singh Solanky	cac26dafe3	Only create new chat on get if a specific chat id, slug isn't requested	2024-03-15 11:58:39 +05:30
sabaimran	416feb13ef	Fix layout of agent, agents pages	2024-03-15 11:17:40 +05:30
sabaimran	d734be61cf	Rename agents_page -> agent_page	2024-03-15 10:17:51 +05:30
Debanjum Singh Solanky	08993ff109	Add new, remove old known chat models from model to prompt size map	2024-03-15 04:02:25 +05:30
Debanjum Singh Solanky	fba0338787	Release Khoj version 1.7.0	2024-03-15 00:08:32 +05:30
Debanjum Singh Solanky	6118d1ff57	Create chat actor for directly reading webpages based on user message - Add prompt for the read webpages chat actor to extract, infer webpage links - Make chat actor infer or extract webpage to read directly from user message - Rename previous read_webpage function to more narrow read_webpage_at_url function	2024-03-14 14:58:37 +05:30
Debanjum	e549824fe2	Improve OpenAI Chat Actors and their prompts (#673 ) ### Major - Enforce json mode response from OpenAI chat actors prev using string lists - Use `gpt-4-turbo-preview' as default chat model, extract questions actor - Make Khoj read khoj website to respond with accurate, up-to-date information about itself - Dedupe query in notes prompt. Improve OAI chat actor, director tests ### Minor - Test data source, output mode selector, web search query chat actors - Improve notes search actor to always create a non-empty list of queries - Construct available data sources, output modes as a bullet list in prompts - Use consistent agent name across static and dynamic examples in prompts - Add actor's name to extract questions prompt to improve context for guidance	2024-03-14 12:44:40 +05:30
sabaimran	3caf0a79d8	Spruce up the 404 page and improve the overall layout for agents pages	2024-03-14 11:26:49 +05:30
sabaimran	c45030af44	Fix agent view	2024-03-14 11:13:19 +05:30
Debanjum Singh Solanky	a1ce12296f	Fix rendering online with note references post streaming chat response Previously only the notes references would get rendered post response streaming when when both online and notes references were used to respond to the user's message	2024-03-14 03:40:40 +05:30
Debanjum Singh Solanky	1aeea3d854	Fix opening external links from confirmation dialog box on desktop app	2024-03-14 02:29:22 +05:30
Debanjum Singh Solanky	2e5cc49cb3	Enforce json response from OpenAI chat actors prev using string lists - Allow passing response format type to OpenAI API via chat actors - Convert in-context examples to use json objects instead of str lists - Update actors outputting str list to request output to be json_object - OpenAI's json mode enforces the model to output valid json object	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	7211eb9cf5	Default to gpt-4-turbo-preview for chat model, extract questions actor GPT-4 is more expensive and generally less capable than gpt-4-turbo-preview	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	dd883dc53a	Dedupe query in notes prompt. Improve OAI chat actor, director tests - Remove stale tests - Improve tests to pass across gpt-3.5 and gpt-4-turbo - The haiku creation director was failing because of duplicate query in instantiated prompt	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	14682d5354	Improve notes search actor to always create a non-empty list of queries - Remove the option for Notes search query generation actor to return no queries. Whether search should be performed is decided before, this step doesn't need to decide that - But do not throw warning if the response is a list with no elements	2024-03-14 01:22:33 +05:30
Debanjum Singh Solanky	f5734826cb	Improve pick data source prompt to look online for info about Khoj - Add examples where user queries requesting information about Khoj results in the "online" data source being selected - Add an example for "general" to select chat command prompt	2024-03-14 01:21:13 +05:30
Debanjum Singh Solanky	9a516bed47	Construct available data sources, output modes as a bullet list in prompts	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	f28fb89af8	Use consistent agent name across static and dynamic examples in prompts Previously the examples constructed from chat history used "Khoj" as the agent's name but all 3 prompts using the func used static examples with "AI:" as the pertinent agent's name	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	f5793149a9	Add actor's name to extract questions prompt to improve context for guidance	2024-03-14 00:34:57 +05:30
Debanjum Singh Solanky	73ad444086	Make online search Actor read khoj.dev for docs, info about Khoj - Add example to read khoj.dev website for up-to-date info to setup, use khoj, discover khoj features etc. - Online search should use site: and after: google search operators - Show example of adding the after: date filter to google search - Give local event lookup example using user's current location in query - Remove unused select search content type prompt	2024-03-14 00:34:57 +05:30
sabaimran	290712c3fe	Add web UI views for agents - Add a page to view all agents - Add slugs to manage agents - Add a view to view single agent - Display active agent when in chat window - Fix post-login redirect issue	2024-03-14 00:07:36 +05:30
Debanjum	3abe7ccb26	Improve Online Search Speed and Context (#670 ) ### Major - Read web pages in parallel to improve chat response time - Read web pages directly when Olostep proxy not setup - Include search results & web page content in online context for chat response ### Minor - Simplify, modularize and add type hints to online search functions	2024-03-11 22:16:30 +05:30
Debanjum Singh Solanky	dc86e44a07	Include search results & webpage content in online context for chat response Previously if a web page was read for a sub-query, only the extracted web page content was provided as context for the given sub-query. But the google results themselves have relevant snippets. So include them	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	d136a6be44	Simplify, modularize and add type hints to online search functions - Simplify content arg to `extract_relevant_info' function. Validate, clean the content arg inside the `extract_relevant_info' function - Extract `search_with_google' function outside the parent function - Call the parent function a more appropriate `search_online' instead of `search_with_google' - Simplify the `search_with_google' function using list comprehension. Drop empty search result fields from chat model context for response to reduce cost and response latency - No need to show stacktrace when unable to read webpage, basic error is enough - Add type hints to online search functions to catch issues with mypy	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	88f096977b	Read webpages directly when Olostep proxy not setup This is useful for self-hosted, individual user, low traffic setups where a proxy service is not required	2024-03-11 18:41:02 +05:30
Debanjum Singh Solanky	ca2f962e95	Read, extract information from web pages in parallel to lower response time - Time reading webpage, extract info from webpage steps for perf analysis - Deduplicate webpages to read gathered across separate google searches - Use aiohttp to make API requests non-blocking, pair with asyncio to parallelize all the online search webpage read and extract calls	2024-03-11 18:41:02 +05:30
sabaimran	8e1445b15b	Use agent_id for getting correct agent	2024-03-11 14:44:46 +05:30
sabaimran	6ab649312f	Add a new web client route for viewing all agents	2024-03-11 14:40:40 +05:30
sabaimran	352168d6c2	Customize default behaviors for conversations without agents or with default agents	2024-03-11 14:20:28 +05:30
sabaimran	9b88976f36	Initial pass at backend changes to support agents - Add a db model for Agents, attaching them to conversations - When an agent is added to a conversation, override the system prompt to tweak the instructions - Agents can be configured with prompt modification, model specification, a profile picture, and other things - Admin-configured models will not be editable by individual users - Add unit tests to verify agent behavior. Unit tests demonstrate imperfect adherence to prompt specifications	2024-03-11 12:45:24 +05:30
Debanjum	18fa3e2384	Rerank Search Results by Default on GPU machines (#668 ) - Trigger SentenceTransformer Cross Encoder models now run fast on GPU enabled machines, including Mac ARM devices since UKPLab/sentence-transformers#2463 - Details - Use cross-encoder to rerank search results by default on GPU machines and when using an inference server - Only call search API when pause in typing search query on web, desktop apps	2024-03-10 15:15:25 +05:30
Debanjum Singh Solanky	53d402480c	Rerank search results with cross-encoder when using an inference server If an inference server is being used, we can expect the cross encoder to be running fast enough to rerank search results by default	2024-03-10 15:09:46 +05:30
Debanjum Singh Solanky	44c8d09342	Only call search API when pause in typing search query on web, desktop apps Wait for 300ms since stop typing before calling search API. This smooths out UI jitter when rendering search results, especially now that we're reranking for every search query on GPU enabled devices Emacs already has 300ms debounce time. More convoluted to add debounce time to Obsidian search modal, so not updating that yet	2024-03-10 14:29:24 +05:30
Debanjum Singh Solanky	1105d8814f	Use cross-encoder to rerank search results by default on GPU machines Latest sentence-transformer package uses GPU for cross-encoder. This makes it fast enough to enable reranking on machines with GPU. Enabling search reranking by default allows (at least) users with GPUs to side-step learning the UI affordance to rerank results (i.e hitting Cmd/Ctrl-Enter or ENTER).	2024-03-10 14:29:21 +05:30
Debanjum Singh Solanky	fd81446ba3	Do not create new chat session when an old chat session is deleted - Fix `get_conversation_by_user' shouldn't return new conversation if conversation with requested id not found. It should only return new conversation if no specific conversation is requested and no conversations found for user at all - Repro - Delete a new chat, this calls loadChat via window.onload which calls server /chat/history API endpoint with conversationId set to that of just deleted conversation sporadically The call to GET chat/history API with conversationId set occurs when window.onload triggers before the conversationId is deleted by the delete button after the DELETE /chat/history API call (via race) - In such a scenario, get_conversation_by_user called by chat/history API with conversationId of deleted conversation returns a new conversation - Miscellaneous - Chat history load should be logged as call to that chat_history api, not the "chat" api - Show status updates of clearing conversation history in chat input - Simplify web, desktop client code by removing unnecessary new variables	2024-03-10 02:17:23 +05:30
Debanjum Singh Solanky	b7fad04870	Use consistent field name for queries in chat history & better image prompt	2024-03-09 19:11:03 +05:30
sabaimran	6aae9864d3	Fix Notion indexing and add an admin view for Entry objects	2024-03-09 16:25:23 +05:30
sabaimran	12d6c4da7d	Only include inferred queries in the conversation history for images, not links. Overflow the side panel when too long	2024-03-09 11:59:35 +05:30
sabaimran	e5cd0237e3	Release Khoj version 1.6.2	2024-03-08 17:04:03 +05:30
Debanjum Singh Solanky	446ac7649d	Remove unused js method in web chat client, add newline to web data in prompt	2024-03-08 16:40:39 +05:30
Debanjum Singh Solanky	12d32ac99c	Increase user visibility into more errors during image generation Catch OpenAI connection error and errors during better image prompt generation	2024-03-08 16:40:39 +05:30
sabaimran	ff31759423	Fix target determination in the copy programmatic output button	2024-03-08 16:33:12 +05:30
sabaimran	9f934929c6	Infer mime type from file ending when not available in browser. Don't output image in conversation turns	2024-03-08 12:34:26 +05:30
sabaimran	81beb7940c	Upload generated images to s3, if AWS credentials and bucket is available (#667 ) * Upload generated images to s3, if AWS credentials and bucket is available. - In clients, render the images via the URL if it's returned with a text-to-image2 intent type * Make the loading screen more intuitve, less jerky and update the programmatic copy button * Update the loading icon when waiting for a chat response	2024-03-08 10:54:13 +05:30
sabaimran	13894e1fd5	add instructions for drag/drop files in sys prompt	2024-03-07 17:57:42 +05:30
sabaimran	7357b6eff1	Revert white-space preline and add more detailed help text when selecting file	2024-03-06 16:47:27 +05:30
sabaimran	b615c0719e	Support upload for files via drag/drop in the web UI (#666 ) * Add additional styling changes for showing UI changes when dragging file to the main screen * Add a loading spinner when file upload is in progress, and don't index github/notion when indexing files * Add an explicit icon for file uploading in the chat button menu * Add appropriate dragover styling when picking a file from the file picker/browser * Add a loading screen when retrieving chat history. Fix width of the chat window. Put attachment icon to the left of chat input	2024-03-06 16:43:05 +05:30
sabaimran	e323a6d69b	Include additional user context in the image generation flow (#660 ) * Make major improvements to the image generation flow - Include user context from online references and personal notes for generating images - Dynamically select the modality that the LLM should respond with - Retun the inferred context in the query response for the dekstop, web chat views to read * Add unit tests for retrieving response modes via LLM * Move output mode unit tests to the actor suite, rather than director * Only show the references button if there is at least one available * Rename aget_relevant_modes to aget_relevant_output_modes * Use a shared method for generating reference sections, simplify some of the prompting logic * Make out of space errors in the desktop client more obvious	2024-03-06 13:48:41 +05:30
Debanjum Singh Solanky	2d61591c22	Improve user visibility into errors during image generation	2024-02-29 13:19:13 +05:30
sabaimran	0bbb5cff85	Release Khoj version 1.6.1	2024-02-26 13:27:20 -08:00
sabaimran	c8194a7364	Make out of space errors in the desktop client more obvious	2024-02-26 11:53:36 -08:00
Debanjum Singh Solanky	956dd71d91	Clean entry before adding to DB and log when it fails Remove \0 null characters from entry fields as this is causing indexing errors	2024-02-27 01:19:34 +05:30
Debanjum Singh Solanky	bb613a8e1d	Make indentation styling more compact on Obsidian client	2024-02-25 14:41:45 +05:30
Debanjum Singh Solanky	682b70011f	Set chat body height to remove UX jitter on chat history load in Web, Desktop	2024-02-25 14:40:47 +05:30
Debanjum Singh Solanky	efe86ce159	Fix saved conversation logger to handle image responses	2024-02-25 13:46:32 +05:30
Debanjum Singh Solanky	4839f2901a	Open external links in Desktop app with default app for url on OS - Open external links using the default link handler registered on OS for the link type, e.g http:// -> firefox, mailto: thunderbird etc - Confirm before opening non-http URL using an external app	2024-02-25 13:21:52 +05:30
Debanjum	170bce2c02	Fix, Improve rendering images in Obsidian, Desktop, Web clients (#659 ) - Improve render of inferred query in image chat messages in Web, Desktop apps - Add inferred queries to image chat responses in Obsidian client - Fix rendering images from Khoj response in Obsidian client	2024-02-25 00:56:26 +05:30
Debanjum Singh Solanky	f84606325c	Improve render of inferred query in image chat messages in Web, Desktop apps	2024-02-25 00:47:06 +05:30
Debanjum Singh Solanky	a2e53d5e41	Add inferred queries to image chat responses in Obsidian client	2024-02-25 00:24:58 +05:30
Debanjum Singh Solanky	9b61f0b5f7	Fix rendering images from Khoj response in Obsidian client	2024-02-25 00:11:11 +05:30
sabaimran	b9d0533d92	Misc. fixes to prompting, admin, and others (#658 ) * Simplify and clarify prompt for selecting toolset dynamically * Add error handling around call to OLOSTEP api * Fix conversation admin page * Skip adding none or empty entries in the chunking method	2024-02-24 10:25:42 -08:00
Debanjum Singh Solanky	0e0e751ef7	Improve docstring of entrypoint function to the emacs client	2024-02-24 21:09:41 +05:30
Debanjum	8855529637	Improve Syncing Obsidian Vault, Invalidate Static Assets in Browser Cache in Web Client (#657 ) - Improve - Only send files modified since their last sync for indexing on server from the Obsidian client - Fix - Invalidate static asset browser cache in Web client when Khoj version changes	2024-02-24 20:20:30 +05:30
Debanjum Singh Solanky	a46f70c4b0	Remove deprecated lastSyncedFiles settings field from Obsidian client	2024-02-24 20:18:22 +05:30
Debanjum Singh Solanky	03a6b491b2	Warn when can't identify mimeType of files in Desktop, Obsidian clients	2024-02-24 19:59:03 +05:30
Debanjum Singh Solanky	3675ab4864	Only sync modified files from the Obsidian client Previously we'd send all files in vault and let the server deduplicate. This changes takes inspiration from the desktop app, and only pushes files which were modified after their previous sync with the server. This should reduce the processing load on the server	2024-02-24 07:48:40 +05:30
Debanjum Singh Solanky	ddfbf31bc8	Append version query param to web asset URLs to bypass browser cache Ensure latest assets are loaded when khoj version is updated	2024-02-24 06:49:25 +05:30
sabaimran	42773e808c	Retrieve, create, and save conversations differently for ClientApplications (#656 ) * Retrieve, create, and save conversations differently if they're coming from a client application - Not all of our client apps will necessarily maintain state over the conversation IDs available to a user. For some (single-threaded conversations), it should just use a single conversation. Fix the code to do so * Simplify conversation retrieval logic * Keep 0 padding below chat response * Add order_by sorting to retrieving the conversation without id	2024-02-23 11:32:00 -08:00
Debanjum	9afb2a14ef	Fix and Improve Chat UI in Web, Desktop apps (#655 ) ### Improvements to Chat UI on Web, Desktop apps - Improve styling of chat session side panel - Improve styling of chat message bubble in Desktop, Web app - Add frosted, minimal chat UI to background of Login screen - Improve PWA install experience of Khoj ### Fixes to Chat UI on Web, Desktop apps - Fix creating new chat sessions from the Desktop app - Only show 3 starter questions even when consecutive chat sessions created ### Other Improvements - Update Khoj cloud trial period to a fortnight instead of a week - Document using venv to handle dependency conflict on khoj pip install Resolves #276	2024-02-23 19:27:02 +05:30
Debanjum Singh Solanky	c70ca78cdc	Improve PWA install experience for Khoj on Desktop, Mobile - Resolve PWA issues thrown by Chrome/Edge - Add screenshot samples showcasing remember, browse and draw features - This can provide a richer app store like experience when installing Khoj PWA on Mobile or Desktop - Add wide and narrow screenshots to show Mobile vs Desktop UX - Add higher resolution favicon for PWA - Use single web manifest instead of separate ones for Chat, Search - Update manifest description with more details about Khoj features	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	e10b260988	Update web login screen to show frosted minimal chat UI in background	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	1b0318564e	Log when conversation turn is saved to DB	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	4c39960917	Make number of conversation starters to get from DB configurable	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	50617594fd	Only show 3 starter questions even when consecutive chat sessions created Reset starter question suggestions before appending in web, desktop app Otherwise previously it'd keep adding to existing starter question suggestions on each new session creation if multiple consecutive new chat sessions created. This would result in more than the 3 expected starter questions being displayed at a time	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	102f5c3f53	Improve styling of chat session side panel - Make collapse, expand toggle arrow point in the direction the action will expand the side panel in - Make the collapsed side panel reduce to a 1px sliver	2024-02-23 18:59:52 +05:30
Debanjum Singh Solanky	6283d9fe83	Update Khoj cloud trial period to a fortnight instead of a week - Improve rate limit error message wording - Make the "too many requests" error message more robust. Should throw that exception fix self.request >= self.subscribed_requests because upgrading wouldn't fix this rate limiting	2024-02-23 18:33:56 +05:30
Debanjum Singh Solanky	05c1903784	Fix creating new chat sessions from the Desktop app Code wasn't passing the authorization header in the POST request to create new chat session	2024-02-23 18:33:56 +05:30
Debanjum Singh Solanky	8a219b6e9c	Improve styling of chat message bubble in Desktop, Web app - Respect newline with pre-line but not for bullets to improve formatting of responses by Khoj - Respect bold font by loading tajawal font with other weights - Reduce bottom margin in chat message bubble, its taking too much space	2024-02-23 18:33:56 +05:30
sabaimran	b4902090e7	Misc. chat and application improvements (#652 ) * Document original query when subqueries can't be generated * Only add messages to the chat message log if it's non-empty * When changing the search model, alert the user that all underlying data will be deleted * Adding more clarification to the prompt input for username, location * Check if has_more is in the notion results before getting next_cursor * Update prompt template for user name/location, update confirmation message when changing search model	2024-02-22 19:09:22 -08:00
Debanjum Singh Solanky	7271164256	Set chat session title to textContent of the chat session HTML element We don't expect/want the user to use HTML titles for chat session	2024-02-23 02:07:08 +05:30
sabaimran	f8ec6b4464	Remove backslash for default route in api_chat	2024-02-20 20:09:44 -08:00
sabaimran	b1c86fee3b	Release Khoj version 1.6.0	2024-02-20 14:12:24 -08:00
sabaimran	44f8f20ea7	Miscellaneous bugs and fixes for chat sessions (#646 ) * Display given_name field only if it is not None * Add default slugs in the migration script * Ensure that updated_at is saved appropriately, make sure most recent chat is returned for default history * Remove the bin button from the chat interface, given deletion is handled in the drop-down menus * Refresh the side panel when a new chat is created * Improveme tool retrieval prompt, don't let /online fail, and improve parsing of extract questions * Fix ending chat response by offline chat on hitting a stop phrase Previously the whole phrase wouldn't be in the same response chunk, so chat response wouldn't stop on hitting a stop phrase Now use a queue to keep track of last 3 chunks, and to stop responding when hit a stop phrase * Make chat on Obsidian backward compatible post chat session API updates - Make chat on Obsidian get chat history from `responseJson.response.chat' when available (i.e when using new api) - Else fallback to loading chat history from responseJson.response (i.e when using old api) * Fix detecting success of indexing update in khoj.el When khoj.el attempts to index on a Khoj server served behind an https endpoint, the success reponse status contains plist with certs. This doesn't mean the update failed. Look for :errors key in status instead to determine if indexing API call failed. This fixes detecting indexing API call success on the Khoj Emacs client, even for Khoj servers running behind SSL/HTTPS * Fix the mechanism for populating notes references in the conversation primer for both offline and online chat * Return conversation.default when empty list for dynamic prompt selection, send all cmds in telemetry * Fix making chat on Obsidian backward compatible post chat session API updates New API always has conversation_id set, not `chat' which can be unset when chat session is empty. So use conversation_id to decide whether to get chat logs from `responseJson.response.chat' or `responseJson.response' instead --------- Co-authored-by: Debanjum Singh Solanky <debanjum@gmail.com>	2024-02-20 13:55:35 -08:00
sabaimran	138f5223bd	Fix process for generating embeddings for Notion entries (#648 ) * Fix process for generating embeddings for Notion entries * If no title field found, just log a warning and set the title to	2024-02-20 13:46:56 -08:00
Debanjum Singh Solanky	4722da9642	Only enable API token, Whatsapp cards on Web UI when Stripe, Twilio setup	2024-02-16 17:41:09 +05:30
Debanjum Singh Solanky	cf4a524988	Move production dependencies to prod python packages group This will reduce khoj dependencies to install for self-hosting users - Move auth production dependencies to prod python packages group - Only enable authentication API router if not in anonymous mode - Improve error with requirements to enable authentication when not in anonymous mode	2024-02-16 17:41:08 +05:30
Debanjum Singh Solanky	d7dbb715ef	Fix docs links in khoj introductory chat message	2024-02-13 22:38:03 +05:30
sabaimran	32ec54172e	Add additional personalization in Chat via Location, Username (#644 ) * Add location metadata to chat history * Add support for custom configuration of the user name * Add region, country, city in the desktop app's URL for context in chat * Update prompts to specify user location, rather than just location. * Add location data to Obsidian chat query * Use first word for first name, last word for last name when setting profile name	2024-02-13 17:05:13 +05:30
sabaimran	a3eb17b7d4	Have Khoj dynamically select conversation command(s) in chat (#641 ) * Have Khoj dynamically select which conversation command(s) are to be used in the chat flow - Intercept the commands if in default mode, and have Khoj dynamically guess which tools would be the most relevant for answering the user's query * Remove conditional for default to enter online search mode * Add multiple-tool examples in the prompt, make prompt for tools more specific to info collection	2024-02-11 17:11:32 +05:30
sabaimran	69344a6aa6	Add support for multiple chat sessions in the desktop application (#639 ) * Add chat sessions to the desktop application * Increase width of the main chat body to 90vw * Update the version of electron * Render the default message if chat history fails to load * Merge conversation migrations and fix slug setting * Update the welcome message, use the hostURL, and update background color for chat actions * Only update the window's web contents if the page is config	2024-02-11 16:05:28 +05:30
sabaimran	1412ed6a00	Support multiple chat sessions within the web UI (#638 ) * Enable support for multiple chat sessions within the web client - Allow users to create multiple chat sessions and manage them - Give chat session slugs based on the most recent message - Update web UI to have a collapsible menu with active chats - Move chat routes into a separate file * Make the collapsible side panel more graceful, improve some styling elements of the new layout * Support modification of the conversation title - Add a new field to the conversation object - Update UI to add a threedotmenu to each conversation * Get the default conversation if a matching one is not found by id	2024-02-11 15:48:28 +05:30
Debanjum Singh Solanky	70f74cde68	Fix timestamps to separate each logline. Info log response start time	2024-02-07 20:45:16 +05:30
Debanjum Singh Solanky	8e5db72140	Release Khoj version 1.5.1	2024-02-06 23:09:33 +05:30
Debanjum	fc1b8f6fb6	Fix Khoj Obsidian plugin on Obsidian Mobile (#635 ) - Removed node-fetch dependency to work on mobile. - Fix CORS issue for Khoj (streaming) chat on Obsidian mobile - Verified Khoj plugin, search, chat work on Obsidian mobile. ## Details ### Major - Allow calls to Khoj server from Obsidian mobile app to fix CORS issue - Chat stream using default `fetch' not `node-fetch' in obsidian plugin ### Minor - Load chat history after other elements in chat modal on Obsidian are rendered - Scroll to bottom of chat modal on Obsidian across mobile & desktop	2024-02-06 22:03:51 +05:30
Debanjum Singh Solanky	fd238ff792	Load chat history after other elements in chat modal on Obsidian rendered This reduces laggy feeling due to latency of loading chat history from server	2024-02-06 21:25:43 +05:30
Debanjum Singh Solanky	e06a0c6ae0	Scroll to bottom of chat modal on Obsidian across mobile & desktop Put logic into single reused function	2024-02-06 21:25:43 +05:30
Debanjum Singh Solanky	07dc04f40e	Allow calls to Khoj server from Obsidian mobile app to fix CORS issue - Obsidian mobile uses capacitor js. Requests from it have origin as http://localhost on Android and capacitor://localhost on iOS - Allow those Obsidian mobile origins in CORS middleware of server	2024-02-06 21:25:43 +05:30
Debanjum Singh Solanky	dd4cf66be1	Improve offline chat system prompt to think step by step	2024-02-06 20:23:19 +05:30
Debanjum Singh Solanky	035165b534	Make offline chat model current date aware. Improve system prompts - Can now expect date awareness chat quality test to pass - Prevent offline chat model from printing verbatim user Notes and special tokens - Make it ask follow-up questions if it needs more context	2024-02-06 20:23:19 +05:30
Debanjum Singh Solanky	447904f0ab	Chat stream using default `fetch' not` node-fetch' in obsidian plugin Plugins using NodeJS libraries like `node-fetch' don't work on Obsidian mobile	2024-02-06 03:03:42 +05:30
Debanjum Singh Solanky	ba79334863	Only log number of day old user requests, not the complete dictionary	2024-02-02 10:33:31 +05:30
Debanjum Singh Solanky	1c6f1d94f5	Fix styling of Whatsapp card & notify banner in config page of web app - Put Whatsapp card back in Client section. - Fixes side spacing on cards - Improve Whatsapp card row gaps - Hide notification banner on web app load. Previously it showed up as a yellow dot on smaller displays	2024-02-01 22:59:57 +05:30
sabaimran	4daac334bc	Fix subscription state detection for users based on phone numbers, emails (#633 ) * Fix subscription state detection for users based on phone numbers, emails * Fix unit tests for api_user4 * Use a single method for determining subscription from user * Pass user object, rather than user.email for getting subscription state	2024-01-31 07:48:55 +05:30
sabaimran	fc4b57d9f6	Revert styling for white-space pre-line in the chat views as it looks bad	2024-01-29 18:29:54 +05:30
sabaimran	da854703aa	Release Khoj version 1.5.0	2024-01-29 18:05:10 +05:30
Debanjum	d1bfb245df	Improve Khoj Chat and Settings UI (#630 ) * Fix license in pyproject.toml. Remove unused utils.state import * Use single debug mode check function. Disable telemetry in debug mode - Use single logic to check if khoj is running in debug mode. Previously there were 3 different variants of the check - Do not log telemetry if KHOJ_DEBUG is set to true. Previously didn't log telemetry even if KHOJ_DEBUG set to false * Respect line breaks in user, khoj chat messages to improve formatting * Disable Whatsapp config section on web client if Twilio not configured Simplify Whatsapp configuration status checking js by standardizing external input to lower case * Disable Phone API when Twilio not setup and rate limit calls to it - Move phone api to separate router and only enable it if Twilio enabled - Add rate-limiting to OTP and verification calls * Add slugs for phone rate limiting --------- Co-authored-by: sabaimran <narmiabas@gmail.com>	2024-01-29 18:03:43 +05:30
sabaimran	4fb8d5c6d4	Store rate limiter-related metadata in the database for more resilience (#629 ) * Store rate limiter-related metadata in the database for more resilience - This helps maintain state even between server restarts - Allows you to scale up workers on your service without having to implement sticky routing * Make the usage exceeded message less abrasive * Fix rate limiter for specific conversation commands and improve the copy	2024-01-29 15:27:06 +05:30
sabaimran	71cbe5160d	Add retries in case the embeddings API fails (#628 ) * Add retries in case the embeddings API fails * Improve error handling in the inference endpoint API request handler - retry only if HTTP exception - use logger to output information about errors	2024-01-29 15:26:34 +05:30
sabaimran	b782683e60	Scrape results from Serper results using Olostep (#627 ) * Initailize changes to incporate web scraping logic after getting SERP results - Do some minor refactors to pass a symptom prompt to the openai model when making a query - integrate Olostep in order to perform the webscraping * Fix truncation error with new line, fix typing in olostep code * Use the authorization header for the token * Add a small hint/indicator for how to use Khojs other modalities in the welcome prompt * Add more detailed error message if Olostep query fails * Add unit tests which invoke Olostep in chat director * Add test for olostep tool	2024-01-29 14:16:50 +05:30
sabaimran	360b59cdb2	Add handling for None field values in logs and make telemetry upload more frequent	2024-01-26 00:00:55 +05:30
sabaimran	737fb6417b	Revert none checking in telemetry logs	2024-01-25 23:48:09 +05:30
sabaimran	211c5623e8	Improve error handling for telemetry uploads - Use response.raise_for_status when telemetry upload files - Do not send null packets to the destination server	2024-01-25 20:40:42 +05:30
Debanjum Singh Solanky	098a8e4fb1	Fix evaluating connected to server status in Obsidian plugin Only show welcome status message when khojApiKey not set and khojUrl set to khoj cloud	2024-01-25 18:04:29 +05:30
Debanjum Singh Solanky	1c52ddf792	Bump up server side content indexing interval to ~1 day Reduce server side indexing load and API request failures	2024-01-25 13:33:34 +05:30
sabaimran	0fba1e27c5	Add hint to input text for using slash commands	2024-01-25 11:56:56 +05:30
sabaimran	da6cd5ddc4	Improve subqueries for online search and prompt generation for image (#626 ) * Improve subqueries for online search and prompt generation for image - Include conversation history so that subqueries or intermediate prompts are generated with the appropriate context	2024-01-24 17:42:59 +05:30
sabaimran	dbdca7d8d1	Disable swagger UI docs in production	2024-01-24 15:23:39 +05:30
sabaimran	ddf6fd9c09	Remove valid number alert	2024-01-23 17:57:27 +05:30
Debanjum Singh Solanky	17107a0337	Release Khoj version 1.4.0	2024-01-23 10:18:31 +05:30
sabaimran	679db51453	Add support for phone number authentication with Khoj (part 2) (#621 ) * Allow users to configure phone numbers with the Khoj server * Integration of API endpoint for updating phone number * Add phone number association and OTP via Twilio for users connecting to WhatsApp - When verified, store the result as such in the KhojUser object * Add a Whatsapp.svg for configuring phone number * Change setup hint depending on whether the user has a number already connected or not * Add an integrity check for the intl tel js dependency * Customize the UI based on whether the user has verified their phone number - Update API routes to make nomenclature for phone addition and verification more straightforward (just /config/phone, etc). - If user has not verified, prompt them for another verification code (if verification is enabled) in the configuration page * Use the verified filter only if the user is linked to an account with an email * Add some basic documentation for using the WhatsApp client with Khoj * Point help text to the docs, rather than landing page info * Update messages on various callbacks and add link to docs page to learn more about the integration	2024-01-22 18:14:58 -08:00
sabaimran	58bf917775	Update the font used across Khoj desktop and web to be Tajawal (#622 )	2024-01-20 23:13:33 +05:30
Debanjum	679f0f24a4	Improve Chat Input Pane Actions. Move to 1 Click Audio Chat on Mobile (#624 ) ## Major ### Move to single click audio chat UX on Obsidian, Desktop, Web clients New default UX has 1 long-press on mobile, 2-click on desktop to send transcribed audio message - New Audio Chat Flow 1. Record audio while microphone button pressed 2. Show auto-send 3s countdown timer UI for audio chat message Provide a visual cue around send button for how long before audio message is automatically sent to Khoj for response 3. Auto-send msg in 3s unless stop send message button clicked - Why - Removes the previous default of 3 clicks required to send audio message The record > stop > send process to send audio messages was unclear and effortful - Still allows stopping message from being sent, to make correction to transcribed audio - Removes inadvertent long audio transcriptions if forget to press stop while recording ### Improve chat input pane actions & icons on Obsidian. Desktop, Web clients - Use SVG icons in chat footer on web, desktop app - Move delete icon to left of chat input. This makes it harder to inadvertently click it - Add send button to chat input pane - Color chat message send button to make it primary CTA - Make chat footer shorter. Use no or round border on action buttons ## Minor - Stop rendering empty starter questions element when no questions present - Add round border, hover color to starter questions in web, desktop apps - Fix auto resizing chat input box when transcribed text added - Convert chat input into a text area in the Obsidian client	2024-01-20 21:52:56 +05:30
Debanjum Singh Solanky	ec3b837d00	Send audio message in 2-clicks on desktop to avoid holding down mic button	2024-01-20 21:40:38 +05:30
Debanjum Singh Solanky	f0daa45ae0	Move to single click audio chat UX on Obsidian client - Capabillity New default UX has 1 long-press to send transcribed audio message - Removes the previous default of 3 clicks required to send audio message - The record > stop > send process to send audio messages was unclear - Still allows stopping message from being sent, if users want to make correction to transcribed audio - Removes inadvertent long audio transcriptions if user forgets to press stop when recording - Changes - Record audio while microphone button pressed - Show auto-send 3s countdown timer UI for audio chat message Provide a visual cue around send button for how long before audio message is automatically sent to Khoj for response - Auto-send msg in 3s unless stop send message button clicked	2024-01-20 16:07:12 +05:30
Debanjum Singh Solanky	29a581d2b0	Move to single click audio chat UX on desktop app - Capabillity New default UX has 1 long-press to send transcribed audio message - Removes the previous default of 3 clicks required to send audio message - The record > stop > send process to send audio messages was unclear - Still allows stopping message from being sent, if users want to make correction to transcribed audio - Removes inadvertent long audio transcriptions if user forgets to press stop when recording - Changes - Record audio while microphone button pressed - Show auto-send 3s countdown timer UI for audio chat message Provide a visual cue around send button for how long before audio message is automatically sent to Khoj for response - Auto-send msg in 3s unless stop send message button clicked	2024-01-20 16:03:51 +05:30
Debanjum Singh Solanky	699e9ff878	Move to single click audio chat UX on web app - Capabillity New default UX has 1 long-press to send transcribed audio message - Removes the previous default of 3 clicks required to send audio message - The record > stop > send process to send audio messages was unclear - Still allows stopping message from being sent, if users want to make correction to transcribed audio - Removes inadvertent long audio transcriptions if user forgets to press stop when recording - Changes - Record audio while microphone button pressed - Show auto-send 3s countdown timer UI for audio chat message Provide a visual cue around send button for how long before audio message is automatically sent to Khoj for response - Auto-send msg in 3s unless stop send message button clicked	2024-01-20 15:56:46 +05:30
Debanjum Singh Solanky	26bd3533d8	Stop rendering empty starter questions element when no questions present	2024-01-20 11:39:58 +05:30
Debanjum Singh Solanky	7c8c475c3a	Add round border, hover color to starter questions in web, desktop apps	2024-01-20 00:51:11 +05:30
Debanjum Singh Solanky	8a488b9e39	Fix auto resizing chat input box when transcribed text added	2024-01-20 00:48:56 +05:30
Debanjum Singh Solanky	07ca137bdf	Convert chat input into a text area in the Obsidian client This allows for better readability of multi-line messages by users. The chat input is a text area in the other clients as well.	2024-01-20 00:48:56 +05:30
Debanjum Singh Solanky	d4552117f6	Add and improve chat input pane, actions, icons on Obsidian client - Move delete icon to left of chat input. This makes it harder to inadvertently click - Add send button to chat footer. Enter being the only way to send messages is not intuitive, outside standard modern UI patterns - Color chat message send button to make it primary CTA on web client - Make chat footer shorter. Use no or round border on action buttons	2024-01-20 00:48:56 +05:30
Debanjum Singh Solanky	c0ad64d9a3	Add and improve chat input pane, actions, icons on desktop client - Use SVG icons in chat footer on web - Move delete icon to left of chat input. This makes it harder to inadvertently click - Add send button to chat footer. Enter being the only way to send messages is not intuitive, outside standard modern UI patterns - Color chat message send button to make it primary CTA on web client - Make chat footer shorter. Use no or round border on action buttons	2024-01-20 00:29:49 +05:30
Debanjum Singh Solanky	ea85ebdacb	Add and improve chat input pane, actions, icons on web client - Use SVG icons in chat footer on web - Move delete icon to left of chat input. This makes it harder to inadvertently click - Add send button to chat footer. Enter being the only way to send messages is not intuitive, outside standard modern UI patterns - Color chat message send button to make it primary CTA on web client - Make chat footer shorter. Use no or round border on action buttons	2024-01-19 20:40:42 +05:30
sabaimran	039ed78253	Add support for a first-party client app to call into Khoj (Part 1) (#601 ) * Add support for a first party client app - Based on a client id and client secret, allow a first party app to call into the Khoj backend with a phone number identifier - Add migration to add phone numbers to the KhojUser object * Add plus in front of country code when registering a phone number. - Decrease free tier limit to 5 (from 10) - Return a response object when handling stripe webhooks * Fix telemetry method which references authenticated user's client app * Add better error handling for null phone numbers, simplify logic of authenticating user * Pull the client_secret in the API call from the authorization header * Add a migration merge to resolve phone number and other changes	2024-01-18 19:24:14 +05:30
Debanjum Singh Solanky	9dfe1bb003	Fix updating subscription when invoice paid. Revert renewal_date logic The actual issue was that `get_or_create_user_by_email' tried to create a subscription even if it already existed. With updated logic: - New subscription is only created when it doesn't already exist in `get_or_create_user_by_email' - `set_user_subscription' just updates the subscription state as user subscription object creation is already managed by `get_or_create_user_by_email'. So the other conditionals are unnecessary	2024-01-18 16:20:18 +05:30
Debanjum Singh Solanky	9b1a66c969	Fix updating subscription renewal date when invoice paid	2024-01-18 14:46:10 +05:30
sabaimran	93d5cb128c	Initialize embeddings to empty list before processing	2024-01-18 13:27:04 +05:30
Debanjum Singh Solanky	24af888c41	Release Khoj version 1.3.0	2024-01-18 11:42:13 +05:30
Debanjum	8b4dd16255	Fix markdownRenderer arg to allow chat responses in Obsidian plugin (#619 ) - Issue Users with Dataview plugin would have error as its markdown post-processor expects the sourcePath to be a string This prevents Khoj from responding to chat messages in the Obsidian chat modal. Search via Obsidian still works but it throws the same dataview plugin error - Fix Pass a string as sourcePath to markdownRenderer to fix failing chat response and stop throwing dataview errors on search Resolves #614, Resolves #606	2024-01-18 10:18:31 +05:30
Debanjum	c8dbe8ee7b	Improve server status check and message in Obsidian client (#617 ) - Update health API to pass authenticated users their info - Improve Khoj server status check in Khoj Obsidian client - Show Khoj Obsidian commands even if no connection to server - Show Khoj chat by default in Obsidian side pane instead of search	2024-01-18 10:17:35 +05:30
Debanjum Singh Solanky	f9420e1209	Show Khoj Obsidian commands even if no connection to server Server connection check can be a little flaky in Obsidian. Don't gate the commands behind it to improve usability of Khoj. Previously the commands would get disabled when server connection check failed, even though server was actually accessible	2024-01-18 10:09:20 +05:30
Debanjum Singh Solanky	36bf42a860	Show Khoj chat by default in Obsidian side pane instead of search	2024-01-18 10:09:20 +05:30
Debanjum Singh Solanky	aab75a6ead	Improve Khoj server status check in Khoj Obsidian client - Update server connection status on every edit of khoj url, api key in settings instead of only on plugin load The error message was stale if connection fixed after changes in Khoj plugin settings to URL or API key, like on plugin install - Show better welcome message on first plugin install. Include API key setup instruction - Show logged in user email on Khoj settings page	2024-01-18 10:09:20 +05:30
Debanjum Singh Solanky	1a46734485	Fix markdownRenderer arg to allow chat responses in Obsidian plugin - Issue: Users with Dataview plugin would have error as its markdown post-processor expects the sourcePath to be a string This prevents Khoj from responding to chat messages in the Obsidian chat modal. Search via Obsidian still works but it throws the same dataview error - Fix: Pass a string as sourcePath to markdownRenderer to fix failing chat response Resolves #614, Resolves #606	2024-01-18 10:02:50 +05:30
sabaimran	e9e49ea098	Allow custom inference endpoint for the crossencoder model (#616 ) * Add support for custom inference endpoints for the cross encoder model - Since there's not a good out of the box solution, I've deployed a custom model/handler via huggingface to support this use case. * Use langchain.community for pdf, openai chat modules * Add an explicit stipulation that the api endpoint for crossencoder inference should be for huggingface for now	2024-01-18 10:02:12 +05:30
Debanjum Singh Solanky	870af19ba4	Update health API to pass authenticated users their info This allows Khoj clients to get email address associated with user's API token for display in client UX In anonymous mode, default user information is passed	2024-01-17 13:38:57 +05:30
Debanjum	4d30f7d1d9	Short-circuit API rate limiter for unauthenticated users (#607 ) ### Major - Short-circuit API rate limiter for unauthenticated user Calls by unauthenticated users were failing at API rate limiter as it failed to access user info object. This is a bug. API rate limiter should short-circuit for unauthenicated users so a proper Forbidden response can be returned by API Add regression test to verify that unauthenticated users get 403 response when calling the /chat API endpoint ### Minor - Remove trailing slash to normalize khoj url in obsidian plugin settings - Move used /api/config API controllers into separate module - Delete unused /api/beta API endpoint - Fix error message rendering in khoj.el, khoj obsidian chat - Handle deprecation warnings for subscribe renew date, langchain, pydantic & logger.warn	2024-01-17 00:59:52 +05:30
Debanjum Singh Solanky	2752e0d607	Update jinja2 and axios min supported package versions	2024-01-16 18:45:38 +05:30
Debanjum Singh Solanky	7039c202c8	Merge branch 'master' into short-circuit-api-rate-limiter	2024-01-16 18:18:34 +05:30
Debanjum Singh Solanky	8917228dbb	Remove unused, deprecated /api/config/data API endpoints - Use /api/health for server up check instead of api/config/default - Remove unused `khoj--post-new-config' method - Remove the now unused /config/data GET, POST API endpoints	2024-01-16 18:15:06 +05:30
Debanjum Singh Solanky	6ded4c1d75	Merge branch 'master' into fix-1000-file-index-update-limit	2024-01-16 16:50:58 +05:30
Debanjum Singh Solanky	16175137e5	Decode URL encoded query string in chat API endpoint before processing	2024-01-16 13:09:28 +05:30
Debanjum Singh Solanky	9fe1c8ae13	Make references and online_results optional params to converse_offline Fixes all the failing GPT4All tests because they were missing the online_results argument	2024-01-16 13:09:28 +05:30
Debanjum Singh Solanky	d74f8e03d3	Pass max context length to fix using updated GPT4All.list_gpu method It's signature was updated in GPT4All 2.1.0 pypi release. Resolves #610	2024-01-16 12:23:45 +05:30
Debanjum Singh Solanky	1ae6669fbf	Correctly handle API response when no files to index	2024-01-16 11:57:40 +05:30
sabaimran	50575b749b	Add option to use HuggingFace's inference endpoint for generating embeddings (#609 ) * Support using hosted Huggingface inference endpoint for embeddings generation * Since the huggingface inference endpoint is model-specific, make the URL an optional property of the search model config * Handle ECONNREFUSED error in desktop app * Drive API key via the search model config model and use more generic names	2024-01-16 08:58:24 +05:30
Debanjum Singh Solanky	ba37b28fb5	Improve batched error handling. Catch can't connect to server error Break out of batch processing when unable to connect to server or when requests throttled by server	2024-01-14 01:04:44 +05:30
Debanjum Singh Solanky	7dfbcd2e5a	Handle subscribe renew date, langchain, pydantic & logger.warn warnings - Ensure langchain less than 0.2.0 is used, to prevent breaking ChatOpenAI, PyMuPDF usage due to their deprecation after 0.2.0 - Set subscription renewal date to a timezone aware datetime - Use logger.warning instead of logger.warn as latter is deprecated - Use `model_dump' not deprecated dict to get all configured content_types	2024-01-12 01:46:52 +05:30
Debanjum Singh Solanky	5f97357fe0	Delete unused /api/beta API endpoint	2024-01-12 01:11:05 +05:30
Debanjum Singh Solanky	bb1c1b39d8	Move /api/config API controllers into separate module for code modularity	2024-01-12 01:11:04 +05:30
Debanjum Singh Solanky	ba99089a12	Short-circuit API rate limiter for unauthenticated user Calls by unauthenticated users were failing at API rate limiter as it failed to access user info object. This is a bug. API rate limiter should short-circuit for unauthenicated users so a proper Forbidden response can be returned by API Add regression test to verify that unauthenticated users get 403 response when calling the /chat API endpoint	2024-01-12 00:23:50 +05:30
Debanjum Singh Solanky	b1269fdad2	Remove trailing slash to normalize khoj url in obsidian plugin settings	2024-01-11 21:56:36 +05:30
Debanjum Singh Solanky	ffdb291fe0	Fix error message rendering in khoj.el, khoj obsidian chat - Fix failed to index error message in khoj.el - Fix chat model not configured message in khoj obsidian chat	2024-01-11 21:55:54 +05:30
Debanjum Singh Solanky	af9ceb00a0	Show relevant error msg in desktop app, e.g when can't connect to server	2024-01-09 23:09:34 +05:30
Debanjum Singh Solanky	43423432ce	Pass indexed filenames in API response for client validation	2024-01-09 23:09:34 +05:30
Debanjum Singh Solanky	5f9ac5a630	Collect files to index in single dict to simplify index/update controller Simplifies code while maintaining typing	2024-01-09 23:09:34 +05:30
Debanjum Singh Solanky	efe41aaaca	Push 1000 files at a time from the Desktop client for indexing FastAPI API endpoints only support uploading 1000 files at a time. So split all files to index into groups of 1000 for upload to index/update API endpoint	2024-01-09 23:09:34 +05:30
Debanjum Singh Solanky	b6d5392c0c	Release Khoj version 1.2.1	2024-01-04 18:45:37 +05:30
Debanjum Singh Solanky	fca7a5ff32	Push 1000 files at a time from the Obsidian client for indexing FastAPI API endpoints only support uploading 1000 files at a time. So split all files to index into groups of 1000 for upload to index/update API endpoint	2024-01-04 18:43:22 +05:30
Debanjum Singh Solanky	4a234c8db3	Use default offline/openai chat model to extract DB search queries Make usage of the first offline/openai chat model as the default LLM to use for background tasks more explicit The idea is to use the default/first chat model for all background activities, like user message to extract search queries to perform. This is controlled by the server admin. The chat model set by the user is used for user-facing functions like generating chat responses	2024-01-03 14:04:49 +05:30
Debanjum Singh Solanky	e28adf2884	Also index pdf, markdown and plaintext files using khoj emacs client Previously you could only index org-mode files and directories from khoj.el Mark the `khoj-org-directories', `khoj-org-files' variables for deprecation, since `khoj-index-directories', `khoj-index-files' replace them as more appropriate names for the more general case Resolves #597	2024-01-03 11:46:17 +05:30
Debanjum Singh Solanky	5abaed9d08	Use user chosen OpenAI model to extract DB search questions from query Previously Khoj was selecting the first OpenAI model configured on server and not the OpenAI model configured by the user for themselves	2024-01-03 11:45:06 +05:30
Debanjum Singh Solanky	05536aab6b	Merge how users can share personal information in personality prompt	2024-01-03 11:40:14 +05:30
Liam Swayne	455f78b178	Replace var declarations with let declarations (#576 ) * Replace var declaration with let declaration	2023-12-29 10:20:48 +05:30
sabaimran	79913d4c17	Add isort to the pre-commit configuration and apply it to the whole project (#595 ) * Apply isort to the entire repository * Fix missing import issues in text_to_entries * Fix imports in migration files	2023-12-28 18:04:02 +05:30
sabaimran	442c913de3	Update telemetry state for search model only if one is found, fix alt text for language setting	2023-12-28 12:53:53 +05:30
sabaimran	d3ab3f1b70	Rename matrix_blog to web and move the language setting into the content section	2023-12-28 12:44:49 +05:30
sabaimran	00af6baeb6	Resolve merge conflicts with intro message in chat.html web view	2023-12-23 17:52:58 +05:30
sabaimran	afec4394f9	Merge pull request #592 from ayushjha119/Fixed-Health-Check-to-Khoj-api Fixed health check to khoj api	2023-12-23 13:04:50 +05:30
sabaimran	c50eb8a691	Fix mypy/pre-commit issues	2023-12-23 11:44:37 +05:30
Debanjum Singh Solanky	21c55b4c0d	Release Khoj version 1.2.0	2023-12-22 21:43:47 +05:30
Debanjum Singh Solanky	6a8c1fe423	Sanitize rendering chat references in Web, Desktop and Obsidian clients Use textContent instead of innerHTML to append references Resolves #583	2023-12-22 18:11:49 +05:30
Debanjum	6879daccc6	Fix Chat Streaming on Obsidian, Docker Image Version and First-Run, Chat Error Messages in Clients (#589 ) - Fix streaming chat response in Obsidian client - Fix first-run, chat error message in obsidian, desktop and web clients - Set Khoj app version to latest version in Docker images - Tag Khoj Docker image built on release with the `latest` tag This align docker image release cadence with client, server releases	2023-12-22 04:13:01 -08:00
Debanjum Singh Solanky	d101297995	Use markdown formatted chat message in chat modal	2023-12-22 17:01:31 +05:30
Debanjum Singh Solanky	350fd89c8d	Clear chat history html in Obsidian if getChatHistory works too	2023-12-22 17:01:31 +05:30
ayushjha119	e487ec5370	fixed app to api health Check	2023-12-21 17:51:30 +05:30
Debanjum Singh Solanky	70607cbbbb	Update FRE message to get any Khoj client to sync files with server	2023-12-21 15:23:47 +05:30
ayushjha119	b3d7d6a79d	used the Response class from fastapi.responses and set the input for status_code to 200	2023-12-21 14:26:40 +05:30
sabaimran	e1aaff2053	Add more details about functionality in Khoj's intro message	2023-12-21 10:09:30 +05:30
sabaimran	a1211f40d7	Fix type declaration for the cross_encoder_model state variable. Update name of the new update API	2023-12-21 09:15:13 +05:30
sabaimran	089e4bee12	FIx unit tests with new search model configurations	2023-12-20 21:50:44 +05:30
Debanjum Singh Solanky	447c1b90e7	Fix streaming chat response in Obsidian client - Convert renderIncrementalMessage to an async method as MarkdownRenderer is an async method - Simplify code, remove unneeded JSON check	2023-12-20 14:51:19 +05:30
sabaimran	aa23da60a3	Add a notification banner to show temporary messages	2023-12-20 14:22:08 +05:30
Debanjum Singh Solanky	e04fe921eb	Fix first-run, chat error message in obsidian, desktop and web clients - Disable chat input field if getChatHistory had error as Khoj may not be setup correctly to chat	2023-12-20 14:03:07 +05:30
sabaimran	5ff9df9d4c	Add support per user for configuring the preferred search model from the config page - Honor this setting across the relevant places where embeddings are used - Convert the VectorField object to have None for dimensions in order to make the search model easily configurable	2023-12-20 13:25:43 +05:30
sabaimran	0f6e4ff683	Add a model that specifies the user's search model configuration - Update all endpoints that generate embeddings to use the new model. Incl. generating text embeddings, creating embeddings for a search query	2023-12-20 09:22:26 +05:30
sabaimran	6dd2b05bf5	Rebase with master	2023-12-19 21:02:49 +05:30
sabaimran	e3557cd8b7	Update the personality prompt to make Khoj aware that users can share data via the desktop app	2023-12-19 16:42:45 +05:30
sabaimran	927e477f68	Ignore typing error in custom action short description	2023-12-19 16:10:58 +05:30
sabaimran	946305d977	Add function to export conversations for debugging	2023-12-19 16:05:20 +05:30
sabaimran	903a01745f	Use 0px for padding for input row buttons in web	2023-12-18 16:09:06 +05:30
sabaimran	5b092d59f4	Ignore dict assignment typing error	2023-12-17 22:34:54 +05:30
sabaimran	03cb86ee46	Update typing and object assignment for new text to image method return	2023-12-17 21:28:33 +05:30
sabaimran	0288804f2e	Render the inferred query along with the image that Khoj returns	2023-12-17 21:02:55 +05:30
sabaimran	49af2148fe	Miscellaneous improvements to image generation - Improve the prompt before sending it for image generation - Update the help message to include online, image functionality - Improve styling for the voice, trash buttons	2023-12-17 20:25:35 +05:30
sabaimran	7cb64cb2f9	Add telemetry for image generation conversation command	2023-12-17 18:25:03 +05:30
sabaimran	09544dee09	Add TextToImageModelConfig to the admin page	2023-12-17 16:44:19 +05:30
sabaimran	0459666beb	CSRF Cookie not set error in prod. Try fixing https forwarding for mitigation	2023-12-17 12:55:18 +05:30
sabaimran	61dde8ed89	If text to image config isn't set, send back an error message to the client	2023-12-17 12:54:50 +05:30
sabaimran	3065cea562	Address mypy typing issues	2023-12-16 09:24:26 +05:30
sabaimran	5f6dcf9f2e	Add a rate limiter for the transcribe API endpoint	2023-12-16 09:18:56 +05:30
sabaimran	73a107690d	Add a ConversationCommand rate limiter for the chat endpoint	2023-12-16 09:03:52 +05:30
sabaimran	9b961ed496	Merge pull request #580 from khoj-ai/fix-upgrade-chat-to-create-images Support Image Generation with Khoj	2023-12-07 21:17:58 +05:30
Debanjum Singh Solanky	7504669f2b	Fix rendering image on chat response in obsidian client	2023-12-05 03:48:07 -05:00
Debanjum Singh Solanky	408b7413e9	Use global openai client for transcribe, image	2023-12-05 03:36:33 -05:00
Debanjum Singh Solanky	162b219f2b	Throw unsupported error when server not configured for image, speech-to-text	2023-12-05 01:51:14 -05:00
Debanjum Singh Solanky	8f2f053968	Fix rendering image on chat response in web, desktop client	2023-12-05 01:51:14 -05:00
Debanjum Singh Solanky	d124266923	Reduce promise based nesting in chat JS func used in desktop, web client Use async/await to reduce .then() based nesting to improve code readability	2023-12-05 01:51:14 -05:00
Debanjum Singh Solanky	6e3f66c0f1	Use base64 encoded image instead of source URL for persistence The source URL returned by OpenAI would expire soon. This would make the chat sessions contain non-accessible images/messages if using OpenaI image URL Get base64 encoded image from OpenAI and store directly in conversation logs. This resolves the image link expiring issue	2023-12-05 01:51:14 -05:00
Debanjum Singh Solanky	52c5f4170a	Show generated images in the chat modal of the Khoj Obsidian plugin	2023-12-05 01:51:14 -05:00
Debanjum Singh Solanky	8016a57b5e	Show generated images in chat interface on Desktop client	2023-12-05 01:51:14 -05:00
Debanjum Singh Solanky	cc051ceb4b	Show generated images in chat interface on Web client	2023-12-05 01:51:14 -05:00
Debanjum Singh Solanky	252b35b2f0	Support /image slash command to generate images using the chat API	2023-12-05 01:51:14 -05:00
sabaimran	ef21d78c99	Initial changes to support multiple search model configurations - All search models are loaded into memory, and stored in a dictionary indexed by name - Still need to add database migrations and create a UI for user to select their choice. Presently, it uses the default option	2023-12-05 00:35:40 -05:00
Debanjum Singh Solanky	1d9c1333f2	Configure text to image models available on server - Currently supports OpenAI text to image model, by default dall-e-3 - Allow setting the text to image model via CLI during server setup	2023-12-04 21:27:53 -05:00
Debanjum Singh Solanky	f0222f6d08	Make save_to_conversation_log helper function reusable - Move it out to conversation.utils from generate_chat_response function - Log new optional intent_type argument to capture type of response expected. This can be type responses by Khoj e.g speech, image. It can be used to render responses by Khoj appropriately on clients - Make user_message_time argument optional, set the time to now by default if not passed by calling function	2023-12-04 19:42:12 -05:00
sabaimran	d2ddbef08f	Use a unique name for the temp PDF generated	2023-12-04 19:27:00 -05:00
sabaimran	d20746613a	Properly filter out empty PDFs for indexing	2023-12-04 16:15:17 -05:00
Debanjum Singh Solanky	316b7d471a	Handle offline chat model retrieval when no internet Offline chat shouldn't fail on retrieve_model when no internet, if model was previously downloaded and usable offline	2023-12-04 13:46:25 -05:00
Debanjum Singh Solanky	2b09caa237	Make online results an optional argument to the gpt converse method	2023-12-04 12:15:29 -05:00
Debanjum Singh Solanky	7009793170	Migrate to OpenAI Python library >= 1.0	2023-12-03 18:16:00 -05:00
sabaimran	cc064ea57d	Fix circular import issue	2023-12-03 17:46:44 -05:00
sabaimran	21f8d63e89	If a user subscribes to Khoj with an email address that's not present in the DB, create an account	2023-12-03 17:28:40 -05:00
sabaimran	c5d297a9ed	Recursively search through folders for indexing	2023-12-03 16:17:28 -05:00
Debanjum Singh Solanky	a57d529f39	Fix path to system tray icon of Khoj desktop app	2023-12-03 00:12:50 -08:00
Debanjum Singh Solanky	106cdbe455	Release Khoj version 1.1.0	2023-11-30 20:09:08 -08:00
Debanjum Singh Solanky	10ce4ee11c	Ignore null params type check for markdown renderer in Obsidian client	2023-11-30 20:09:08 -08:00
sabaimran	a5ffa2342f	Add documentation for local setup and fix admin panel bugs - Wasn't able to login to the admin panel when KHOJ_DEBUG was not True. Fix this error so self-hosted users can get unblocked from accessing the admin settings - Don't force users to set their KHOJ_DJANGO_SECRET_KEY	2023-11-30 17:55:27 -08:00
Debanjum Singh Solanky	d587632700	Clear result before render thinking placeholder emoji in Obsidian chat	2023-11-30 13:53:09 -08:00
Debanjum Singh Solanky	48719ee0dd	Render newline separation in chat references to improve readability	2023-11-30 13:16:48 -08:00
Debanjum Singh Solanky	1a31a2efcf	Render Khoj chat streaming response as md & show refs in Obsidian - Use new style references for Khoj chat modal in Obsidian - Khoj Chat responses in Obsidian had regressed to not show references for new questions after modal has been opened. Now even those are rendered, and use new references style - Render chat response as markdown while it's being streamed	2023-11-30 13:02:00 -08:00
Debanjum Singh Solanky	0430fa67b6	Show temporary status message when copied to clipboard	2023-11-29 13:49:33 -08:00
Debanjum Singh Solanky	491a1a949a	Render chat responses as markdown in Desktop client too	2023-11-29 13:49:33 -08:00
Debanjum Singh Solanky	20ef5bfc93	Properly stop mediaRecorder stream to clear microphone in-use state	2023-11-29 13:48:35 -08:00
Debanjum Singh Solanky	8faa63c3c6	Convert config page buttons to use stronger yellow	2023-11-28 19:55:43 -08:00
Debanjum Singh Solanky	a6ca2076d5	Open link to Khoj app landing page from nav pane in current tab	2023-11-28 14:20:37 -08:00
Debanjum Singh Solanky	643e018947	Handle if user subscription field doesn't exists in telemetry func Avoid null ref in the method when running Khoj server in anon mode	2023-11-28 14:15:14 -08:00
Debanjum Singh Solanky	110d7646fc	Use milder yellow as primary Khoj theme color for chat, buttons etc.	2023-11-28 14:15:14 -08:00
sabaimran	18254850ab	Set a default value for the khoj django secret key and add additional guidance for setting environment variables on first run	2023-11-28 09:39:44 -08:00
sabaimran	6290b463f5	Compute size of the indexed data only if explicitly requested to avoid heavy load on the DB	2023-11-27 12:05:00 -08:00
sabaimran	eb5e3096e0	Change subscribed scope to premium	2023-11-27 11:39:20 -08:00
sabaimran	6e1ba11e59	Resolve merge conflicts for rendering chat response	2023-11-27 11:33:13 -08:00
Debanjum Singh Solanky	71f2d54258	Render chat response as markdown while streaming on Web, Desktop clients	2023-11-26 20:27:10 -08:00
Debanjum Singh Solanky	9e714d032b	Fix Khoj telemetry server. Add server_version column	2023-11-26 15:05:43 -08:00
Debanjum Singh Solanky	b249bbb5b5	Limit max audio file size allowed for transcription on API endpoint	2023-11-26 14:19:46 -08:00
Debanjum Singh Solanky	a79604b601	Fix return types of offline, online transcribe methods for python 3.9	2023-11-26 06:26:34 -08:00
Debanjum Singh Solanky	06f99ceb3c	Rename /api/speak API endpoint to /api/transcribe	2023-11-26 06:18:44 -08:00
Debanjum Singh Solanky	56a1a61c77	Remove unused button element retrieval code from web, desktop	2023-11-26 06:17:56 -08:00
Debanjum Singh Solanky	877532a167	Speak to Khoj from the Obsidian client - Add transcription button with mic icon - Collect audio recording on pressing mic - Process and send audio recording to server for transcription - Extract the functionality to flash status in chat input for reuse	2023-11-26 06:17:54 -08:00
Debanjum Singh Solanky	cc9eae5d18	Update default chat model to Mistral in GPT4AllProcessor config	2023-11-26 05:55:43 -08:00
Debanjum Singh Solanky	4636390f7f	Transcribe speech to text offline with Whisper - Allow server admin to configure offline speech to text model during initialization - Use offline speech to text model to transcribe audio from clients - Set offline whisper as default speech to text model as no setup api key reqd	2023-11-26 05:55:11 -08:00
Debanjum Singh Solanky	a0a7ab7ec8	Rename conversation.gpt4all package to conversation.offline	2023-11-26 04:19:32 -08:00
Debanjum Singh Solanky	499adf86a0	Move transcription using OpenAI API into independent package	2023-11-26 04:19:32 -08:00
Debanjum Singh Solanky	897170ab15	Use single db migration script for transcribe model, related updates	2023-11-26 04:19:32 -08:00
Debanjum Singh Solanky	28090216f6	Show transcription error status in chatInput placeholder on web, desktop - Extract flashing status message in chat input placeholder into reusable function - Use emoji prefixes for status messages - Improve alt text of transcribe button to indicate what the button does	2023-11-26 04:19:32 -08:00
Debanjum Singh Solanky	fc040825b2	Default to Offline chat with Mistral as minimal setup, no API key reqd.	2023-11-26 01:07:20 -08:00
Debanjum Singh Solanky	5a6547677c	Add type of operation variable in latest migration	2023-11-26 00:38:52 -08:00
Debanjum Singh Solanky	3e252036c3	Remove whitespace: pre-line from chat html, since markdown rendering	2023-11-26 00:27:29 -08:00
Debanjum Singh Solanky	b484795b8e	Merge branch 'master' into add-speak-to-chat - Conflicts: - src/interface/desktop/chat.html Combine and use common class names for speak component - src/khoj/database/adapters/__init__.py Combine imports - src/khoj/interface/web/chat.html Combine and use common class names for speak component - src/khoj/routers/api.py Combine imports	2023-11-26 00:26:21 -08:00
sabaimran	6233a957b4	Merge branch 'master' of github.com:khoj-ai/khoj into features/enforce-subscription-status	2023-11-25 22:46:10 -08:00
sabaimran	52b88de7f4	Indicate in the desktop if the user gets rate limited for indexing	2023-11-25 22:31:23 -08:00
Debanjum	e0a59cff68	Delete Conversation History from Web, Desktop, Obsidian Clients (#551 ) Add delete button to clear conversation history from Web, Desktop and Obsidian Khoj clients Resolves #523	2023-11-25 22:24:12 -08:00
Debanjum Singh Solanky	d0e294d8a5	Clear Conversation History from the Obsidian client - Fix font color for Khoj chat responses in Obsidian. Previous color had too low a contrast to be readable	2023-11-25 22:16:13 -08:00
sabaimran	b2afbaa315	Add support for rate limiting the amount of data indexed - Add a dependency on the indexer API endpoint that rounds up the amount of data indexed and uses that to determine whether the next set of data should be processed - Delete any files that are being removed for adminstering the calculation - Show current amount of data indexed in the config page	2023-11-25 20:28:04 -08:00
Debanjum Singh Solanky	07bf365c7c	Clear any network connections to khoj server via khoj.el on reindex - Ignore errors in deleting network requests to khoj server - Also delete open network connection to khoj server on auto reindex Otherwise when server is unreachable a bunch of failed network connections accrue in the processes list	2023-11-25 20:19:41 -08:00
sabaimran	dd1badae81	Use userwithtoken.user when authenticating with an API key	2023-11-24 22:18:45 -08:00
sabaimran	48b9116195	Fix to use user rather than user_with_token in authenticated credentials	2023-11-24 22:18:00 -08:00
sabaimran	771f9bcfa1	If the user subscription was created over 7 days ago, then their trial is expired	2023-11-24 22:08:32 -08:00
sabaimran	e5b1350523	Enforce API use limits depending on whether the server has billing enabled and whether the given user is subscribed	2023-11-24 21:55:16 -08:00
sabaimran	9c868ee10b	Use the state.billing_enabled field to determine whether to use the subscribed scope	2023-11-24 20:41:19 -08:00
sabaimran	69c8f45830	Use scopes to represent whether the use has a valid subscription in the middleware	2023-11-24 20:29:36 -08:00
Debanjum	25f3f2367e	Handle Server Unavailable Error from Khoj.el (#568 ) - Make auto-update of content index user configurable from khoj.el - Handle server unavailable error on auto-index schedule job in khoj.el Resolves #567	2023-11-24 16:46:07 -08:00
Debanjum Singh Solanky	138f4e3f3c	Make auto-update of content index user configurable from khoj.el	2023-11-24 16:40:50 -08:00
Debanjum Singh Solanky	0885fc6c23	Handle server unavailable error on auto-index schedule job in khoj.el	2023-11-24 16:39:44 -08:00
sabaimran	c13953311a	Add reflective questions to admin pages	2023-11-23 14:01:05 -08:00
sabaimran	c42ec32a95	Merge pull request #552 from khoj-ai/features/internet-enabled-search Support internet-enabled, online searching using Serper.dev	2023-11-23 12:34:05 -08:00
sabaimran	c641b8df58	Update desktop package version	2023-11-22 17:54:53 -08:00
sabaimran	a1b2289074	Release Khoj version 1.0.1	2023-11-22 17:52:07 -08:00
sabaimran	b1b037f0ea	Fix URL configuration issues with reorganized subfolders	2023-11-22 17:03:33 -08:00
sabaimran	e0949e232b	Import random in adapters file for selecting reflective question	2023-11-22 07:52:51 -08:00
sabaimran	256e8de40a	Merge with features/internet-enabled-search	2023-11-22 07:25:24 -08:00
Debanjum Singh Solanky	fd60db766e	Clear Conversation History from the Web Client	2023-11-22 03:35:00 -08:00
Debanjum Singh Solanky	d5a4830761	Clear Conversation History from the Desktop Client	2023-11-22 03:35:00 -08:00
Debanjum Singh Solanky	3096544cf2	Create API endpoint to clear user's chat history	2023-11-22 03:34:59 -08:00
Debanjum Singh Solanky	63675b3299	Speak to Khoj from the Desktop client - Use icons to style speech to text recording state	2023-11-22 02:47:17 -08:00
Debanjum Singh Solanky	2951fc92d7	Speak to Khoj from the Web client - Use icons to style speech to text recording state	2023-11-22 02:47:17 -08:00
Debanjum Singh Solanky	cc77bc4076	Create speech to text API endpoint. Use OpenAI whisper for ASR - Wrap audio transcription in try/catch and delete audio file after processing - Use configured speech to text model, else handle error	2023-11-22 02:47:06 -08:00
Debanjum Singh Solanky	1ca99b6eb0	Add speech to text model configuration to Database	2023-11-22 02:24:31 -08:00
sabaimran	c652a7fd2d	Move text_to_entries under the new content folder	2023-11-21 22:25:17 -08:00
sabaimran	1e2af083f0	Rename the data_sources module to content	2023-11-21 22:11:32 -08:00
sabaimran	4cb28aeffb	Resolve merge conflicts with master	2023-11-21 22:07:41 -08:00
Debanjum Singh Solanky	4cdfe8fc4f	Re-enable Khoj Obsidian plugin for Mobile, as Khoj cloud is available	2023-11-21 16:33:48 -08:00
Debanjum	5d9d50157e	Clean Logs, Improve Message Rendering and Make Khoj Trusted Host Configurable (#561 ) - Append chat message to chat logs as TextNodes in web, desktop clients - Simplify Code to Identify Files from Github, Notion on Web, Desktop Client - Use file source to find entries from github, notion on web, desktop client - Pass file source to clients via text search API response - Make Django Logs Follow Khoj Log Format, Verbosity - Handle image search setup related warning - Format Django initializing outputs using Khoj logger format - Use `KHOJ_HOST` env var to set allowed/trusted domains to host Khoj	2023-11-21 15:14:34 -08:00
Debanjum Singh Solanky	9e736d4340	Use KHOJ_DOMAIN for CORS allow_origins list as well - Default to app.khoj.dev - Remove unnecesary any_path regex in allow_origins. It only cares about host, paths are not set in origin header	2023-11-21 14:02:04 -08:00
sabaimran	5469e81a87	Use full path for the static directory in FastAPI and reflect deeper nesting of the django app	2023-11-21 13:44:45 -08:00
sabaimran	d199c4c35f	Resovle merge conflicts with matser	2023-11-21 13:35:56 -08:00
Debanjum Singh Solanky	76d041f633	Use KHOJ_HOST env var to set allowed/trusted domains to host Khoj Allows hosting Khoj behind other, non "khoj.dev" domains	2023-11-21 13:11:45 -08:00
Debanjum Singh Solanky	90d463c12a	Append chat message to chat logs as TextNodes in web, desktop clients	2023-11-21 13:10:50 -08:00
Debanjum Singh Solanky	befcbcdd5d	Use file source to find entries from github, notion on web, desktop client This is a more robust mechanism of identification than via file name including github or notion domain names	2023-11-21 13:10:50 -08:00
Debanjum Singh Solanky	3f0de45ec6	Pass file source to clients via text search API response Source of entry stored in DB is now passed to clients for processing	2023-11-21 13:10:50 -08:00
Debanjum Singh Solanky	4aec581306	Handle image search setup related warning Ideally should rename model_directory to config_directory or some such but the current image search code will need to be migrated soon. So changing the variable name and creating a migration script for old khoj.yml files using model-directory variable isn't worth it Remove the explicity set of number of threads to use by pytorch. Use the default used by it.	2023-11-21 13:10:50 -08:00
Debanjum Singh Solanky	b06628ee31	Format Django initializing outputs using Khoj logger format - Collect STDOUT from the `migrate', `collectstatic' commands and output using the Khoj logger format and verbosity settings - Only show Django `collectstatic' command output in verbose mode - Fix showing the Initializing Khoj log line by moving it after logger level set	2023-11-21 13:10:50 -08:00
sabaimran	341abf03ff	Handle none for search_type and use equals comparator rather than in for determining Notion type	2023-11-21 12:55:09 -08:00
sabaimran	2bb989e9d8	Resolve merge conflicts and fix some import ordering	2023-11-21 12:30:43 -08:00
sabaimran	244b76ffed	Add isort for automatic import sorting and skip main.py because it's a drama queen 👑	2023-11-21 12:20:41 -08:00
Debanjum	8a0d92e2d7	Fix Connectivity Check in Obsidian Client (#559 ) from dtkav/bugfix-local-connectivity-check Check connection to Khoj server for self-hosted server. This check had regressed during the cloud rearchitecture	2023-11-21 12:05:16 -08:00
sabaimran	0e6f09b241	Merge pull request #562 from khoj-ai/fix/pypi-package-app-not-included Fix PyPi package app reference issue	2023-11-21 11:54:46 -08:00
sabaimran	333cb3445c	Use colon rather than equals to indicate typing	2023-11-21 11:28:51 -08:00
Debanjum Singh Solanky	645fd96634	Search across all content types from Khoj Obsidian client Previously it was only searching for PDF and Markdown files. This was meant to show only content from current vault as results. But it has not scaled well as other clients also allow syncing PDF and markdown files now. So remove this content type filter for now. A proper solution would limit by using file/dir filters on server or client side.	2023-11-21 11:19:33 -08:00
sabaimran	a1460a5bf9	Set operations to typed empty list in migration file	2023-11-21 11:14:40 -08:00
sabaimran	71e794c26f	Remove the sys.append line in the main.py file, as it's not required	2023-11-21 10:57:21 -08:00
sabaimran	a474c31e02	Move the django app into the src/khoj folder for better organization and functionality - Our pypi package currently does not work because the django app and associated database is not included. To remedy this issue, move the app into the src/khoj folder. This has the added benefit of improved organization of the codebase, as all server related code is now in a single folder - Update associated file paths and system references	2023-11-21 10:56:04 -08:00
Debanjum Singh Solanky	c89bd49973	Fix ranking search results on Obsidian It's reversed since score of entries is now a distance metric on Khoj server. So lesser distance is better. Previously higher score was better	2023-11-21 01:24:59 -08:00
Daniel Grossmann-Kavanagh	f142999bce	fix khoj local server usage	2023-11-20 17:07:30 -08:00
Debanjum Singh Solanky	c07401cf76	Fix, Improve chat config via CLI on first run by using defaults - Fix setting prompt size for online chat - generally improve chat config via cli by using default chat model, prompt size for online and offline chat	2023-11-20 17:01:20 -08:00
sabaimran	b142de15a8	Merge branch 'features/internet-enabled-search' of github.com:khoj-ai/khoj into features/reflective-suggested-questions	2023-11-20 15:56:09 -08:00
sabaimran	a9623ef85a	Add requisite imports in order to instantiate offline model in adapters file	2023-11-20 15:27:42 -08:00
sabaimran	a8f13f334f	Fix merging issues with base after popping the stash	2023-11-20 15:22:50 -08:00

... 6 7 8 9 10 ...

2118 commits