Commit graph

1490 commits

sabaimran
62704cac09
Add a plugin which allows users to index their Notion pages (#284)
* For the demo instance, re-instate the scheduler, but infrequently for api updates
- In constants, determine the cadence based on whether it's a demo instance or not
- This allows us to collect telemetry again. It will also allow us to save the chat session
* Conditionally skip updating the index altogether if it's a demo instance
* Add backend support for Notion data parsing
- Add a NotionToJsonl class which parses the text of Notion documents made accessible to the API token
- Make corresponding updates to the default config, raw config to support the new notion addition
* Add corresponding views to support configuring Notion from the web-based settings page
- Support backend APIs for deleting/configuring notion setup as well
- Streamline some of the index updating code
* Use default results count for search and chat queries
* Update pagination of retrieving pages from Notion
* Update conversation processor state when the update endpoint is hit
* frequency_penalty should be passed to gpt through kwargs
* Add check for notion in render_multiple method
* Add headings to Notion render
* Revert results count slider and split Notion files by blocks
* Clean/fix misc things in the function to update index
- Use the successText and errorText variables appropriately
- Name parameters in function calls
- Add emojis, woohoo
* Clean up and further modularize code for processing data in Notion
2023-07-09 15:29:26 -07:00
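
A minimal sketch of the cursor-based pagination used when retrieving pages from the Notion API; the fetch_all_pages helper and exact request shape here are illustrative, not the plugin's actual code:

```python
import requests

NOTION_SEARCH_URL = "https://api.notion.com/v1/search"

def fetch_all_pages(token: str):
    """Yield every page visible to the integration token, following cursor pagination."""
    headers = {
        "Authorization": f"Bearer {token}",
        "Notion-Version": "2022-06-28",
        "Content-Type": "application/json",
    }
    cursor = None
    while True:
        body = {"page_size": 100}
        if cursor:
            body["start_cursor"] = cursor
        response = requests.post(NOTION_SEARCH_URL, headers=headers, json=body).json()
        yield from response.get("results", [])
        # The Notion API signals further pages via has_more/next_cursor
        if not response.get("has_more"):
            break
        cursor = response.get("next_cursor")
```
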
Debanjum
77755c0284
Fix Packaging the Khoj Desktop Apps (#289)
* Add langchain static files and pytorch metadata to Khoj native app

* Add pillow static files, metadata & hidden imports to Khoj native app

* Fix path to web interface static files on Khoj native app

* Add tiktoken hidden imports to make chat work from Khoj native app

* Fix Khoj native app to run with GUI mode enabled

This got broken when we moved from using the --no-gui flag to using
--gui in https://github.com/khoj-ai/khoj/pull/263
2023-07-09 10:21:16 -07:00
sabaimran
4c135ea316
Make streaming optional for the /chat endpoint (#287)
* Update the /chat endpoint to conditionally support streaming

- If streaming is enabled, return the ThreadGenerator as before
- If streaming is disabled, return a JSON response with the response and compiled references separated out
- Correspondingly, update the chat.html UI to use the streamed API, as well as Obsidian
- Rename chat/init/ to chat/history

* Update khoj.el to use the /history endpoint

- Update corresponding unit tests to use stream=true

* Remove & from call to /chat for obsidian

* Abstract functions out into a helpers.py file and clean up some of the error-catching
2023-07-09 10:12:09 -07:00
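
A rough sketch of what such a conditional streaming endpoint can look like in FastAPI; the route path, parameter names, and the stubbed token generator are illustrative, not Khoj's exact implementation:

```python
import json
from fastapi import FastAPI
from fastapi.responses import Response, StreamingResponse

app = FastAPI()

def generate_chat_response(query: str):
    # Stand-in for the thread generator that yields tokens from the LLM
    for token in ["Hello", ", ", "world"]:
        yield token

@app.get("/chat")
def chat(q: str, stream: bool = False):
    gen = generate_chat_response(q)
    if stream:
        # Streaming enabled: hand the generator straight to the client
        return StreamingResponse(gen, media_type="text/plain")
    # Streaming disabled: collect the full response, return it as JSON with references split out
    response_text = "".join(gen)
    compiled_references = []  # would hold the notes used to ground the answer
    return Response(
        content=json.dumps({"response": response_text, "context": compiled_references}),
        media_type="application/json",
    )
```
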
Debanjum Singh Solanky
0a86220d42 Use default values, delete content config on disable and update state 2023-07-07 20:36:16 -07:00
Debanjum Singh Solanky
362063f5fe By default, connect to Khoj server over IPv4 from Obsidian plugin 2023-07-07 20:36:16 -07:00
Debanjum Singh Solanky
571e8c2548 Add rerank, index corruption hint on search page of web interface
Similar to the hint already in the Obsidian search modal
Closes #272
2023-07-07 20:36:16 -07:00
Debanjum
4b79d8216f
Move remaining chat actors to use OpenAI chat models
- Deprecate the unused beta /answer and /search type identification endpoints and associated GPT functions
- Update extract_questions to use GPT4
- Update summarize method to default to GPT-3.5
- Update date filter to support quoting values in single quotes too. So now both dt>'2023-04-01' and dt>"2023-04-01" should work
- Remove "model" field from chat settings on the web interface
2023-07-07 18:53:05 -07:00
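
For the date filter change, an illustrative pattern that accepts either quote style (this regex is a sketch, not the project's actual filter code):

```python
import re

# Accept dt filters with single- or double-quoted dates, e.g. dt>'2023-04-01' or dt>="2023-04-01"
date_filter_regex = r"dt([:><=]{1,2})[\"'](.*?)[\"']"

for query in ['notes dt>"2023-04-01"', "notes dt>'2023-04-01'"]:
    match = re.search(date_filter_regex, query)
    print(match.group(1), match.group(2))  # prints: > 2023-04-01 for both quote styles
```
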
Debanjum Singh Solanky
61e131f95c Hide unused model field from chat settings on web interface 2023-07-07 18:43:53 -07:00
Debanjum Singh Solanky
af30d01e85 Move to newer chat models to extract questions & summarize chats
Deprecate usage of the older GPT-3 models in favor of the newer chat-based
models
- text-davinci-003 is only 50% cheaper than gpt4 and less reliable for
  question extraction
- Using gpt-3.5-turbo for summarization should reduce the cost of chat

- Keep conversation.chat_session as a list instead of a string
- Update completion_with_backoff func to use ChatML format
2023-07-07 17:32:27 -07:00
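
A hedged sketch of what a chat-format completion_with_backoff can look like with the 2023-era (pre-1.0) openai client and tenacity for retries; the retry policy, signature, and usage are assumptions, not the repository's code:

```python
import openai
from tenacity import retry, stop_after_attempt, wait_random_exponential

@retry(wait=wait_random_exponential(min=1, max=30), stop=stop_after_attempt(3))
def completion_with_backoff(messages, model="gpt-3.5-turbo", api_key=None, **kwargs):
    # messages follow the chat format: a list of {"role", "content"} dicts
    response = openai.ChatCompletion.create(model=model, messages=messages, api_key=api_key, **kwargs)
    return response["choices"][0]["message"]["content"]

# Usage (requires a valid OpenAI API key):
# summary = completion_with_backoff(
#     messages=[
#         {"role": "system", "content": "Summarize the conversation below."},
#         {"role": "user", "content": "..."},
#     ],
#     api_key="sk-...",
# )
```
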
Debanjum Singh Solanky
171ce19e1f Update date filter to allow quoting values in single quotes 2023-07-07 17:13:47 -07:00
Debanjum Singh Solanky
e588f7c528 Deprecate unused beta search and answer API endpoints 2023-07-07 16:38:07 -07:00
Debanjum Singh Solanky
c9fc4d1296 Revert to using cross-encoder to improve search results used by chat 2023-07-07 15:31:34 -07:00
Debanjum Singh Solanky
11f0a9f196 Fix chat tests since streaming. Pass args correctly to chat methods
- Fix testing gpt converse method after it started streaming responses
- Pass stop in model_kwargs dictionary and api key in openai_api_key
  parameter to chat completion methods. This should resolve the arg
  warning thrown by OpenAI module
2023-07-07 15:23:44 -07:00
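
If the chat completion methods go through LangChain's ChatOpenAI wrapper, which the arg warning suggests but is an assumption here, the described fix corresponds roughly to:

```python
from langchain.chat_models import ChatOpenAI

# Pass `stop` inside model_kwargs and the key via openai_api_key so the wrapper
# does not warn about unexpected parameters; the values below are placeholders.
chat_model = ChatOpenAI(
    model_name="gpt-3.5-turbo",
    temperature=0,
    openai_api_key="sk-...",
    model_kwargs={"stop": ["Question:"]},
)
```
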
Debanjum Singh Solanky
48870d9170 Fix parsing questions generated by extract_questions actor into list
The previous json parsing was failing to handle questions with date
filters

Fix the chat actor tests to run without throwing error with freezegun
complaining about importing transformers.local_llama model

Remove quote escapes from date filter examples provided to
extract_questions actor
2023-07-07 15:18:55 -07:00
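
A tolerant parser along the lines described above might look like the sketch below; the fallback strategy and function name are illustrative:

```python
import json

def parse_questions(gpt_response: str) -> list[str]:
    """Parse the questions returned by the extract_questions actor into a list.

    Falls back to naive splitting when the raw response is not valid JSON,
    e.g. when a question embeds a quoted date filter like dt>="2023-04-01".
    """
    try:
        questions = json.loads(gpt_response.strip())
    except json.JSONDecodeError:
        # Strip the surrounding brackets and split on the quote-comma-quote boundary
        questions = [q.strip('"') for q in gpt_response.strip(" \n[]").split('", "')]
    return questions

print(parse_questions('["What did I do in April?", "Notes from dt>=\'2023-04-01\'"]'))
```
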
Debanjum Singh Solanky
279662620b Move results count to settings page on web. Use it for search & chat
- Before
  Only the search interface had the results count configuration option

- After
  - The results count is set on the settings page instead of the
    search page
  - Both search and chat can use the configured results count instead
    of just search
2023-07-07 14:08:08 -07:00
Debanjum Singh Solanky
2ec8da89e8 Remove Update button from Khoj Search page on the Web interface
The settings page on the Khoj web interface already has a configure
button. Don't need the Update button on the search page as well
2023-07-07 12:49:58 -07:00
Debanjum Singh Solanky
bf427cd8dd Set no. of results used to generate chat response from Khoj Emacs 2023-07-07 12:34:50 -07:00
Debanjum Singh Solanky
1d77fe712c Set no. of results used to generate chat response from Khoj Obsidian 2023-07-07 12:32:32 -07:00
Debanjum Singh Solanky
2f31de5ed5 Set no. of references to use for chat configurable in Chat API 2023-07-07 12:29:36 -07:00
Debanjum Singh Solanky
d97682fdac Use tooltip, placeholders to guide Khoj setup via web settings page 2023-07-06 21:37:48 -07:00
Debanjum Singh Solanky
f5cf09424b Use more descriptive field names for content type settings on Khoj web
Resolves #281
2023-07-06 20:47:39 -07:00
Debanjum Singh Solanky
a2c668268f Use node-fetch >=3.1.0 in khoj obsidian plugin to avoid security vulnerability 2023-07-06 13:05:52 -07:00
sabaimran
d688ddf92c
Re-instate the scheduler for the demo instances (#279)
* For the demo instance, re-instate the scheduler, but infrequently for api updates

- In constants, determine the cadence based on whether it's a demo instance or not
- This allows us to collect telemetry again. It will also allow us to save the chat session

* Conditionally skip updating the index altogether if it's a demo instance
2023-07-06 11:01:32 -07:00
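
A minimal sketch of picking the update cadence by instance type, assuming the schedule package and a hypothetical KHOJ_DEMO flag; the concrete intervals are placeholders:

```python
import os
import schedule

def update_index():
    ...  # trigger content re-indexing and telemetry upload

demo_mode = bool(os.getenv("KHOJ_DEMO"))       # hypothetical flag name
cadence_minutes = 6 * 60 if demo_mode else 30  # poll rarely on the demo, frequently otherwise
schedule.every(cadence_minutes).minutes.do(update_index)
```
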
Debanjum Singh Solanky
8f36572a9b Improve typing, null checks in controllers and gpt functions 2023-07-05 20:49:25 -07:00
Debanjum Singh Solanky
41ac1e24c9 Add docs for a pre-emptive setup of Khoj for later offline usage
Closes #151
2023-07-05 20:48:51 -07:00
Debanjum
6c2a8a5bce
Stream Responses by Khoj Chat on Web, Obsidian
- What
   - Stream chat responses from OpenAI API to Web, Obsidian clients
      - Implement using a callback function which manages a queue where new tokens can be placed as they come in. As the thread is read from, tokens are removed.
      - When the final token has been processed, add the `compiled_references` to the queue to be rendered by the `chat` client
      - When the thread has been closed, save the accumulated conversation log in the user's history using a `partial func`
      - Incrementally decode tokens on the front end and add them as they appear from the streamed response

- Why
This significantly reduces perceived latency and OpenAI API request timeouts for Chat

Closes https://github.com/khoj-ai/khoj/issues/257
2023-07-05 20:02:11 -07:00
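
A sketch of the queue-backed generator described above; the class name, method names, and reference formatting are illustrative rather than the exact implementation:

```python
import queue

class ThreadedGenerator:
    """Bridges the producer thread (the LLM streaming callback) and the consumer (the HTTP response)."""

    def __init__(self, compiled_references, completion_func=None):
        self.queue = queue.Queue()
        self.compiled_references = compiled_references
        self.completion_func = completion_func  # e.g. functools.partial(save_conversation, user=...)

    def __iter__(self):
        return self

    def __next__(self):
        item = self.queue.get()
        if item is StopIteration:
            raise StopIteration
        return item

    def send(self, token):
        # Called by the streaming callback for each new token from the model
        self.queue.put(token)

    def close(self):
        # Append the references for the client to render, stop iteration, then persist the chat log
        self.queue.put(f"### compiled references: {self.compiled_references}")
        self.queue.put(StopIteration)
        if self.completion_func:
            self.completion_func()
```

The producer side calls send() for every streamed token and close() once the model finishes; the client simply iterates over the generator to render tokens as they arrive.
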
Debanjum Singh Solanky
e111eda6ae Make client, app_config optional in telemetry logger for correct typing 2023-07-05 18:57:38 -07:00
Debanjum Singh Solanky
e562114f6b Improve comments, var names in js for chat streaming on web interface 2023-07-05 18:57:27 -07:00
Debanjum Singh Solanky
46269ddfd3 Fix chat logging messages to get context without flooding logs 2023-07-05 18:27:06 -07:00
Debanjum Singh Solanky
0ba838b53a Show temp status message in Khoj Obsidian chat while Khoj is thinking
- Scroll to bottom after adding temporary status message and
references too
2023-07-05 18:02:43 -07:00
Debanjum Singh Solanky
8271abe729 Use optional chaining operator to extract khojBannerSubmit from conditional 2023-07-05 18:02:43 -07:00
Debanjum Singh Solanky
c12ec1fd03 Show temp status message in Khoj web chat while Khoj is thinking
- Scroll to bottom after adding temporary status message and
references too
2023-07-05 18:02:30 -07:00
sabaimran
257a421e45 Bonus: add try-catch logic around telemetry upload in case of JSON serializability issues 2023-07-05 15:12:18 -07:00
sabaimran
4e6b66b139 Add support for streaming chat response from OpenAI to Obsidian
- I needed to install node-fetch to accomplish this, as the built-in request object from Obsidian doesn't seem to support streaming and the built-in fetch object is very sensitive to cross-origin requests
2023-07-05 15:01:22 -07:00
sabaimran
3ff5074cf5 Log the end-to-end time of generating a streamed response from OpenAI 2023-07-05 14:59:44 -07:00
sabaimran
68e635cc32 Remove additional comments and debug statements 2023-07-05 11:33:56 -07:00
sabaimran
67a8795b1f Clean-up commented out code 2023-07-05 11:24:40 -07:00
sabaimran
79b1b1d350 Save streamed chat conversations via partial function passed to the ThreadGenerator 2023-07-04 17:33:52 -07:00
sabaimran
afd162de01 Add reference notes to result response from GPT when streaming is completed
- NOTE: results are still not being saved to conversation history
2023-07-04 12:47:50 -07:00
sabaimran
8f491d72de Initial code with chat streaming working (warning: messy code) 2023-07-04 10:14:39 -07:00
Debanjum Singh Solanky
5889eceba4 Make text selectable in Khoj chat modal on Obsidian
Previously the text in the Khoj chat modal couldn't be copied as it
was not selectable

Resolves #206
2023-07-03 23:24:04 -07:00
sabaimran
89354def9b Update request timeout window to 20 seconds 2023-07-03 22:28:18 -07:00
sabaimran
b1940519c3 Log error if unable to decode chunk from Github 2023-07-03 16:29:32 -07:00
Debanjum Singh Solanky
ecf9730cd7 Disable Chat, Search on Web if Khoj not configured & show next steps 2023-07-03 16:04:32 -07:00
sabaimran
017e8c1aef
Skip indexing a PDF that has an indexing error (#274) 2023-07-03 15:55:11 -07:00
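
A sketch of the per-file error handling this implies, with a hypothetical extract_pdf_text helper standing in for the actual PDF parser:

```python
import logging

logger = logging.getLogger(__name__)

def extract_pdf_text(path: str) -> str:
    """Hypothetical helper that parses one PDF; raises on malformed files."""
    ...

def index_pdf_files(pdf_files: list[str]) -> dict[str, str]:
    indexed = {}
    for pdf_file in pdf_files:
        try:
            indexed[pdf_file] = extract_pdf_text(pdf_file)
        except Exception as e:
            # Skip just this file instead of failing the whole indexing run
            logger.warning(f"Unable to process file: {pdf_file}. Skipped. Error: {e}")
    return indexed
```
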
sabaimran
a6f313589e Release Khoj version 0.7.1 2023-07-03 12:26:41 -07:00
Debanjum Singh Solanky
70f6b8266c Upgrade minimum supported pydantic version 2023-07-03 12:22:56 -07:00
sabaimran
8bfd5828e6 Remove deprecation notice since we're opening the web UI by default 2023-07-03 12:01:09 -07:00
sabaimran
92d81d3b16
Initialize the search.model field to SearchModels() and fix Reinitialize API call (#273) 2023-07-03 11:32:44 -07:00
sabaimran
61403138d5
Merge pull request #269 from khoj-ai/features/simplify-configuration-steps
Simplify some common configuration steps
2023-07-03 00:16:51 -07:00