sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-23 23:48:56 +01:00

Author	SHA1	Message	Date
Debanjum Singh Solanky	035165b534	Make offline chat model current date aware. Improve system prompts - Can now expect date awareness chat quality test to pass - Prevent offline chat model from printing verbatim user Notes and special tokens - Make it ask follow-up questions if it needs more context	2024-02-06 20:23:19 +05:30
Debanjum Singh Solanky	0d949140f4	Fix actor, director tests using freeze time by ignoring transformers package transformers package was causing freeze time to fail during setup	2024-02-06 03:00:48 +05:30
sabaimran	b782683e60	Scrape results from Serper results using Olostep (#627) * Initialize changes to incorporate web scraping logic after getting SERP results - Do some minor refactors to pass a symptom prompt to the openai model when making a query - integrate Olostep in order to perform the webscraping * Fix truncation error with new line, fix typing in olostep code * Use the authorization header for the token * Add a small hint/indicator for how to use Khoj's other modalities in the welcome prompt * Add more detailed error message if Olostep query fails * Add unit tests which invoke Olostep in chat director * Add test for olostep tool	2024-01-29 14:16:50 +05:30
Debanjum	4d30f7d1d9	Short-circuit API rate limiter for unauthenticated users (#607) ### Major - Short-circuit API rate limiter for unauthenticated user Calls by unauthenticated users were failing at API rate limiter as it failed to access user info object. This is a bug. API rate limiter should short-circuit for unauthenticated users so a proper Forbidden response can be returned by API. Add regression test to verify that unauthenticated users get 403 response when calling the /chat API endpoint ### Minor - Remove trailing slash to normalize khoj url in obsidian plugin settings - Move used /api/config API controllers into separate module - Delete unused /api/beta API endpoint - Fix error message rendering in khoj.el, khoj obsidian chat - Handle deprecation warnings for subscribe renew date, langchain, pydantic & logger.warn	2024-01-17 00:59:52 +05:30
Debanjum Singh Solanky	d26a4ffcea	Only run the OpenAI chat client, /online test when API keys are set	2024-01-17 00:36:03 +05:30
Debanjum Singh Solanky	7039c202c8	Merge branch 'master' into short-circuit-api-rate-limiter	2024-01-16 18:18:34 +05:30
Debanjum Singh Solanky	6ded4c1d75	Merge branch 'master' into fix-1000-file-index-update-limit	2024-01-16 16:50:58 +05:30
Debanjum Singh Solanky	e0b381d523	Only run /online command offline chat director test when SERPER KEY present	2024-01-16 13:09:38 +05:30
Debanjum Singh Solanky	7dfbcd2e5a	Handle subscribe renew date, langchain, pydantic & logger.warn warnings - Ensure langchain less than 0.2.0 is used, to prevent breaking ChatOpenAI, PyMuPDF usage due to their deprecation after 0.2.0 - Set subscription renewal date to a timezone aware datetime - Use logger.warning instead of logger.warn as latter is deprecated - Use `model_dump` not deprecated dict to get all configured content_types	2024-01-12 01:46:52 +05:30
Debanjum Singh Solanky	ba99089a12	Short-circuit API rate limiter for unauthenticated user Calls by unauthenticated users were failing at API rate limiter as it failed to access user info object. This is a bug. API rate limiter should short-circuit for unauthenticated users so a proper Forbidden response can be returned by API. Add regression test to verify that unauthenticated users get 403 response when calling the /chat API endpoint	2024-01-12 00:23:50 +05:30
Debanjum Singh Solanky	4ded32cc64	Test 1000 file upload limit to index/update API endpoint Due to FastAPI limitation	2024-01-03 22:14:36 +05:30
sabaimran	79913d4c17	Add isort to the pre-commit configuration and apply it to the whole project (#595) * Apply isort to the entire repository * Fix missing import issues in text_to_entries * Fix imports in migration files	2023-12-28 18:04:02 +05:30
sabaimran	6dd2b05bf5	Rebase with master	2023-12-19 21:02:49 +05:30
sabaimran	ef21d78c99	Initial changes to support multiple search model configurations - All search models are loaded into memory, and stored in a dictionary indexed by name - Still need to add database migrations and create a UI for user to select their choice. Presently, it uses the default option	2023-12-05 00:35:40 -05:00
Debanjum Singh Solanky	2b09caa237	Make online results an optional argument to the gpt converse method	2023-12-04 12:15:29 -05:00
sabaimran	6e1ba11e59	Resolve merge conflicts for rendering chat response	2023-11-27 11:33:13 -08:00
sabaimran	e438853b09	Add additional unit tests to verify behavior of unsubscribed/subscribed users	2023-11-26 13:09:00 -08:00
Debanjum Singh Solanky	a0a7ab7ec8	Rename conversation.gpt4all package to conversation.offline	2023-11-26 04:19:32 -08:00
sabaimran	73e38fccf3	Explicitly set billing to off in the test for being able to index a large set of data	2023-11-25 20:48:32 -08:00
sabaimran	b2afbaa315	Add support for rate limiting the amount of data indexed - Add a dependency on the indexer API endpoint that rounds up the amount of data indexed and uses that to determine whether the next set of data should be processed - Delete any files that are being removed for administering the calculation - Show current amount of data indexed in the config page	2023-11-25 20:28:04 -08:00
sabaimran	60c23d9e3a	Add online search chat director tests	2023-11-21 23:08:36 -08:00
sabaimran	c652a7fd2d	Move text_to_entries under the new content folder	2023-11-21 22:25:17 -08:00
sabaimran	1e2af083f0	Rename the data_sources module to content	2023-11-21 22:11:32 -08:00
sabaimran	2bb989e9d8	Resolve merge conflicts and fix some import ordering	2023-11-21 12:30:43 -08:00
sabaimran	a474c31e02	Move the django app into the src/khoj folder for better organization and functionality - Our pypi package currently does not work because the django app and associated database is not included. To remedy this issue, move the app into the src/khoj folder. This has the added benefit of improved organization of the codebase, as all server related code is now in a single folder - Update associated file paths and system references	2023-11-21 10:56:04 -08:00
sabaimran	b8e6883a81	Merge branch 'master' of github.com:khoj-ai/khoj into features/internet-enabled-search	2023-11-19 16:20:08 -08:00
sabaimran	4def8cce36	Merge pull request #541 from asim-shrestha/patch-1 Add test separators	2023-11-19 14:14:34 -08:00
Debanjum	71799add0b	Index Parent Headings of Org-Mode Entries to Improve Search Context (#548) ### Overview The parent hierarchy of org-mode entries can store important context. This change updates OrgNode to track parent headings for each org entry and adds the parent outline for each entry to the index ### Details - Test search uses ancestor headings as context for improved results - Add ancestor headings of each org-mode entry to their compiled form - Track ancestor headings for each org-mode entry in org-node parser Resolves #85	2023-11-19 13:18:19 -08:00
sabaimran	e398a76779	Fix test word filter	2023-11-19 13:14:58 -08:00
sabaimran	33a9304428	Resolve merge conflicts	2023-11-19 12:57:55 -08:00
sabaimran	ef5e9d66c1	Resolve merge conflicts in dependency imports	2023-11-19 11:42:20 -08:00
sabaimran	f688529150	Update the default configuration for the AppConfig	2023-11-17 19:26:31 -08:00
Debanjum Singh Solanky	ca87b4ede9	Wrap common API query parameters into shared class to deduplicate code - Upgrade FastAPI to >= latest version. Required upgrade of FastAPI. Earlier version didn't support wrapping common query params in class - Use per fixture app instead of a global FastAPI app in conftest - Upgrade minimum required Django version - Fix no notes chat director test with updated no notes message No notes message was updated in commit `118f1143`	2023-11-17 18:43:49 -08:00
Debanjum Singh Solanky	33ad9b8e64	Update text search test since indexing ancestor hierarchy added	2023-11-17 15:26:55 -08:00
Debanjum Singh Solanky	55785d50c3	Use title, when present, as root ancestor of entries instead of file path	2023-11-17 15:03:27 -08:00
sabaimran	ec06d2c446	Move data indexer files into a separate folder under processor. Update assoc UTs	2023-11-16 17:19:55 -08:00
Debanjum Singh Solanky	ddb07def0d	Test search uses ancestor headings as context for improved results - Update test data to add deeper outline hierarchy for testing hierarchy as context - Update collateral tests that need count of entries updated, deleted asserts to be updated	2023-11-16 03:05:19 -08:00
Debanjum Singh Solanky	74403e3536	Add ancestor headings of each org-mode entry to their compiled form Resolves #85	2023-11-16 02:54:41 -08:00
Debanjum Singh Solanky	305c25ae1a	Track ancestor headings for each org-mode entry in org-node parser	2023-11-16 02:39:14 -08:00
Debanjum Singh Solanky	922983bd53	Set max cos distance to 0.18. Test search API query with max distance	2023-11-15 20:26:21 -08:00
Debanjum Singh Solanky	08a057bdd5	Rename SearchModel to SearchModelConfig DB model, Require Cross-Encoder	2023-11-15 17:31:50 -08:00
Debanjum Singh Solanky	5a6ab9cc85	Fix failing client tests	2023-11-15 00:17:44 -08:00
Debanjum Singh Solanky	8f200cf53f	Remove unused parameter from configure_search_type method	2023-11-14 19:09:35 -08:00
Debanjum Singh Solanky	4af194d74b	Make search model configurable on server - Expose ability to modify search model via Django admin interface - Previously the bi_encoder and cross_encoder models to use were set in code - Now it's user configurable but with a default config generated by default	2023-11-14 19:09:35 -08:00
Debanjum Singh Solanky	e98141f4c3	Subscribe default user to standard plan with a far away renewal date Self hosted users in anonymous mode have all capabilities unlocked	2023-11-14 16:31:39 -08:00
Asim Shrestha	0bfc094e18	Add test separators	2023-11-11 17:08:58 -08:00
Debanjum Singh Solanky	c4364b9100	Weaken asking follow-up qs and q&a mode in notes prompt to OpenAI models - Notes prompt doesn't need to be so tuned to question answering. User could just want to talk about life. The notes need to be used to response to those, not necessarily only retrieve answers from notes - System and notes prompts were forcing asking follow-up questions a little too much. Reduce strength of follow-up question asking	2023-11-10 23:36:43 -08:00
sabaimran	e2e96f9aa4	Add default settings to let new users be subscribed on trial - Add the default user to a subscription trial - Update associated unit tests	2023-11-10 22:38:28 -08:00
Debanjum Singh Solanky	c9c0ba67c6	Fix chat_client configurations for OpenAI chat director tests	2023-11-10 17:29:23 -08:00
sabaimran	262a8574d1	Add a test to verify that a user without data successfully returns a response to the /search endpoint	2023-11-10 14:00:58 -08:00

311 commits