sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-23 23:48:56 +01:00

Author	SHA1	Message	Date
Debanjum	f967bdf702	Show correct example index being currently processed in frames eval Previously the batch start index wasn't being passed so all batches started in parallel were showing the same processing example index This change doesn't impact the evaluation itself, just the index shown of the example currently being evaluated	2024-11-10 14:49:51 -08:00
Debanjum	84a8088c2b	Only evaluate non-empty responses to reduce eval script latency, cost Empty responses by Khoj will always be an incorrect response, so no need to make call to an evaluator agent to check that	2024-11-10 14:49:51 -08:00
Debanjum	1ccbf72752	Use logger instead of print to track eval	2024-11-04 00:40:26 -08:00
Debanjum	791eb205f6	Run prompt batches in parallel for faster eval runs	2024-11-02 04:58:03 -07:00
Debanjum	96904e0769	Add script to evaluate khoj on Google's FRAMES benchmark Google's FRAMES benchmark evaluates multi-step retrieval and reasoning capabilities of an agent. The script uses Gemini as an LLM Judge to evaluate Khoj responses to the FRAMES benchmark prompts against the ground truth provided by it.	2024-11-02 04:57:42 -07:00
Debanjum	50ffd7f199	Merge branch 'master' into features/advanced-reasoning	2024-10-28 04:10:59 -07:00
Debanjum	3e17ab438a	Separate notes, online context from user message sent to chat models (#950 ) Overview --- - Put context into separate user message before sending to chat model. This should improve model response quality and truncation logic in code - Pass online context from chat history to chat model for response. This should improve response speed when previous online context can be reused - Improve format of notes, online context passed to chat models in prompt. This should improve model response quality Details --- The document, online search context are now passed as separate user messages to chat model, instead of being added to the final user message. This will improve - Models ability to differentiate data from user query. That should improve response quality and reduce prompt injection probability - Make truncation logic simpler and more robust When context window hit, can simply pop messages to auto truncate context in order of context, user, assistant message for each conversation turn in history until reach current user query The complex, brittle logic to extract user query from context in last user message isn't required.	2024-10-28 02:03:18 -07:00
sabaimran	30f9225021	Merge branch 'master' of github.com:khoj-ai/khoj into features/advanced-reasoning	2024-10-23 19:15:51 -07:00
sabaimran	f3ce47b445	Create explicit flow to enable the free trial (#944 ) * Create explicit flow to enable the free trial The current design is confusing. It obfuscates the fact that the user is on a free trial. This design will make the opt-in explicit and more intuitive. * Use the Subscription Type enum instead of hardcoded strings everywhere * Use length of free trial in the frontend code as well	2024-10-23 15:29:23 -07:00
Debanjum Singh Solanky	39a613d3bc	Fix up openai chat actor tests	2024-10-22 03:09:36 -07:00
sabaimran	ad197be70c	Fix PDFs unit test, skip OCR	2024-10-20 22:25:41 -07:00
sabaimran	a979457442	Add unit tests for agents - Add permutations of testing for with, without knowledge base. Private, public, different users.	2024-10-20 20:04:50 -07:00
Debanjum Singh Solanky	6a8fd9bf33	Reorder embeddings search arguments based on argument importance	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	91c76d4152	Intelligently initialize a decent default set of chat model options Given the LLM landscape is rapidly changing, providing a good default set of options should help reduce decision fatigue to get started Improve initialization flow during first run - Set Google, Anthropic Chat models too Previously only Offline, Openai chat models could be set during init - Add multiple chat models for each LLM provider Interactively set a comma separated list of models for each provider - Auto add default chat models for each provider in non-interactive model if the {OPENAI,GEMINI,ANTHROPIC}_API_KEY env var is set - Do not ask for max_tokens, tokenizer for offline models during initialization. Use better defaults inferred in code instead - Explicitly set default chat model to use If unset, it implicitly defaults to using the first chat model. Make it explicit to reduce this confusion Resolves #882	2024-09-19 20:32:08 -07:00
Debanjum Singh Solanky	bc2e889d72	Update chat director, client tests to call chat API using new POST method	2024-09-11 17:28:06 -07:00
Debanjum Singh Solanky	241b9009ba	Update OpenAI chat actor tests to handle more questions being extracted	2024-09-11 16:16:55 -07:00
Raghav Tirumale	549686a7a4	Add Vision Support (#889 ) # Summary of Changes * New UI to show preview of image uploads * ChatML message changes to support gpt-4o vision based responses on images * AWS S3 image uploads for persistent image context in conversations * Database changes to have `vision_enabled` option in server admin panel while configuring models * Render previously uploaded images in the chat history, show uploaded images for pending msgs * Pass the uploaded_image_url through to subqueries * Allow image to render upon first message from the homepage * Add rendering support for images to shared chat as well * Fix some UI/functionality bugs in the share page * Convert user attached images for chat to webp format before upload * Use placeholder to attached image for data source, response mode actors * Update all clients to call /api/chat as a POST instead of GET request * Fix copying chat messages with images to clipboard TLDR; Add vision support for openai models on Khoj via the web UI! --------- Co-authored-by: sabaimran <narmiabas@gmail.com> Co-authored-by: Debanjum Singh Solanky <debanjum@gmail.com>	2024-09-09 15:22:18 -07:00
Debanjum Singh Solanky	238bc11a50	Fix, improve openai chat actor, director tests & online search prompt	2024-08-22 19:09:33 -07:00
Debanjum Singh Solanky	9986c183ea	Default to gpt-4o-mini instead of gpt-3.5-turbo in tests, func args GPT-4o-mini is cheaper, smarter and can hold more context than GPT-3.5-turbo. In production, we also default to gpt-4o-mini, so makes sense to upgrade defaults and tests to work with it	2024-08-22 19:04:49 -07:00
Debanjum Singh Solanky	58c8068079	Upgrade default offline chat model to llama 3.1	2024-08-20 09:28:56 -07:00
Debanjum	39e566ba91	Improve Document, Online Search to Answer Vague or Meta Questions (#870 ) - Major - Improve doc search actor performance on vague, random or meta questions - Pass user's name to document and online search actors prompts - Minor - Fix and improve openai chat actor tests - Remove unused max tokns arg to extract qs func of doc search actor	2024-08-16 06:46:13 -07:00
srikary12	05c0aa3882	Support exclusion file filters (#826 ) ### Overview Support exclude file filter in user search queries ### Details - All of the exclude file filter terms need to be satisfied - Any one of the include file filter terms should be satisfied ### Example - Search Query: what happened yesterday? -file:"tasks.org" -file:"work.md" file:"diary.org" file:"journal.org - Behavior: Query will try find relevant notes in any of `journal.org` or `diary.org` and not in `tasks.org` and not in `work.md` ### Details * Add support for exclusion file filters * Translate file filter to valid Django DB entry filter regex * Exclude all files when multiple exclude file filter in query Previously we were applying an "Or" filter, which would exclude any file mentioned in a query with multiple exclude file filter. This is not what we naturally mean when we ask excluding a file in a query * Rename, rearrange, deduplicate and add file filter tests Closes #728 --------- Co-authored-by: Debanjum Singh Solanky <debanjum@gmail.com>	2024-08-12 05:41:54 -07:00
sabaimran	c08b9e89f0	Update test_db_lock with new function name	2024-08-08 13:03:01 +05:30
sabaimran	1a1d9c7257	Merge branch 'master' of github.com:khoj-ai/khoj into features/big-upgrade-chat-ux	2024-07-27 14:18:05 +05:30
Debanjum Singh Solanky	878cc023a0	Fix and improve openai chat actor tests - Use new form of passing doc references to now passing chat actor test - Fix message list generation from conversation logs provided Strangely the parent conversation_log gets passed down to message_to_log func when the kwarg is not explicitly specified	2024-07-26 23:53:47 +05:30
sabaimran	44d34f9090	Update the unit test for the subscribed user	2024-07-26 19:59:01 +05:30
sabaimran	377f7668c5	Merge pull request #858 from khoj-ai/use-sse-instead-of-websocket Use Single HTTP API for Robust, Generalizable Chat Streaming	2024-07-26 07:11:54 -07:00
Debanjum Singh Solanky	54b4203683	Update chat API client tests to mix testing of batch and streaming mode	2024-07-23 17:56:03 +05:30
Debanjum Singh Solanky	469a1cb6a2	Move API endpoints under /api/configure/content/ to /api/content/ Pull out /api/configure/content API endpoints into /api/content to allow for more logical organization of API path hierarchy This should make the url more succinct and API request intent more understandable by using existing HTTP method semantics along with the path. The /configure URL path segment was either - redundant (e.g POST /configure/notion) or - incorrect (e.g GET /configure/files) Some example of naming improvements: - GET /configure/types -> GET /content/types - GET /configure/files -> GET /content/files - DELETE /configure/files -> DELETE /content/files This should also align, merge better the the content indexing API triggered via PUT, PATCH /content Refactor Flow 1. Rename /api/configure/types -> /api/content/types 2. Rename /api/configure -> /api 3. Move /api/content to api_content from under api_config	2024-07-19 05:40:34 +05:30
Debanjum Singh Solanky	bba4e0b529	Accept file deletion requests by clients during sync - Remove unused full_corpus boolean. The full_corpus=False code path wasn't being used (accept for in a test) - The full_corpus=True code path used was ignoring file deletion requests sent by clients during sync. Unclear why this was done - Added unit test to prevent regression and show file deletion by clients during sync not ignored now	2024-07-19 04:53:01 +05:30
Debanjum Singh Solanky	5923b6d89e	Split /api/v1/index/update into /api/content PUT, PATCH API endpoints - This utilizes PUT, PATCH HTTP method semantics to remove need for the "regenerate" query param and "/update" url suffix - This should make the url more succinct and API request intent more understandable by using existing HTTP method semantics	2024-07-19 01:45:53 +05:30
Debanjum Singh Solanky	e9f86e320b	Fix and improve offline chat actor, director tests - Use updated references schema with compiled key - Enable director tests that are now expected to pass and that do pass (with Gemma 2 at least)	2024-07-18 03:43:09 +05:30
Debanjum Singh Solanky	de15a7a3fc	Rename API path /api/config to /api/configure - Update clients calling /api/config to call /api/configure instead	2024-07-16 16:13:27 +05:30
Debanjum Singh Solanky	21fe1a917b	Support syncing, searching images from Obsidian plugin	2024-07-11 16:22:31 +05:30
Debanjum Singh Solanky	010486fb36	Split current section once by heading to resolve org-mode indexing bug - Split once by heading (=first_non_empty) to extract current section body Otherwise child headings with same prefix as current heading will cause the section split to go into infinite loop - Also add check to prevent getting into recursive loop while trying to split entry into sub sections	2024-07-06 19:35:59 +05:30
Debanjum Singh Solanky	d5ceff2691	Update tests and documentation with Jina reader API usage and info Update offline, openai chat actor, director tests to not require Serper to run the online command tests Update documentation for self-hosted online search to mention no setup is required by default. But improvements can be made by using Serper.dev or Olostep	2024-07-02 17:19:09 +05:30
Raghav Tirumale	8eccd8a5e4	Support Indexing Images via OCR (#823 ) - Added support for uploading .jpeg, .jpg, and .png files to Khoj from Web, Desktop app - Updating indexer to generate raw text and entries using RapidOCR - Details * added support for indexing images via ocr * fixed pyproject.toml * Update src/khoj/processor/content/images/image_to_entries.py Co-authored-by: Debanjum <debanjum@gmail.com> * Update src/khoj/processor/content/images/image_to_entries.py Co-authored-by: Debanjum <debanjum@gmail.com> * removed redudant try except blocks * updated desktop js file to support image formats * added tests for jpg and png * Fix processing for image to entries files * Update unit tests with working image indexer * Change png test from version verificaition to open-cv verification --------- Co-authored-by: Debanjum <debanjum@gmail.com> Co-authored-by: sabaimran <narmiabas@gmail.com>	2024-07-01 06:00:00 -07:00
Debanjum Singh Solanky	732332a3c5	Spell fix s/e.g/e.g./ across code, tests and docs	2024-06-24 15:24:45 +05:30
Debanjum Singh Solanky	22f6db0a6b	Upgrade RapidOCR and enable for Python 3.12. Fix PDF OCR test	2024-06-22 16:01:55 +05:30
Raghav Tirumale	bd3b590153	Support Indexing Docx Files (#801 ) * Add support for indexing docx files and associated unit tests --------- Co-authored-by: sabaimran <narmiabas@gmail.com>	2024-06-20 11:18:01 +05:30
Raghav Tirumale	d4e5c95711	Add Ability to Summarize Documents (#800 ) * Uses entire file text and summarizer model to generate document summary. * Uses the contents of the user's query to create a tailored summary. * Integrates with File Filters #788 for a better UX.	2024-06-18 19:31:07 +05:30
Debanjum	6afbd8032e	Improve Intermediate Steps in Formulating Chat Response (#799 ) # Major - Disambiguate Text output mode to disambiguate from Default data source lookup - Fix showing headings in intermediate step in generating chat response - Remove "Path" prefix from org ancestor heading in compiled entry # Minor - Fix OpenAI chat actor, director unit tests	2024-06-09 07:55:01 +05:30
Debanjum Singh Solanky	f440ddbe1d	Fix openai chat actor, director tests - Update test ChatModelOptions setup since update to it's schema - Fix stale function calls using their updated signatures	2024-06-09 07:24:47 +05:30
Debanjum Singh Solanky	5f2442450c	Update truncation test to reduce flakyness in cloud tests Removed dependency on faker, factory for the truncation tests as that seems to be the point of flakiness	2024-06-07 19:42:48 +05:30
Debanjum Singh Solanky	18f7e6e7ed	Remove "Path" prefix from org ancestor heading in compiled entry	2024-06-06 16:51:26 +05:30
Debanjum Singh Solanky	22289a0002	Improve task scheduling by using json mode and agent scratchpad - The task scheduling actor was having trouble calculating the timezone. Giving the actor a scratchpad to improve correctness by thinking step by step - Add more examples to reduce chances of the inferred query looping to create another reminder instead of running the query and sharing results with user - Improve task scheduling chat actor test with more tests and by ensuring unexpected words not present in response	2024-05-01 08:30:10 +05:30
Debanjum Singh Solanky	7f5981594c	Only notify when scheduled task results satisfy user's requirements There's a difference between running a scheduled task and notifying the user about the results of running the scheduled task. Decide to notify the user only when the results of running the scheduled task satisfy the user's requirements. Use sync version of send_message_to_model_wrapper for scheduled tasks	2024-05-01 08:30:10 +05:30
Debanjum Singh Solanky	c28d7d3414	Add basic chat actor test to infer scheduled queries	2024-05-01 08:28:59 +05:30
Debanjum	17a06f152c	Support Llama 3 and Improve Offline Chat Actors (#724 ) - Add support for Llama 3 in Khoj offline mode - Make chat actors generate valid json with more local models - Fix offline chat actor tests	2024-04-25 14:00:56 +05:30
Debanjum Singh Solanky	ec41482324	Upgrade default cross-encoder to mixedbread ai's mxbai-rerank-xsmall Previous cross-encoder model was a few years old, newer models should have improved in quality. Model size increases by 50% compared to previous for better performance, at least on benchmarks	2024-04-24 09:50:09 +05:30

1 2 3 4 5 ...

403 commits