sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-12-03 12:23:02 +01:00

Author	SHA1	Message	Date
Debanjum Singh Solanky	6a8fd9bf33	Reorder embeddings search arguments based on argument importance	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	0eacc0b2b0	Use consistent name for user, planner to not miss current user query Previously Khoj would start answering the previous query. This maybe because the prompt uses User for prompt in chat history but was using Q for current user prompt.	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	284c8c331b	Increase default max iterations for research chat director to 5	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	1e390325d2	Let research chat director decide which webpage to read, if any Make webpages to read automatically on search_online configurable via a argument. Set it to default to 1, so other callers of the function are unaffected. But iterative chat director can still decide which, if any, webpages to read based on the online search it performs	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	5a699a52d2	Improve webpage summarization prompt to better extract links, excerpts This change allows the iterative director to dive deeper into its research as the data extracted contains relevant links from the webpage Previous summarization prompt didn't extract relevant links from the webpage which limited further explorations from webpages	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	61df1d5db8	Pass previous iteration results to code interpreter chat actors This improves the code interpreter chat actors abilitiy to generate code with data collected during the previous iterations	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	9e7025b330	Set python interpret sandbox url via environment variable	2024-10-10 04:45:00 -07:00
Debanjum Singh Solanky	2dc5804571	Extract defilter query into conversation utils for reuse	2024-10-10 04:45:00 -07:00
sabaimran	e69a8382f2	Add a code icon for code-related train of thought	2024-10-09 23:56:57 -07:00
sabaimran	536422a40c	Include code snippets in the reference panel	2024-10-09 23:54:11 -07:00
Debanjum Singh Solanky	8d33c764b7	Allow iterative chat director to use python interpreter as a tool	2024-10-09 23:38:20 -07:00
Debanjum Singh Solanky	b373073f47	Show executed code in web app chat message references	2024-10-09 22:13:18 -07:00
Debanjum Singh Solanky	a98f97ed5e	Refactor Run Code tool into separate module and modularize code functions Move construct_chat_history and ChatEvent enum into conversation.utils and move send_message_to_model_wrapper to conversation.helper to modularize code. And start thinning out the bloated routers.helper - conversation.util components are shared functions that conversation child packages can use. - conversation.helper components can't be imported by conversation packages but it can use these child packages This division allows better modularity while avoiding circular import dependencies	2024-10-09 22:13:17 -07:00
Debanjum Singh Solanky	8044733201	Give Khoj ability to run python code as a tool triggered via chat API Create python code executing chat actor - The chat actor generate python code within sandbox constraints - Run the generated python code in the cohere terrarium, pyodide based sandbox accessible at sandbox url	2024-10-09 21:37:22 -07:00
Debanjum Singh Solanky	4d33239af6	Improve prompts for the iterative chat director	2024-10-09 21:23:18 -07:00
Debanjum Singh Solanky	6ad85e2275	Fix to continue showing retrieved documents in train of thought	2024-10-09 21:20:22 -07:00
sabaimran	a6f6e4f418	Fix notes references and passage of user query in the chat flow	2024-10-09 20:34:20 -07:00
Debanjum Singh Solanky	ec248efd31	Allow iterative chat director to do notes search	2024-10-09 19:04:59 -07:00
Debanjum Singh Solanky	a6905a9f0c	Pass background context to iterating chat director	2024-10-09 19:04:59 -07:00
sabaimran	028b6e6379	Fix yield for scraping direct web page	2024-10-09 18:14:08 -07:00
sabaimran	717d9da8d8	Handle when summarize result is not present, rename variable in for loop from query	2024-10-09 17:57:08 -07:00
sabaimran	03544efde2	Ignore typing of the result dict for online, web page scrape	2024-10-09 17:48:24 -07:00
sabaimran	ab81b01fcb	Fix typing of direct_web_pages and remove the deprecated chat API	2024-10-09 17:46:28 -07:00
sabaimran	5b8d663cf1	Add intermediate summarization of results when planning with o1	2024-10-09 17:40:56 -07:00
sabaimran	7b288a1179	Clean up the function planning prompt a little bit	2024-10-09 16:59:20 -07:00
sabaimran	f71e4969d3	Skip summarize while it's broken, and snip some other parts of the workflow while under construction	2024-10-09 16:40:06 -07:00
sabaimran	f7e6f99a32	add typing for extract document references	2024-10-09 16:05:34 -07:00
sabaimran	6960fb097c	update types of prev iterations response	2024-10-09 16:04:39 -07:00
sabaimran	4978360852	Fix type of previous_iterations	2024-10-09 16:02:41 -07:00
sabaimran	46ef205a75	Add additional type annotations for compiled_references et al	2024-10-09 16:01:52 -07:00
sabaimran	4fbaef10e9	Correct usage of the summarize function	2024-10-09 15:58:05 -07:00
sabaimran	c91678078d	Correct the usage of query passed to summarize function	2024-10-09 15:55:55 -07:00
sabaimran	f867d5ed72	Working prototype of meta-level chain of reasoning and execution - Create a more dynamic reasoning agent that can evaluate information and understand what it doesn't know, making moves to get that information - Lots of hacks and code that needs to be reversed later on before submission	2024-10-09 15:54:25 -07:00
Debanjum	00546c1a63	Fix link to llama-cpp-python setup docs	2024-10-09 01:30:33 -07:00
Debanjum Singh Solanky	9a056383e0	Reduce size of start chat and edit buttons on agent card in web app	2024-10-09 00:00:32 -07:00
Debanjum Singh Solanky	dc7f22f76c	Mention no. of docs in agents knowledge base in its badge hover text	2024-10-08 23:51:00 -07:00
Debanjum Singh Solanky	13fb22f7e7	Update agent form data shown in edit card after save operaton on web app Previously you had to refresh the page to see the updated data on reopening the agents edit card after a save operation. Now you see the latest saved agent data on reopening the agents edit card. This should avoid confusion on whether the data was saved correctly	2024-10-08 23:26:04 -07:00
Debanjum Singh Solanky	dd770cf1b9	Start chat with public and protected agents when shared via link	2024-10-08 22:10:07 -07:00
Debanjum Singh Solanky	80212c50fd	Use default agent in others chats with an agent if agent made private If a public or protected agent is made private. Other users who were having conversation with that agent will have to carry on their conversation using default agent instead	2024-10-08 22:08:38 -07:00
Debanjum Singh Solanky	d628f89ce9	Prefetch agents related database models	2024-10-08 21:59:15 -07:00
Debanjum Singh Solanky	8de67c5d4d	Fallback to use general command if no tool selected by agent	2024-10-08 19:48:02 -07:00
Debanjum Singh Solanky	b80c4bcfdd	Improve agent command descriptions	2024-10-08 19:47:51 -07:00
Debanjum Singh Solanky	67d0e59eac	Pass chat history to the summarize chat actor	2024-10-08 18:44:52 -07:00
Debanjum Singh Solanky	7e3090060b	Encourage Gemini to output more verbose responses	2024-10-08 18:41:43 -07:00
Debanjum Singh Solanky	bbbdba3093	Time embedding model load for better visibility into app startup time Loading the embeddings model, even locally seems to be taking much longer. Use timer to track visibility into embedding, cross-encoder model load times	2024-10-08 18:41:43 -07:00
Debanjum Singh Solanky	516472a8d5	Switch default tokenizer to tiktoken as more widely used The tiktoken BPE based tokenizers seem more widely used these days. Fallback to gpt-4o tiktoken tokenizer to count tokens for context stuffing	2024-10-08 18:41:43 -07:00
Debanjum Singh Solanky	2b8f7f3efb	Reuse a single func to format conversation for Gemini This deduplicates code and prevents logic from deviating across gemini chat actors	2024-10-08 18:41:42 -07:00
Debanjum Singh Solanky	452e360175	Do not use max prompt size to limit Gemini max output tokens We should start disambiguating the the max input from output size. Max prompt size should only be used for the max input context to an LLM. If required max_output_tokens should be set as a separate new field	2024-10-08 15:30:08 -07:00
Debanjum Singh Solanky	bdc36fec5d	Remove unnecessary whitespace indent from personality context	2024-10-08 15:30:08 -07:00
sabaimran	3daa3c003d	When tool selection is not done successfully with an agent, return all agent tools as options	2024-10-08 15:03:58 -07:00

1 2 3 4 5 ...

3536 commits