sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-12-20 19:37:45 +00:00

Author	SHA1	Message	Date
Debanjum Singh Solanky	2fbc609233	Add content write permission to jobs in github release workflow	2023-07-01 06:23:45 -07:00
Debanjum Singh Solanky	d77e05c279	Release Khoj version 0.7.0	2023-07-01 05:44:22 -07:00
Debanjum Singh Solanky	32d73500ba	Update Khoj Github Plugin details in main Readme	2023-07-01 02:18:47 -07:00
Debanjum Singh Solanky	30d87a9a01	Update color of Khoj chat in Obsidinan plugin to Lantern theme	2023-07-01 02:18:47 -07:00
Debanjum Singh Solanky	51826d28d6	Ensure clicking Update in Khoj Obsidian indexes PDF files too	2023-07-01 02:18:47 -07:00
sabaimran	dac2d14380	Handle file names appropriately for md files and render commits in github results	2023-07-01 01:20:58 -07:00
sabaimran	dbe713604d	Fix error in tests for markdown_to_jsonl	2023-07-01 00:49:40 -07:00
sabaimran	931aab4464	Handle case for when headers value is None	2023-07-01 00:37:30 -07:00
sabaimran	d01afb3ee4	Fix path issues for URL-based markdown files	2023-07-01 00:25:11 -07:00
sabaimran	01aa285d7b	Merge pull request #260 from khoj-ai/features/add-demo-views-for-khoj Add demo view for Khoj	2023-06-30 21:57:43 -07:00
sabaimran	31655447e7	Add the sign-up list to the chat page as well and update copy	2023-06-30 21:43:01 -07:00
sabaimran	cebaa51c2f	Merge branch 'master' of github.com:debanjum/khoj into features/add-demo-views-for-khoj	2023-06-30 20:39:02 -07:00
sabaimran	796102c74e	Add separate configuration if the given Khoj instance is meant for demo - In theory, this will be suitable for any Khoj instance that's meant for external-facing purposes (as in, outside of the user's network) - Prevent re-indexing for Github data if this is a demo instance - Fix up some issues with the CSS which made settings page small in mobile - In the frontend views for Khoj, add a button to get on the waitlist and links to the landing page	2023-06-30 20:38:55 -07:00
sabaimran	a443af3a71	Merge pull request #256 from khoj-ai/features/improve-telemetry Add additional request headers to improve telemetry	2023-06-30 20:35:41 -07:00
sabaimran	db3026739d	Resolve diffs in api.py to make /chat endpoint async with new request parameter	2023-06-30 00:25:37 -07:00
sabaimran	ef72508914	Try/catch around github file decoding, await call to search in chat API, fix img width	2023-06-30 00:23:21 -07:00
Debanjum Singh Solanky	b950889f47	Fix org-mode web renderer to handle results containing list in block - Break out of rendering list if at end of org block in org.js - This would previous hang rendering results in web interface Should try fix this upstream in org.js as well	2023-06-29 19:01:25 -07:00
sabaimran	780c769567	Add additional request headers to improve telemetry	2023-06-29 18:51:24 -07:00
sabaimran	6c10d68262	Merge pull request #253 from khoj-ai/features/github-issues-indexing Support indexing Github issues as well as corresponding comments	2023-06-29 16:02:47 -07:00
sabaimran	b2dd946c6d	Rename issue to entry method for accuracy	2023-06-29 15:23:50 -07:00
Debanjum Singh Solanky	51dfa48e2b	Have Khoj support Python 3.11 as Pytorch supports it now - Previously Khoj could only support Python upto 3.10 due to pytorch. But lots of folks had python 3.11 installed by default on their machines. This required installing python 3.10 and dealing with virtual envs. With Torch >= 2.0.1 now able to support python 3.11, at least one class of installation troubles for Khoj should drop. See https://github.com/pytorch/pytorch/issues/86566 for reference - Preliminary testing indicates using the new torch 2.x may reduce search time by 25% (from 80ms to 60ms on Mac M1) - Update Docs to not require mentioning python <=3.10 required - Update Github test workflow to run khoj tests with python 3.11 too	2023-06-29 15:13:26 -07:00
sabaimran	65bf894302	Interpret org files as a list and put them in separate divs. Update styling of search results to separate into cards	2023-06-29 15:12:48 -07:00
Debanjum Singh Solanky	d212298573	Make Configure button on web interface incrementally update by default We should add a way to force index everything. But force indexing should not be the default when user is just trying update content to index	2023-06-29 14:52:51 -07:00
Debanjum Singh Solanky	da2de21339	Only return requested result count even if search in multiple content types - Set results_count to default value at start so it is an int, never None	2023-06-29 14:49:05 -07:00
sabaimran	77672ac0ae	Demarcate different results with a border box - Add back support for searching by type Github - Remove custom class name in markdown js file	2023-06-29 14:14:25 -07:00
sabaimran	6edc32f2f4	Accept current changes to include issues in rendering flow	2023-06-29 12:25:29 -07:00
Debanjum	f272d4503e	Search across all Asymmetric Text Content Types in Parallel - Allow searching across asymmetric text content types using threads - Query time on my Mac averages 95ms latency (140ms at 90 percentile) across (Org, Markdown, Github, PDF and Music content types) - This is not too much more than search for a single content type (maybe max ~50% latency increase?). Encoding query is what takes most of the time anyway and that's just done once like before, threading adds some overhead - An average of `95 ms` latency or `140ms` at 90th percentile is inline with keeping an incremental search (search-as-you-type) experience - Put logic to remove filter terms from query in a `defilter` method for each filter - Encode query once during search to encode query once across all (asymmetric) content types - Search across all content types via the web and emacs interfaces in [`d5fb419`](`d5fb4196de`) and [`5c4eb95`](`5c4eb950d5`) respectively - Allow Khoj Chat to pull relevant data from across content types (without the perf hit). Khoj chat is only pulling data from a single content type currently	2023-06-29 12:21:27 -07:00
sabaimran	b41c14b258	Use *.markdown in the khoj_docker.yml	2023-06-29 11:55:18 -07:00
sabaimran	e6053951f0	In chat conftest fixtures, use .markdown rather than .md	2023-06-29 11:53:47 -07:00
sabaimran	ab7dabe74f	Explicitly use Union type for function parameters for lint checks	2023-06-29 11:44:30 -07:00
sabaimran	601b738135	Bonus: Rename all md files to markdown for cleanliness	2023-06-29 11:27:47 -07:00
sabaimran	fecf6700d2	Limit small image rendering to just the avatar images	2023-06-29 11:27:18 -07:00
sabaimran	70e550250a	Add an additional data source for issues from Github repositories + quality of life updates - Use a request session to reduce the overhead of setting up a new connection with the Github URL each request - Use the streaming feature for the REST api to reduce some of the memory footprint	2023-06-29 10:59:54 -07:00
Debanjum Singh Solanky	5f2717cc4b	Use logger.warning since logger.warn is deprecated	2023-06-28 22:15:27 -07:00
Debanjum Singh Solanky	5f7eaa7ded	Add trio, move freezegun, factory-boy to project test dependencies	2023-06-28 22:07:02 -07:00
Debanjum Singh Solanky	56ce97ef9e	Use async/await in tests for query method of text and image search The text, image search query method has become async. So async/await is required to get results correctly in tests etc	2023-06-28 22:07:02 -07:00
Debanjum Singh Solanky	f516d127c8	Update client tests to expect "all" as a valid new content type	2023-06-28 22:07:02 -07:00
Debanjum Singh Solanky	b1767f93d6	Get any configured asymmetric search model to encode query for search - Set image_search.query to async to use it with multi-threading This is same as text_search.query being set to an async method - Exit search early if no search_model is defined in state.model	2023-06-28 22:07:02 -07:00
Debanjum Singh Solanky	8eae7c898c	Put each result under org heading when query for "all" content type in khoj.el - Add "all" as default content type when no content type retrieved from server	2023-06-28 22:07:02 -07:00
Debanjum Singh Solanky	630bf995f1	Style each result based on its content type in same view on Khoj web - So when searching across content types (with content-type = "all") org-mode results get rendered differently than markdown, PDF etc. results - Set div class for each result separately instead of a single uber div for styling. This allows styling div of each result based on the content-type of that result - No need to create placeholder "all" content type on web interface as server is passing an all content type by itself	2023-06-28 22:07:01 -07:00
Debanjum Singh Solanky	1773a78339	Fix createRequestUrl method signature to fetch results from khoj web	2023-06-28 12:10:45 -07:00
Debanjum Singh Solanky	212b1a96c8	Create "all" search type for search across all content types on khoj server Allows moving logic to handle search across all content types to server from clients	2023-06-28 11:34:26 -07:00
Debanjum Singh Solanky	0636ceaf14	Merge branch 'master' of github.com:khoj-ai/khoj into parallelize-search-across-all-asymmetric-text-content-types Conflicts: - src/khoj/routers/api.py: Use theirs	2023-06-27 16:10:32 -07:00
Debanjum Singh Solanky	510bb7e684	Use typing union in text_search for python 3.8 compatible type hinting	2023-06-27 15:59:50 -07:00
Debanjum Singh Solanky	1b11d5723d	Extract search request URL builder into js function in web interface	2023-06-27 15:50:41 -07:00
Debanjum Singh Solanky	09f739b8cc	Null check config, log warning instead of error when configuring search	2023-06-27 15:48:48 -07:00
sabaimran	c0d35bafdd	Merge pull request #250 from khoj-ai/features/github-multi-repo-and-more Support multiple Github repositories and support indexing of multiple file types	2023-06-27 15:14:49 -07:00
sabaimran	9d62d66a77	Simplify construction of repo shorthand in GithubToJsonl	2023-06-27 15:05:03 -07:00
sabaimran	2697c7a186	Update org tests to use new method, update Github configuration in tests	2023-06-27 15:04:48 -07:00
sabaimran	227169ebde	Support configuration of multiple Github repositories in the settings interface - Add cards to configure each of the Github repositories - Fix a bug in the API which caused all other settings to be wiped when updating one of the content types - Provide an error message to the user if they have a misconfiguration in their chat settings	2023-06-27 14:10:09 -07:00

... 53 54 55 56 57 ...

4012 commits