sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-24 07:55:07 +01:00

Author	SHA1	Message	Date
Saba	b55159f5bd	Fix URL for khoj.el quelpa setup instructions	2022-07-29 23:01:04 -04:00
Debanjum Singh Solanky	da118b3fed	Simplify incremental search function used in web interface Re-rank isn't passed to image search API in search function. So don't need to check type in incremental_search function too	2022-07-29 23:18:01 +04:00
Debanjum Singh Solanky	3079614981	Allow set up of search form via query params in web interface - Default search type to org, instead of images	2022-07-29 23:13:26 +04:00
Debanjum Singh Solanky	02ca2c05a1	Add Eagle Icon for Khoj to Web, Emacs Interfaces and Readme	2022-07-29 17:50:29 +04:00
Debanjum Singh Solanky	8290ef2a60	Convert Readme to Markdown from Org as better support on Github	2022-07-29 17:08:30 +04:00
Debanjum Singh Solanky	78314263a0	Add Table of Contents, Features, Performance Details to Readme	2022-07-29 17:08:17 +04:00
Debanjum	2d0d85cfda	Add Incremental Search Demo to Readme	2022-07-29 06:14:24 +04:00
Debanjum Singh Solanky	ed181f47c9	Prettify rendering of org music results on Khoj web interface	2022-07-29 04:28:22 +04:00
Debanjum Singh Solanky	7e5291a38e	Make org result headings at same level. Improve spacing of results Having org-mode result headings change size based on their depth in the source document makes is a confusing UI experience. Improve font-size, line-spacing and margins of results to make delineation between entries, and differntiating between entry heading and it's body easier to visually infer. Do not white-space: pre-line. Improves rendering of Markdown results	2022-07-29 01:55:46 +04:00
Debanjum Singh Solanky	4d5183063c	Create images directory if doesn't exist, to store image search results	2022-07-28 21:30:31 +04:00
Debanjum	a29af70de5	Prettify Render of Markdown results on Web Interface ## Details - We were previously just wrapping results from /search API into a pre formatted div field. This was not easy to read - Use [markdown-it](https://github.com/markdown-it/markdown-it) to render markdown results from Khoj `/search` API as proper HTML Closes #43	2022-07-28 10:13:40 -07:00
Debanjum Singh Solanky	a9bc17a6b0	Prettify Render of Markdown Results in Web Interface	2022-07-28 20:56:37 +04:00
Debanjum Singh Solanky	a6ae74f52e	Move JS files like org.js into a separate assets/ directory	2022-07-28 20:46:48 +04:00
Debanjum Singh Solanky	a12eaa4ce0	Move Khoj image results into a child images/ directory	2022-07-28 20:45:12 +04:00
Debanjum	a71253e137	Support Incremental Search on Web Interface ## Support Incremental Search on Khoj Web Interface - Use default, fast path to query /search API while user is typing - Upgrade to cross-encoder re-ranked results once user hits enter on search box ## Improve Render of Org Results on Web Interface - We were previously just wrapping results from /search API into a pre formatted div field. This was not easy to read - Use [org.js](https://mooz.github.io/org-js/) to render results from Khoj `/search` API as proper HTML - Improve org.js to render all task states, stylize task tags and make org-mode results look more like original content Closes #42 #41	2022-07-28 09:31:57 -07:00
Debanjum	8a4ae75897	Support Incremental Search in Khoj # Details ## Improve Search API Latency - Improve Search API Latency by ~50-100x to <100ms - Trade-off speed for accuracy in default, fast path of /search API by not re-ranking results using cross-encoder - Make re-ranking of results via cross-encoder configurable via new `?&r=<false\|true>` query param to /search API - Only deep-copy entries, embeddings to apply filters if query has any filter keywords ## Support Incremental Update via Khoj Emacs Frontend - Use default, fast path to query /search API while user is typing - Upgrade to cross-encoder re-ranked results once user goes idle (or ends search normally) Closes #37	2022-07-28 09:10:50 -07:00
Debanjum Singh Solanky	e8029bf415	Extract and Highlight org-mode tags in HTML render of search results	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	c6c248df26	Improve styling of org-mode results to original alignment, line breaks	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	9f59897eeb	Highlight all org-mode task states in HTML. Not just TODO, DONE. - Make logic to extract, mark todo state in org.js more generic - Add default todo state styling to html	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	f040b3f65c	Stylize TODO/DONE states with CSS	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	581b6097c7	Clean Results. Remove TOC, Heading Number and Property Drawers	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	965a93a2f2	Add Basic HTML Rendering of Org-Mode Results	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	1da44d4dfe	Add Incremental Search to Khoj Web Interface	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	af1dd31401	Do not pass verbose argument to image_search.query() as not supported	2022-07-28 19:52:58 +04:00
Debanjum Singh Solanky	80ac10835c	Rerank results on normal minibuffer exit In current state: - Rerank results: - If user idles while entering query OR - exits normally - Do not rerank results: - If user exits abnormally, e.g via C-g from query	2022-07-28 03:37:16 +04:00
Debanjum Singh Solanky	1b759597df	Make incremental search more robust. Follow standard user expectations - Rename functions to more standard, descriptive names - Keep known, required code for incremental search - E.g Do not set buffer local flag in hooks on minibuffer setup - Only query when user in khoj minibuffer - Use active-minibuffer-window and track khoj minibuffer - (minibuffer-prompt) is not useful for our use-case here - (For now) Run re-rank only if user idle while querying - Do not run rerank on teardown/completion - The reranking lag (~2s) is annoying; hit enter, wait to see results - Also triggered when user exits abnormally, so C-g also results in rerank which is even more annoying - Emacs will still hang if re-ranking gets triggered on idle but that's better than always getting triggered. And better than not having mechanism to get results re-ranked via cross-encoder at all	2022-07-28 02:52:27 +04:00
Debanjum Singh Solanky	9a6eee31be	Make number of results to get from Khoj API customizable in khoj.el	2022-07-27 18:55:18 +04:00
Debanjum Singh Solanky	9302b45fe0	Use khoj-incremental as the main khoj func. Rename khoj to khoj-simple - Update khoj-simple to work cross-encoder re-ranked results like before - Increment major version as incremental search considered a breaking change and a major update to search capability	2022-07-27 18:18:17 +04:00
Debanjum Singh Solanky	09727ac3be	Make bi-encoder return fewer results to reduce cross-encoder latency	2022-07-27 07:26:02 +04:00
Debanjum Singh Solanky	9ab3edf6d6	Re-rank incremental search results using cross-encoder if user idle This provides a relatively smooth mechanism - to improve relevance of results on idle - while providing the rapid, incremental results while typing	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	ad242cafa7	Support querying all text search types in incremental search - Before incremental search was hard-coded to only query org	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	bfcb962cbe	Use post-command-hook to only query on user input - Hooking into after-change-functions results in system logs triggering query	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	0d49398954	Reuse code to query api, render results. Formalize method, arg names	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	fd1963d781	Implement Basic Incremental Search Interface in Emacs for Org Mode Notes	2022-07-27 03:05:00 +04:00
Debanjum Singh Solanky	3fa7d8f03a	Skeleton to allow incremental search on Khoj via Emacs	2022-07-27 02:48:27 +04:00
Debanjum Singh Solanky	1168244c92	Make cross-encoder re-rank results if query param set on /search API - Improve search speed by ~10x Tested on corpus of 125K lines, 12.5K entries - Allow cross-encoder to re-rank results by settings &?r=true when querying /search API - It's an optional param that default to False - Earlier all results were re-ranked by cross-encoder - Making this configurable allows for much faster results, if desired but for lower accuracy	2022-07-26 22:56:36 +04:00
Debanjum Singh Solanky	b1e64fd4a8	Improve search speed. Only apply filter if filter keywords in query - Formalize filters into class with can_filter() and filter() methods - Use can_filter() method to decide whether to apply filter and create deep copies of entries and embeddings for it - Improve search speed for queries with no filters as deep copying entries, embeddings takes the most time after cross-encodes scoring when calling the /search API Earlier we would create deep copies of entries, embeddings even if the query did not contain any filter keywords	2022-07-26 22:47:26 +04:00
Debanjum Singh Solanky	f094c86204	Trace query response performance and display timings in verbose mode	2022-07-26 21:03:53 +04:00
Debanjum	d8efcd559f	Add Feature Section to Readme - Make Architecture a top-level section - Minor improvement to Configure section	2022-07-25 15:43:27 -07:00
Debanjum Singh Solanky	f953b20415	Add Khoj Architecture Diagram in Docs. Show it in the Project Readme	2022-07-26 02:09:51 +04:00
Debanjum Singh Solanky	674d933282	Improve Khoj Intro text. Move Run Unit Test Section under Developement Heading	2022-07-26 02:06:44 +04:00
Debanjum Singh Solanky	3728583e08	Update Readme. Add section for using Khoj via Web interface	2022-07-22 04:02:03 +04:00
Debanjum Singh Solanky	65fea7681a	Rename notes search type to org search, now that markdown notes supported	2022-07-21 22:09:44 +04:00
Debanjum Singh Solanky	1f4b5ac112	Create test markdown files. Use them in sample config, docker-compose	2022-07-21 22:09:44 +04:00
Debanjum Singh Solanky	4c24202e42	Update documentation. Simplify, reflect current capabilities	2022-07-21 22:09:44 +04:00
Debanjum Singh Solanky	d4d7dbaca6	Support Natural Search on Markdown Files - Reason: Allow natural search on markdown based notes, documentation, websites etc - Details: - Create markdown processor to extract Markdown entries (identified by Heading) into standard jsonl format required by text_search - Update API, Configs to support interfacing with new markdown type - Update Emacs, Web clients to support interfacing with new markdown type via API - Update Readme to mentiond markdown is also supported Closes #35	2022-07-21 22:07:05 +04:00
Debanjum Singh Solanky	0602d018c0	Merge Symmetric, Asymmetric Search Types into a single Text Search Type - The code for both the text search types were mostly the same It was earlier done this way for expedience while experimenting - The minor differences were reconciled and merged into a single text_search type - This simplifies the app and making it easier to process other text types	2022-07-21 21:19:52 +04:00
Debanjum Singh Solanky	0917f1574d	Consolidate jsonl helper methods in a single file under utils module	2022-07-21 03:30:13 +04:00
Debanjum Singh Solanky	de726c4b6c	Minor fixes to unused installer utility script	2022-07-21 03:30:13 +04:00
Debanjum Singh Solanky	5aad297286	Reuse logic to extract entries across symmetric, asymmetric search Now that the logic to compile entries is in the processor layer, the extract_entries method is standard across (text) search_types Extract the load_jsonl method as a utility helper method. Use it in (a)symmetric search types	2022-07-21 02:53:18 +04:00

... 2 3 4 5 6 ...

516 commits