sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-29 02:13:02 +01:00

Author	SHA1	Message	Date
Debanjum Singh Solanky	546fad570d	Use regex to extract include, exclude filter words from query	2022-09-03 23:41:43 +03:00
Debanjum Singh Solanky	ffb8e3988e	Use Python Logging Framework to Time Performance of Explicit Filter	2022-09-03 22:24:10 +03:00
Debanjum Singh Solanky	c7de57b8ea	Pre-compute entry word sets to improve explicit filter query performance	2022-09-03 16:16:31 +03:00
Debanjum Singh Solanky	094bd18e57	Use python standard logging framework for app logs - Stop passing verbose flag around app methods - Minor remap of verbosity levels to match python logging framework levels - verbose = 0 maps to logging.WARN - verbose = 1 maps to logging.INFO - verbose >=2 maps to logging.DEBUG - Minor clean-up of app: unused modules, conversation file opening	2022-09-03 14:43:32 +03:00
Debanjum Singh Solanky	d0531c3064	Update URL QueryParam when Type set in Dropdown on Web Interface - This also pushes the updated URL state to history - Allows jumping back to the web interface after clicking on an image and having the type set to image search - Previously type would get reset to the default search type on jumping back	2022-08-28 12:22:22 +03:00
Debanjum Singh Solanky	2eae32d743	Time, Log Image Search Performance	2022-08-28 00:28:46 +03:00
Debanjum Singh Solanky	c3ca99841b	Scale down images to generate image embeddings faster, with less memory - CLIP doesn't need full size images for generating embeddings with decent search results. The sentence transformers docs use images scaled to 640px width - Benefits - Normalize image sizes - Increase image embeddings generation speed - Decrease memory usage while generating embeddings from images	2022-08-24 14:09:02 +03:00
Debanjum Singh Solanky	ea4fdd9134	Fix logic to ignore notes with no body. Add tests to prevent regression - Notes with empty newlines in body were not being ignored - Add regression tests to avoid above regression in org_to_jsonl conversion	2022-08-21 19:41:40 +03:00
Debanjum	144986ebfd	Fix, Improve Desktop GUI Splash Screen and Main Window - `5e6625a` Fix file browser to not add empty line when no file/dir selected - `8098b8c` Bring main window to Top when open from System Tray - `1c122a8` Place window near top so buttons are not hidden by OS bottom bar - `dfe2546` Set Khoj Icon on Main Desktop Window - `1b1f8f9` Move Splash screen text below icon. Set the text color to black - `450f644` Fix path to remove shared libraries when packaging the Windows app	2022-08-20 23:19:01 +00:00
Debanjum Singh Solanky	5e6625ac68	Fix file browser to not add empty line when no file/dir selected - When no file selected in file browser an empty line/entry gets added to input entries list - Bug got introduced due to insufficient update on change to add instead of insert - Update is_none_or_empty helper method to also check for empty string	2022-08-21 02:03:28 +03:00
Debanjum Singh Solanky	8098b8c3a8	Bring Configure Window to Top when Opened from System Tray - Previously the window could get hidden behind other app windows when user clicked configure from the system tray	2022-08-20 23:38:43 +03:00
Debanjum Singh Solanky	1c122a8a91	Place window near top so buttons are not hidden by OS bottom bar	2022-08-20 22:38:06 +03:00
Debanjum Singh Solanky	dfe2546c04	Set Khoj Icon on Main Desktop Window	2022-08-20 20:36:15 +03:00
Debanjum Singh Solanky	82d2891765	Do not pass ML compute `device' around as argument to search funcs - It is a non-user configurable, app state that is set on app start - Reduce passing unneeded arguments around. Just set device where required by looking for ML compute device in global state	2022-08-20 14:44:53 +03:00
Debanjum Singh Solanky	acc9091260	Use MPS on Apple Mac M1 to GPU accelerate Encode, Query Performance - Note: Support for MPS in Pytorch is currently in v1.13.0 nightly builds - Users will have to wait for PyTorch MPS support to land in stable builds - Until then the code can be tweaked and tested to make use of the GPU acceleration on newer Macs	2022-08-20 14:44:06 +03:00
Debanjum Singh Solanky	7de9c58a1c	Load models, corpus embeddings onto GPU device for text search, if available - Pass device to load models onto from app state. - SentenceTransformer models accept device to load models onto during initialization - Pass device to load corpus embeddings onto from app state	2022-08-20 14:04:18 +03:00
Debanjum Singh Solanky	dc8dcc94a6	Bump Khoj.el package version to 0.1.6	2022-08-19 20:48:42 +03:00
Debanjum Singh Solanky	ffbf15eff8	Add helper function to identify when app running as pyinstaller app Useful for when want the app to behave differently in pyinstaller app scenario with frozen python. And in development scenarios	2022-08-19 19:17:54 +03:00
Debanjum Singh Solanky	6c5c1c33c1	Turn off Tokenizers Parallelism. Khoj doesn't support it right now - Forking and multiprocess are problemantic in frozen python scenarios. This will cause issues when running App packaged by pyinstaller	2022-08-19 19:17:54 +03:00
Debanjum Singh Solanky	d4072974d7	Use of XMP metadata in Khoj Image Search is broken. Disable by default - CLIP Image score and XMP metadata score are not combining well. When combined they give non sensical results. Enable only once figure how best to combine the two. - Show scores with higher precision for image search - Image search scores seem to be mostly be between 0.2 - 0.3 for some reason - Higher precision scores make it easier to understand the quality of returned results perceived by the model itself	2022-08-19 19:17:28 +03:00
Debanjum Singh Solanky	7c4417126c	Append files, directories selected by user to config in Desktop GUI - Allows adding multiple image directories via GUI - Allow adding multiple files in different directories via GUI - Previously users couldn't add multiple directories via GUI They'd have to manually append to input field if multiple files, directories - To clear/overwrite is much easier. The user can just select text to delete in input area	2022-08-19 19:16:10 +03:00
Debanjum Singh Solanky	00ddcfdac8	Use .ico icon when packaging for Windows (and Linux) using Pynstaller	2022-08-19 19:16:10 +03:00
Debanjum Singh Solanky	60dacf3f2c	Show splash screen on app start. Only supported on Windows, Linux	2022-08-19 19:16:10 +03:00
Debanjum Singh Solanky	0079c13bf7	Set input-directories in config for image search type on Desktop GUI - Issue Fix configuring image search from Desktop GUI. It was broken before. The Desktop GUI was updating input-files field under content-type > image. This field is not used for image search. So image search couldn't be configured from the Desktop GUI - Fix - Set input-directories when field of search type image is set from GUI - Otherwise set input-files field in config	2022-08-18 18:29:55 +03:00
Debanjum Singh Solanky	c4fd661909	Move the experimental /chat API to under /beta/chat	2022-08-16 16:36:15 +03:00
Debanjum Singh Solanky	b8913476ba	Fix if condition in router to trigger markdown search	2022-08-16 00:37:16 +03:00
Debanjum Singh Solanky	9bc4fd539e	Set Web Interface URL from loaded state in Desktop GUIs. Not hard-coded	2022-08-16 00:37:16 +03:00
Debanjum Singh Solanky	7f479b0104	Improve Displaying Error to User on Khoj window in Desktop GUI - Show a helpful error message in the GUI to the user, instead of the crashing if loading config fails, for e.g if file wasn't found - Collate GUI errors into an ErrorType enum class - Remove previous error messages before showing the new one	2022-08-16 00:37:16 +03:00
Debanjum Singh Solanky	873bb9dd97	Do not force the Khoj window to always be on top. It's needlessly annoying	2022-08-16 00:37:16 +03:00
Debanjum Singh Solanky	67ab40bb01	Regenerate embeddings everytime user clicks configure in Desktop GUI Previously if the embeddings were already there only the khoj.yml config file would get updated. The embeddings would remain old. 1. This results in a stale app state where the config doesn't match the embeddings 2. Currently the user cannot update their config from the config screen. They'd have to use a combination of config screen and web interface>regenerate button to trigger it or delete their ~/.khoj dir This commit should resolve the above issues	2022-08-16 00:37:16 +03:00
Debanjum Singh Solanky	2647e6bab4	Display re-ranked results triggered via keybinding in khoj.el - Prevent immediate overwrite of re-ranked results by incremental-search without rerank triggered via post-command-hook. - This triggers right after the reranking results are rendered, so user never ends up seeing them	2022-08-15 18:41:12 +03:00
Debanjum Singh Solanky	a91d2df300	Simplify Emacs interface to only rerank results on explicit command	2022-08-15 06:20:13 +03:00
Debanjum Singh Solanky	e846829a2e	Reset Khoj.el version to align with Khoj package version	2022-08-15 06:20:13 +03:00
Debanjum Singh Solanky	fed0b591af	Package Khoj as Debian app in Github Release Workflow	2022-08-14 05:07:58 +03:00
Debanjum Singh Solanky	541e03da3d	Make khoj.el pass checkdoc, package-lint, flycheck checks - Add docstrings, mention args in them. Make docstring crisper - prefix funcs, variables with khoj-- - Require emacs >27.1 for json-parse-buffer - Use lexical binding - Add quickstart docs to elisp file itself - Bump version of khoj.el	2022-08-13 21:37:41 +03:00
Debanjum Singh Solanky	3300378804	Minimal formatting to render beancount results legibly on web interface	2022-08-13 05:03:45 +03:00
Debanjum Singh Solanky	a0759dd923	Convert Configure Screen into the Main Application Window - What - Convert the config screen into the main application window with configuration as just one of the functionality it provides - Rename config screen to main window to match new designation - Why - System Tray isn't available everywhere (e.g Linux) - This requires moving functionality into a normal window for cross-compat	2022-08-13 02:05:52 +03:00
Debanjum Singh Solanky	684f497abe	Handle no System Tray on Linux (Gnome) - What - On Linux - Show Configure Screen, even if not first run experience - Do no show system tray on Linux - Quit app on closing Configure Screen - On Windows, Mac - Show Configure screen only if first run experience - Show system tray always - Do not quit app on closing Configure Screen - Why - Configure screen is the only GUI element on Linux. So closing it should close the application - On Windows, Mac the system tray exists, so app should not be closed on closing configure screen	2022-08-13 01:00:20 +03:00
Debanjum Singh Solanky	c2815c5d09	Open Search from Khoj Configure Screen - Start evolving configure screen away from just being a configure screen - Update Window Title to just say Khoj - Allow Opening Web Interface to Search from Khoj configure screen - Rename "Start" Button to more accurate "Configure" - Disable Search button on first run and while configuring app	2022-08-13 00:43:49 +03:00
Debanjum Singh Solanky	28a91ad1fd	Deep copy the default_config constant to prevent it being overwritten - Issue - In the previous form, updates to self.current_config would update default_config as python does a shallow copy - So self.current_config is just referencing the values of default_config - Hence updates to current_config updates the default_config values too - This is not what we want - Fix - Deep copy the default_config values. Now updates to self.current_config wouldn't affect the default_config	2022-08-12 23:54:16 +03:00
Debanjum Singh Solanky	62ac41ce3b	Reload settings in a separate thread to not freeze Config Screen - Generating embeddings takes time - If user enables a content type and clicks start. The app starts to generate embeddings when loading the new settings - Run this function in a separate thread to keep config screen responsive - But disable start button to prevent re-entrant threads - Also show a minimal visual indication that the app is saving state	2022-08-12 23:34:00 +03:00
Debanjum Singh Solanky	927547d0af	Update Title of Configure Screen to follow "<Screen> - App" pattern	2022-08-12 22:53:10 +03:00
Debanjum Singh Solanky	32ac1ea1b6	Allow user to quit application from the terminal via SIGINT Call python interpreter at regular interval to handle any interrupt signals. create custom handler to terminate server and application	2022-08-12 21:11:58 +03:00
Debanjum Singh Solanky	43301d488a	Increase Width of Configure Screen	2022-08-12 18:34:47 +03:00
Debanjum Singh Solanky	9baea9c9fd	Let Input Fields Wrap. Adjust Height based on Text in Field - Convert Input Fields into PlainTextEdit - Display Each Selected File on a Separate Line in Field - Set Height of FileBrowser Input Field based on Number of Lines/Files	2022-08-12 18:33:56 +03:00
Debanjum Singh Solanky	b7b96110e9	Rename FileBrowser Button Text to "Select" instead of "Add"	2022-08-12 17:08:40 +03:00
Debanjum Singh Solanky	a1c58a9470	Create, Use a Labelled Text Field for the Conversation Input Field - This fixes the field expanding when configure screen is expanded - Allows for reusability of the labelled text field - Simplifies the logic to save settings for conversation processor	2022-08-12 16:59:15 +03:00
Debanjum Singh Solanky	fa7e36cada	Rename external .js files to .min.js to mark them as vendored - Excludes from Github language stats. See linguists/vendor.yml for exclusion rules - Signifies them as external for Khoj developers too	2022-08-12 04:08:50 +03:00
Debanjum Singh Solanky	110e3df0b7	Set default config in the constant module. Use from there to configure app - Avoid having to pass the khoj_sample.yml data file into pip, native apps - Packaging data files into python packages is annoying. - There's `MANIFEST.in`, `data_files` and `package_data` in setup.py - Bdist, wheel, generated source tarball use different set of these fields and put the data files in different locations - Rather just code the default config into a constant. Avoid pointless file reads as well this way	2022-08-12 02:18:46 +03:00
Debanjum Singh Solanky	fad2f3a2e7	Resolve config_file to absolute right at start on parsing args in cli - Assume path is absolute in yaml util module while saving, loading file - This follows same convention as jsonl. Which just operates on passed file path, assuming it is of appropriate form. Responsibility to put it in appropriate form is on the caller, for now	2022-08-12 01:34:08 +03:00
Debanjum Singh Solanky	44fe70513a	Handle situation where default config directory or file does not exist - Include khoj_sample.yml in pip package to load default config from - Create khoj config directory if it doesn't exist - Load config from khoj_sample.yml if khoj.yml config doesn't exist	2022-08-12 01:17:34 +03:00
Debanjum Singh Solanky	41520e1608	Improve Docstring for Configure Screen and System Tray class, funcs	2022-08-11 23:36:02 +03:00
Debanjum Singh Solanky	a748acfeeb	Merge branch 'master' of github.com:debanjum/khoj into create-native-gui Conflicts: - src/main.py - router functions have moved to router - move logic to handle null query perf timer variables into router.py - set main.py to current branch, not master	2022-08-11 21:09:42 +03:00
Debanjum Singh Solanky	6af2d6bb6d	Add Flag to Start App without Native GUI	2022-08-11 20:59:57 +03:00
Debanjum Singh Solanky	b74ca1def6	Wrap error message instead of expanding screen to show message	2022-08-11 20:51:56 +03:00
Debanjum Singh Solanky	2646fa825b	Get Files from File input line to match user expectation - If a user manually edits the input file lines, clicking start should use that. Currently it just looks at the files selected last via file browser - We want to allow users to manually enter file paths in field. Which is why the field hasn't been set to read-only	2022-08-11 20:48:45 +03:00
Debanjum Singh Solanky	dad9133598	Split save_settings method into smaller methods for modularization	2022-08-11 20:00:52 +03:00
Debanjum Singh Solanky	56ba91fec8	Remove unused methods in file browser widget. Improve name of existing	2022-08-11 19:46:09 +03:00
Debanjum Singh Solanky	fd4e41495c	Use appropriate label for directory input types to minimize confusion	2022-08-11 19:45:19 +03:00
Debanjum Singh Solanky	c1e1466fb1	Validate new config before write. Show error if new config invalid	2022-08-11 19:18:22 +03:00
Debanjum Singh Solanky	1ff049599f	Show current config on config screen. Load default config if config unset - Track current (saved/loaded) config separate from the new config (to be written) when user clicks Start - Fallback to using default config when no config for the specific content type or processor is specified in khoj.yml - Earlier were only loading default config on first run, not after - Create Child CheckBox, LineEdit classes for Processor Widgets - Create ProcessorType, similar to SearchType - Track ProcessorType the widgets are associated with - Simplify update, save, load of config based on type	2022-08-11 19:11:25 +03:00
Debanjum Singh Solanky	23e06f483d	Do not emit type tags when dumping config YAML to file	2022-08-11 19:08:36 +03:00
Debanjum Singh Solanky	678fb6a3c7	Add Settings Panel for Conversation Settings to Config Screen	2022-08-11 04:52:40 +03:00
Debanjum Singh Solanky	c1fcf44405	Initialize Settings on Config Screen with Existing Settings from File	2022-08-11 04:51:33 +03:00
Debanjum Singh Solanky	3cec6229ad	Hot swap backend config via config screen start button click - Update configuration to use by the backend, while app is running - Trigger after user hits start button with their config. The config gets written to khoj.yml file first, then the updated config is loaded onto memory	2022-08-11 00:32:11 +03:00
Debanjum Singh Solanky	f7fdf8d8ce	Refactor app start to start server even if backend not configured - Decouple configuring backend from starting server. Backend search and processors can be configured after the backend server has started - Set global state in main instead of in configure_server method. This allows the app to start even if configure_server exits early in the first run scenario, where no config available to configure server - Now start server, even if no config, before GUI started in main - This refactor of app startup flow will allow users to configure backend using the configure screen after server start	2022-08-11 00:13:14 +03:00
Debanjum Singh Solanky	34018c7d4b	Store args passed from commandline at app start in global app state	2022-08-11 00:11:35 +03:00
Debanjum Singh Solanky	cc6ef0f450	Save configure screen settings to app config yaml on clicking Start	2022-08-10 23:10:39 +03:00
Debanjum Singh Solanky	dae65c5b6b	Create child class of Qt CheckBox to track search type it enables/disables	2022-08-10 22:44:37 +03:00
Debanjum Singh Solanky	f42f54019b	Type parent_layout passed as arguments to ConfigureScreen methods	2022-08-10 22:43:20 +03:00
Debanjum Singh Solanky	f63f11186f	Pass config file for app to configure screen	2022-08-10 22:42:32 +03:00
Debanjum Singh Solanky	82a7059b6a	Only setup conversation processor if it has configuration set	2022-08-10 22:34:03 +03:00
Debanjum Singh Solanky	9628ca073c	Extract conversation processor from config into separate function - Only pass processor config arg required by configure_processor. Not the unused full config object - Type arguments passed to methods configure processors - Import json for use by conversation processor to load logs	2022-08-10 22:33:33 +03:00
Debanjum Singh Solanky	62eb66b8ca	Rename load_config_from_file to more descriptive parse_config_from_file	2022-08-10 22:28:51 +03:00
Debanjum Singh Solanky	328cc00439	Create global constant to store app root directory	2022-08-10 20:09:03 +03:00
Debanjum Singh Solanky	d2c7b28172	Extract code to load config from YAML file into new utils.yaml module	2022-08-10 20:07:44 +03:00
Debanjum Singh Solanky	150ae19660	Indent Timestamps, Drawers at Body Level in OrgNode Entry Representation	2022-08-10 18:55:37 +03:00
Debanjum Singh Solanky	fd31d339c1	Remove spurious space in Entries without Todo in OrgNode Entry Repr	2022-08-10 13:48:44 +03:00
Debanjum Singh Solanky	eddf88f818	Org buffer customization settings to tail of khoj.el results buffer - Results get priority screen real estate - Allows quick speed key based traversal of results as cursor on switching to buffer is at top level heading - E.g C-x o n n o 2 jumps to entry in actual file of second result - Unlike before when it is at the #+STARTUP org buffer customization settings	2022-08-10 12:57:37 +03:00
Debanjum Singh Solanky	daef276fd1	Add files for each search type. Extract config on clicking start - Only allow adding files with appropriate file extension for each search type - e.g .org for org-mode search, directory for image search - Extract file paths added to config and enablement state of each search type - This extracted state will be used to populate the khoj.yml config file	2022-08-10 03:27:22 +03:00
Debanjum Singh Solanky	d74134e6cc	Reuse Single Method to Create Setting Panels for each Search Type	2022-08-09 23:50:43 +03:00
Debanjum Singh Solanky	509d52e2cd	Toggle Editability instead of Visibility of Per Search Type Settings - Simplifies the configure screen layout and allows it to be of constant width - It was buggy, the configure screen would dynamically expand but not restore back to original size on disabling search type after enable	2022-08-09 23:34:54 +03:00
Debanjum Singh Solanky	3c788f1d29	Rename configure window to more generic configure screen	2022-08-09 22:44:05 +03:00
Debanjum Singh Solanky	c50ab7c3ad	Split config settings GUI into functions. Convert Config Window to Dialog	2022-08-09 22:36:41 +03:00
Debanjum Singh Solanky	664713b24e	Extract Qt GUI code from main.py into separate interface/desktop dir	2022-08-09 22:12:29 +03:00
Debanjum Singh Solanky	84c1fc701d	Fix query timing variables from being referenced before assignment	2022-08-09 21:06:37 +03:00
Debanjum Singh Solanky	57026b802c	Set size of rendered images using user customizable vars	2022-08-09 21:06:37 +03:00
Debanjum Singh Solanky	0a758c9f0f	By default, wait for 2 seconds before initiating rerank in khoj.el - Subjectively, previous default seems to aggressive based on usage Doesn't give time for user to think and type their query	2022-08-09 21:06:30 +03:00
Debanjum Singh Solanky	f01fb16ebb	Use single hyphen in name of user configurable variables in khoj.el - Follow convention, two hyphens indicate variable private to library - Defcustom are user configurable variables. So they should have single - - Use khoj-results-count variable directly in code	2022-08-09 20:49:34 +03:00
Debanjum Singh Solanky	cd59982c9c	Add Qt Button to save Khoj configuration in Khoj Configuration Window	2022-08-09 20:42:44 +03:00
Debanjum Singh Solanky	2c77caf06c	Group ledger, org setting widgets into child Qt widgets of config window	2022-08-09 20:42:44 +03:00
Debanjum Singh Solanky	027da719aa	Open Configure Window on First Run or from System Tray - Trigger FRE if no config loaded. Open Configure Window automatically - Else user can manually open config window from App on System Tray	2022-08-09 17:05:27 +03:00
Debanjum Singh Solanky	a588a8e21f	Make config_file an optional argument. It can be generated on FRE - Make config_file an optional arg. It defaults to default khoj config dir - Return args.config as None if no config_file explicitly passed by user - Parent can use args.config = None as signal to trigger first run experience	2022-08-09 17:02:02 +03:00
Debanjum Singh Solanky	21af122447	Clean up unused methods, module imports. Add comments	2022-08-09 16:59:38 +03:00
Debanjum Singh Solanky	80fa9fde6a	Quit GUI via SysTray instead of sys.exit to cleanly terminate server	2022-08-08 23:49:26 +03:00
Debanjum Singh Solanky	e5691f9d1d	PyInstaller Spec to Wrap Khoj into a Basic Native App - Verified functionality on MacOS - Add ICNS Icon to use as MacOS App Icon - Spec generated by PyInstaller: ```sh pyinstaller \ src/main.py \ --windowed \ --onefile \ --name "Khoj" \ --target-arch arm64 \ -i src/interface/web/assets/icons/favicon.icns \ --add-data "src/interface/web:src/interface/web" \ --copy-metadata tqdm \ --copy-metadata regex \ --copy-metadata requests \ --copy-metadata packaging \ --copy-metadata filelock \ --copy-metadata numpy \ --copy-metadata tokenizers ```	2022-08-08 23:23:02 +03:00
Debanjum Singh Solanky	ef009323e7	Use sys.exit to quit via system tray. Fix pip install cmd in Readme	2022-08-08 21:42:36 +03:00
Debanjum Singh Solanky	eacd95bebd	Start Creating Native Configure Page using PyQt	2022-08-08 18:31:47 +03:00
Debanjum Singh Solanky	dddc57e132	Rename get-enabled-search-types to get-enabled-content-types as more appropriate	2022-08-07 18:53:14 +03:00
Debanjum Singh Solanky	127c6e78df	Only show keybindings for enabled search types in simple info menu too Convert the khoj--keybindings-info-message into a func Dynamically generate info menu Show keybindings for enabled search types only	2022-08-07 18:40:35 +03:00
Debanjum Singh Solanky	d08c25b62b	Make default search type used in the Emacs interface configurable	2022-08-07 18:24:53 +03:00
Debanjum Singh Solanky	5a10c47499	Allow setting music as search type in khoj.el. Had forgotten to include it earlier	2022-08-07 18:24:53 +03:00
Debanjum Singh Solanky	ebee716026	Only show keybindings reference for enabled search types in khoj.el	2022-08-07 18:24:53 +03:00
Debanjum Singh Solanky	6dc9801f45	Get Khoj search-types enabled by user in Emacs	2022-08-07 18:24:53 +03:00
Debanjum Singh Solanky	f3c1512c38	Fix to let user to start enter query right after initiating khoj on emacs - Fix regression since moving to use `which-key-show-full-keymap~ - The above function reads user keypress, so eats up 1 keypress before starting to enter query - No way to pass no-paging config via the external function to the internally used which-key--show-keymap function that does allow setting no-paging to not read user keypress - So use the internal function instead and set no-paging arg to t	2022-08-07 15:57:08 +03:00
Debanjum Singh Solanky	e95686c89c	Show complete Khoj keybindings when initiate search in Emacs - The keybindings to select search types was previously confusing as it only highlighted the final symbol to press (the C-x was shown but it wasn't made apparent that it had to be pressed before) - Previously some keybindings unrelated to khoj were also being shown in the which-key popup. Now only the khoj keybindings are visible	2022-08-06 16:36:57 +03:00
Debanjum Singh Solanky	4696eadc02	Fix definition of khoj--search-<content-type> functions in khoj.el	2022-08-06 15:19:01 +03:00
Debanjum Singh Solanky	c5bf051a29	Rename initialize_{search,processor,server} to configure_{search,procesor,server} - Search is being reconfigured multiple times in /regenerate and n/reload. More appropriate name is configure_ rather than initialize_ for it - Standardize name of methods under configure.py	2022-08-06 03:23:02 +03:00
Debanjum Singh Solanky	7b04978f52	Put global state variables into separate state module - Variables storing app, device state aren't constants. Do not mix with actual constants like empty_escape_sequence, web_directory	2022-08-06 03:13:18 +03:00
Debanjum Singh Solanky	b04c84721b	Extract configure and routers from main.py into separate modules - Main.py was becoming too big to manage. It had both controllers/routers and component configurations (search, processors) in it - Now that the native app GUI code is also getting added to the main path, good time to split/modularize/clean main.py - Put global state into a separate file to share across modules	2022-08-06 02:39:18 +03:00
Debanjum Singh Solanky	083fefdd07	Create Native Menu Bar with PyQt to open Search, Config webpages - Run FastAPI server in a separate thread. - This allows starting both the server and gui in parallel - Create System Tray for Khoj - Contains menu items that open search or config pages in browser - Rearrange code to have only the code required to start Backend and GUI in the run() method - Move the backend setup code into a separate method	2022-08-06 01:00:25 +03:00
Debanjum Singh Solanky	9fa3345000	Show available Khoj keybindings to customize search using which-key Fallback to showing simple khoj keybindings info message in echo area when which-key not available	2022-08-05 20:24:29 +03:00
Debanjum Singh Solanky	6a8b2a6936	Do not run incremental search when query is empty	2022-08-05 19:35:42 +03:00
Debanjum Singh Solanky	609cd6e8bb	Show keybindings to set khoj search type in echo area to assist user	2022-08-05 19:35:42 +03:00
Debanjum Singh Solanky	48e4a983c5	Allow switching search type in the middle of querying Khoj on Emacs - More generally, this allows configuring the khoj search anytime while in khoj minibuffer window - Earlier could only configure search type at the start of the search	2022-08-05 19:35:42 +03:00
Debanjum Singh Solanky	48c33b93cc	Generalize khoj keymap to func that can update existing keybdings	2022-08-05 19:35:42 +03:00
Debanjum Singh Solanky	19c4701f3f	Default to ledger search from files with .beancount extensions	2022-08-05 19:35:42 +03:00
Debanjum Singh Solanky	cc9a395e0a	Keep name of buffer for Khoj results in a variable	2022-08-05 19:35:42 +03:00
Debanjum Singh Solanky	0a5c6d067a	Do not prompt user to set search type before querying Khoj via Emacs - What - Default to last used search type, when no search type specified - Allow user to change search type before they enter query (and after they've called khoj), if they want - Why - Reduce time from intent to results by using reasonable defaults - Make interactions smoother, more intuitive	2022-08-05 19:35:38 +03:00
Debanjum Singh Solanky	24ccba74d4	Put type dropdown, regenerate button on same row. Regain screen space	2022-08-05 06:17:43 +03:00
Debanjum Singh Solanky	017e287b8a	Remove redundant query as title in results section - Regain screen real-estate - Remove unused parameters, html being returned by org.js	2022-08-05 06:17:25 +03:00
Debanjum Singh Solanky	06afeec7e2	Hide stars of org entry results on Emacs to reduce visual clutter They've all been normlized to the same level and hence don't hold much data. So good opportunity to reduce, non-useful visual clutter	2022-08-05 05:27:57 +03:00
Saba	d1fe6353b5	Check whether processor_config exists during shutdown event	2022-08-04 21:57:36 -04:00
Debanjum Singh Solanky	4d4d2ff921	Ensure all org entries are unfolded in results buffer on Emacs	2022-08-05 04:54:29 +03:00
Debanjum Singh Solanky	49ef741d4b	Prevent Zoom on Input in Web Interface. Document Pip upgrade in Readme - Name /Reload API Controller Reload	2022-08-05 03:51:34 +03:00
Debanjum Singh Solanky	675e821d95	Make embeddings, jsonl paths absolute. Create directories if non-existent	2022-08-05 02:57:59 +03:00
Debanjum Singh Solanky	d5b43eb836	Use input filter in image search setup. Input filter wasn't used earlier	2022-08-05 02:40:03 +03:00
Debanjum Singh Solanky	ca5a8bd113	Make config file a positional argument, as it is required - Test invalid config file path throws. Remove redundant cli test - Simplify cli parser code - Do not need to explicitly check if args.config_file set. argparser checks for positional arguments automatically - Use standard semantics for cli args - All positional args are required. Non positional args are optional - Improve command line --help description	2022-08-05 01:09:40 +03:00
Debanjum Singh Solanky	1374065092	Mark all required fields for config. Throw if no input_* field specified - Add custom validator to throw if neither input_filter or input_<files\|directories> are specified - Set field expecting paths to type Path - Now that default_config isn't used in code. We can update fields in rawconfig to specify whether they're required or not. This lets pydantic validate config file and throw appropriate error	2022-08-05 01:08:48 +03:00
Debanjum Singh Solanky	f78d6ae754	Create khoj_sample file with all configurable fields in one place - Reason - Simplifies code. No merge_dict required - 1 place for user to see all configurables, defaults and required values - Details - Remove default_config from code. Set defaults in khoj_sample.yml itself - Keep fields required to be set by user as empty in khoj_sample to YAML - Set defaults for fields not requiring configuration by user	2022-08-05 01:08:33 +03:00
Debanjum Singh Solanky	3abf3e5ee0	Update merge_dicts to recursively merge the dictionaries Previously it was only merging dictionary at the first/top level	2022-08-04 22:46:20 +03:00
Debanjum Singh Solanky	61c26ba611	Only show large Khoj favicon on web interface - Do not want browsers to use the small, grainy favicons - Firefox for Android does use the bigger icon, when it's the only one available - Update svg to match the 144x144 ratio just for consistency	2022-08-04 14:33:29 +03:00
Debanjum Singh Solanky	1649fa644c	Autofocus on Query field in Web Interface. Improve time to query	2022-08-04 05:23:19 +03:00
Debanjum Singh Solanky	71fcb1087f	Add icons for web interface to render on more browsers and as PWA Safari, Firefox for Android etc don't support SVG Favicons yet	2022-08-03 18:52:41 +03:00
Debanjum Singh Solanky	5b6b7ec123	Delete khoj network connections on incremental search teardown on Emacs interface Currently only get into this state when debug breakpoints on backend are keeping the connection open and user exits khoj search from Emacs Results in a number of open connections that slow khoj down.	2022-08-03 18:52:41 +03:00
Debanjum Singh Solanky	555c1088cc	Cache queries in /search controller using LRU cache - Most concretely right now, it eliminates the re-rank latency hit on re-rank triggered on user hitting enter after re-rank is already done on user idle in the emacs interface - Improves search latency of (incremental) search	2022-08-03 18:52:41 +03:00
Debanjum Singh Solanky	38df727ef4	Fix escape sequence usage in strings. Remove unneeded import of os Rename /config API method to config to match it's purpose. UI is anyway too generic, and not what it is doing	2022-08-03 18:51:55 +03:00
Debanjum Singh Solanky	f642450ed9	Disable Incremental Search for Images on Web Bug introduced in commit `da118b3fed`	2022-08-03 11:52:51 +03:00
Debanjum Singh Solanky	b9e6273644	Include interfaces in pip package. Fix paths to web interface in app	2022-08-03 00:02:39 +03:00
Debanjum Singh Solanky	1b55462fb0	Convert search_filter, conversation dir to proper modules Add __init__.py files to their directories	2022-08-02 20:23:42 +03:00
Debanjum Singh Solanky	5108d45951	Wrap application startup steps into a method	2022-08-02 20:13:14 +03:00
Debanjum Singh Solanky	0ebfbb43ce	Nest org, md results at level 2 on Emacs interface. Improve readability - Makes it easier to fold/unfold, traverse and read results - This 2 level nesting is already being used on the web interface - Previously we were using the original nesting depth of the entry. This was aimed at providing more of the orginal context of the results. But currently this additional information does not provide as much, for the decreased legibility of the results	2022-08-01 04:01:18 +03:00
Debanjum Singh Solanky	1201bfddf3	Simplify name of config css from config-style.css to config.css	2022-08-01 01:34:00 +03:00
Debanjum Singh Solanky	075dba5d64	Use Khoj Title, Favicon in Config Page for Consistency	2022-08-01 01:27:14 +03:00
Debanjum Singh Solanky	56a4429f01	Move web interface to configure application into src/interface/web directory - Improve code layout by ensuring all web interface specific code under the src/interface/web directory - Rename config API to more specifi /config instead of /ui - Rename config data GET, POST api to /config/data instead of /config	2022-08-01 00:53:42 +03:00
Debanjum	bb2ccec1ca	Populate type dropdown on the web interface with only enabled search types - Previously we were statically populating types dropdown field in the web interface with all available search types - This change populates the type dropdown field with only search types that are enabled/configured - It queries the `/config` backend API to see which of the available search types are configured	2022-08-01 00:20:45 +03:00
Debanjum Singh Solanky	8b6058c879	Fix instantiating type field with value from URL query parameter - Populate via `.then` after enabled search types in dropdown are populated - Call to `/config` API is async and will usually complete after the value of type field is set from url - So value of type field would earlier be overridden when search types dropdown is populated after the call to `/config` API completes	2022-08-01 00:04:50 +03:00
Debanjum Singh Solanky	be253bab39	Populate type dropdown with only enabled search types in web interface - Get /config API and check config for which available search types is populated. This gives us the list of enabled search types - Dynamically populate search type field with enabled search types only	2022-07-31 23:42:00 +03:00
Debanjum Singh Solanky	0abd40aeb7	Only set query field when appropriate query param passed via URL - Setting query value to default option when query param wasn't passed via URL was overriding placeholder text in query field - We wanted placeholder text in field, not the query field to actually be populated by placeholder text - This clears field when user starts typing query into the query field, instead of them having to manually delete the default text populated	2022-07-31 22:29:23 +03:00
Debanjum Singh Solanky	17c38b526a	Default config for each search types to None - Setting up default compressed-jsonl, embeddings-file was only required for org search_type, while org-files and org-filter were allowed to be passed as command line argument - This avoided having to set compressed-jsonl and embeddings-file via command line argument as well for org search type - Now that all search types are only configurable via config file, We can default all search types to None. The default config for the rest of the search types wasn't being used anyway	2022-07-31 22:23:57 +03:00
Debanjum Singh Solanky	b83021a723	Improve code readability of merge_dicts helper method	2022-07-31 22:07:56 +03:00
Debanjum Singh Solanky	38aede68f2	Only configure org via config file for consistency across search types - Previously org-files were configurable via cmdline args. Where as none of the other search types are - This is an artifact of how the application grew - It can be removed for better consistency and equal preference given all search types	2022-07-31 22:02:03 +03:00
Saba	b55159f5bd	Fix URL for khoj.el quelpa setup instructions	2022-07-29 23:01:04 -04:00
Debanjum Singh Solanky	da118b3fed	Simplify incremental search function used in web interface Re-rank isn't passed to image search API in search function. So don't need to check type in incremental_search function too	2022-07-29 23:18:01 +04:00
Debanjum Singh Solanky	3079614981	Allow set up of search form via query params in web interface - Default search type to org, instead of images	2022-07-29 23:13:26 +04:00
Debanjum Singh Solanky	02ca2c05a1	Add Eagle Icon for Khoj to Web, Emacs Interfaces and Readme	2022-07-29 17:50:29 +04:00
Debanjum Singh Solanky	78314263a0	Add Table of Contents, Features, Performance Details to Readme	2022-07-29 17:08:17 +04:00
Debanjum Singh Solanky	ed181f47c9	Prettify rendering of org music results on Khoj web interface	2022-07-29 04:28:22 +04:00
Debanjum Singh Solanky	7e5291a38e	Make org result headings at same level. Improve spacing of results Having org-mode result headings change size based on their depth in the source document makes is a confusing UI experience. Improve font-size, line-spacing and margins of results to make delineation between entries, and differntiating between entry heading and it's body easier to visually infer. Do not white-space: pre-line. Improves rendering of Markdown results	2022-07-29 01:55:46 +04:00
Debanjum Singh Solanky	4d5183063c	Create images directory if doesn't exist, to store image search results	2022-07-28 21:30:31 +04:00
Debanjum Singh Solanky	a9bc17a6b0	Prettify Render of Markdown Results in Web Interface	2022-07-28 20:56:37 +04:00
Debanjum Singh Solanky	a6ae74f52e	Move JS files like org.js into a separate assets/ directory	2022-07-28 20:46:48 +04:00
Debanjum Singh Solanky	a12eaa4ce0	Move Khoj image results into a child images/ directory	2022-07-28 20:45:12 +04:00
Debanjum	a71253e137	Support Incremental Search on Web Interface ## Support Incremental Search on Khoj Web Interface - Use default, fast path to query /search API while user is typing - Upgrade to cross-encoder re-ranked results once user hits enter on search box ## Improve Render of Org Results on Web Interface - We were previously just wrapping results from /search API into a pre formatted div field. This was not easy to read - Use [org.js](https://mooz.github.io/org-js/) to render results from Khoj `/search` API as proper HTML - Improve org.js to render all task states, stylize task tags and make org-mode results look more like original content Closes #42 #41	2022-07-28 09:31:57 -07:00
Debanjum Singh Solanky	e8029bf415	Extract and Highlight org-mode tags in HTML render of search results	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	c6c248df26	Improve styling of org-mode results to original alignment, line breaks	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	9f59897eeb	Highlight all org-mode task states in HTML. Not just TODO, DONE. - Make logic to extract, mark todo state in org.js more generic - Add default todo state styling to html	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	f040b3f65c	Stylize TODO/DONE states with CSS	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	581b6097c7	Clean Results. Remove TOC, Heading Number and Property Drawers	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	965a93a2f2	Add Basic HTML Rendering of Org-Mode Results	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	1da44d4dfe	Add Incremental Search to Khoj Web Interface	2022-07-28 19:55:15 +04:00
Debanjum Singh Solanky	af1dd31401	Do not pass verbose argument to image_search.query() as not supported	2022-07-28 19:52:58 +04:00
Debanjum Singh Solanky	80ac10835c	Rerank results on normal minibuffer exit In current state: - Rerank results: - If user idles while entering query OR - exits normally - Do not rerank results: - If user exits abnormally, e.g via C-g from query	2022-07-28 03:37:16 +04:00
Debanjum Singh Solanky	1b759597df	Make incremental search more robust. Follow standard user expectations - Rename functions to more standard, descriptive names - Keep known, required code for incremental search - E.g Do not set buffer local flag in hooks on minibuffer setup - Only query when user in khoj minibuffer - Use active-minibuffer-window and track khoj minibuffer - (minibuffer-prompt) is not useful for our use-case here - (For now) Run re-rank only if user idle while querying - Do not run rerank on teardown/completion - The reranking lag (~2s) is annoying; hit enter, wait to see results - Also triggered when user exits abnormally, so C-g also results in rerank which is even more annoying - Emacs will still hang if re-ranking gets triggered on idle but that's better than always getting triggered. And better than not having mechanism to get results re-ranked via cross-encoder at all	2022-07-28 02:52:27 +04:00
Debanjum Singh Solanky	9a6eee31be	Make number of results to get from Khoj API customizable in khoj.el	2022-07-27 18:55:18 +04:00
Debanjum Singh Solanky	9302b45fe0	Use khoj-incremental as the main khoj func. Rename khoj to khoj-simple - Update khoj-simple to work cross-encoder re-ranked results like before - Increment major version as incremental search considered a breaking change and a major update to search capability	2022-07-27 18:18:17 +04:00
Debanjum Singh Solanky	09727ac3be	Make bi-encoder return fewer results to reduce cross-encoder latency	2022-07-27 07:26:02 +04:00
Debanjum Singh Solanky	9ab3edf6d6	Re-rank incremental search results using cross-encoder if user idle This provides a relatively smooth mechanism - to improve relevance of results on idle - while providing the rapid, incremental results while typing	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	ad242cafa7	Support querying all text search types in incremental search - Before incremental search was hard-coded to only query org	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	bfcb962cbe	Use post-command-hook to only query on user input - Hooking into after-change-functions results in system logs triggering query	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	0d49398954	Reuse code to query api, render results. Formalize method, arg names	2022-07-27 07:25:42 +04:00
Debanjum Singh Solanky	fd1963d781	Implement Basic Incremental Search Interface in Emacs for Org Mode Notes	2022-07-27 03:05:00 +04:00
Debanjum Singh Solanky	3fa7d8f03a	Skeleton to allow incremental search on Khoj via Emacs	2022-07-27 02:48:27 +04:00
Debanjum Singh Solanky	1168244c92	Make cross-encoder re-rank results if query param set on /search API - Improve search speed by ~10x Tested on corpus of 125K lines, 12.5K entries - Allow cross-encoder to re-rank results by settings &?r=true when querying /search API - It's an optional param that default to False - Earlier all results were re-ranked by cross-encoder - Making this configurable allows for much faster results, if desired but for lower accuracy	2022-07-26 22:56:36 +04:00
Debanjum Singh Solanky	b1e64fd4a8	Improve search speed. Only apply filter if filter keywords in query - Formalize filters into class with can_filter() and filter() methods - Use can_filter() method to decide whether to apply filter and create deep copies of entries and embeddings for it - Improve search speed for queries with no filters as deep copying entries, embeddings takes the most time after cross-encodes scoring when calling the /search API Earlier we would create deep copies of entries, embeddings even if the query did not contain any filter keywords	2022-07-26 22:47:26 +04:00
Debanjum Singh Solanky	f094c86204	Trace query response performance and display timings in verbose mode	2022-07-26 21:03:53 +04:00
Debanjum Singh Solanky	65fea7681a	Rename notes search type to org search, now that markdown notes supported	2022-07-21 22:09:44 +04:00
Debanjum Singh Solanky	4c24202e42	Update documentation. Simplify, reflect current capabilities	2022-07-21 22:09:44 +04:00
Debanjum Singh Solanky	d4d7dbaca6	Support Natural Search on Markdown Files - Reason: Allow natural search on markdown based notes, documentation, websites etc - Details: - Create markdown processor to extract Markdown entries (identified by Heading) into standard jsonl format required by text_search - Update API, Configs to support interfacing with new markdown type - Update Emacs, Web clients to support interfacing with new markdown type via API - Update Readme to mentiond markdown is also supported Closes #35	2022-07-21 22:07:05 +04:00
Debanjum Singh Solanky	0602d018c0	Merge Symmetric, Asymmetric Search Types into a single Text Search Type - The code for both the text search types were mostly the same It was earlier done this way for expedience while experimenting - The minor differences were reconciled and merged into a single text_search type - This simplifies the app and making it easier to process other text types	2022-07-21 21:19:52 +04:00
Debanjum Singh Solanky	0917f1574d	Consolidate jsonl helper methods in a single file under utils module	2022-07-21 03:30:13 +04:00
Debanjum Singh Solanky	de726c4b6c	Minor fixes to unused installer utility script	2022-07-21 03:30:13 +04:00
Debanjum Singh Solanky	5aad297286	Reuse logic to extract entries across symmetric, asymmetric search Now that the logic to compile entries is in the processor layer, the extract_entries method is standard across (text) search_types Extract the load_jsonl method as a utility helper method. Use it in (a)symmetric search types	2022-07-21 02:53:18 +04:00
Debanjum Singh Solanky	e220ecc00b	Generate compiled form of each transaction directly in the beancount processor - The logic for compiling a beancount entry (for later encoding) now completely resides in the org-to-jsonl processor layer - This allows symmetric search to be generic and not be aware of beancount specific properties that were extracted by the beancount-to-jsonl processor layer - Now symmetric search just expects the jsonl to (at least) have the 'compiled' and 'raw' keys for each entry. What original text the entry was compiled from is irrelevant to it. The original text could be location, transaction, chat etc, it doesn't have to care	2022-07-21 02:43:28 +04:00
Debanjum Singh Solanky	06cf425314	Generate compiled form of each entry directly in the org-mode processor - The logic for compiling an org-mode entry (for later encoding) now completely resides in the org-to-jsonl processor layer - This allows asymmetric search to be generic and not be aware of org-mode specific properties that were extracted by the org-to-jsonl processor layer - Now asymmetric search just expects the jsonl to (at least) have the 'compiled' and 'raw' keys for each entry. What original text the entry was compiled from is irrelevant to it. The original text could be mail, chat, markdown, org-mode etc, it doesn't have to care	2022-07-21 02:08:02 +04:00
Debanjum Singh Solanky	4ead79d272	Make Notes Search Natural Language Date Aware - Pass Scheduled, Closed Dates of Entries to Include in Embeddings - The (new?) model seems to understand dates. So can give more relevant entries if date in natural language mentioned in query - E.g "Went Surfing with Friends" vs "Went Surfing with Friends in 1984" will give different results, with the second prioritizing entries mentioning any entries with closed, scheduled dates from 1984	2022-07-21 01:06:49 +04:00
Debanjum Singh Solanky	d50bfb5188	Parse Logbook Entries in the OrgNode parser for Org-Mode. Update tests	2022-07-21 00:15:30 +04:00
Debanjum Singh Solanky	70e70d4b15	Rename 'embed' key to more generic 'compiled' for jsonl extracted results - While it's true those strings are going to be used to generated embeddings, the more generic term allows them to be used elsewhere as well - Their main property is that they are processed, compiled for usage by semantic search - Unlike the 'raw' string which contains the external representation of the data, as is	2022-07-20 20:35:50 +04:00
Debanjum Singh Solanky	c1369233db	Consistently use "entry", "score" in json response for all search types - Had already made some progress on this earlier by updating the image search responses. But needed to update the text search responses to use lowercase entry and score - Update khoj.el to consume the updated json response keys for text search	2022-07-20 20:33:27 +04:00
Debanjum Singh Solanky	d68a9dc445	Sort extracted images before computing their embeddings - Image order returned by glob is OS dependent - This prevented sharing image embeddings across machines running different OS - A stable sort order for processed images allows sharing embeddings across machines. - Use case: A more powerful, always on machine actually computes the image embeddings regularly The client machine just load these periodically to provide semantic search functionality	2022-07-20 03:51:27 +04:00

... 2 3 4 5 6 ...

510 commits