mirror of
https://github.com/khoj-ai/khoj.git
synced 2024-11-27 09:25:06 +01:00
2f7a6af56a
- What - Hash the entries and compare to find new/updated entries - Reuse embeddings encoded for existing entries - Only encode embeddings for updated or new entries - Merge the existing and new entries and embeddings to get the updated entries, embeddings - Why - Given most note text entries are expected to be unchanged across time. Reusing their earlier encoded embeddings should significantly speed up embeddings updates - Previously we were regenerating embeddings for all entries, even if they had existed in previous runs |
||
---|---|---|
.. | ||
data | ||
__init__.py | ||
conftest.py | ||
test_chatbot.py | ||
test_cli.py | ||
test_client.py | ||
test_date_filter.py | ||
test_file_filter.py | ||
test_helpers.py | ||
test_image_search.py | ||
test_org_to_jsonl.py | ||
test_orgnode.py | ||
test_text_search.py | ||
test_word_filter.py |