mirror of
https://github.com/khoj-ai/khoj.git
synced 2024-11-29 02:13:02 +01:00
2f7a6af56a
- What - Hash the entries and compare to find new/updated entries - Reuse embeddings encoded for existing entries - Only encode embeddings for updated or new entries - Merge the existing and new entries and embeddings to get the updated entries, embeddings - Why - Given most note text entries are expected to be unchanged across time. Reusing their earlier encoded embeddings should significantly speed up embeddings updates - Previously we were regenerating embeddings for all entries, even if they had existed in previous runs |
||
---|---|---|
.. | ||
__init__.py | ||
org_to_jsonl.py | ||
orgnode.py |