khoj/tests/data/notes at 4a90972e3860ff769e1152c60768be0499263bd8 - sij/khoj

mirror of https://github.com/khoj-ai/khoj.git synced 2024-11-27 17:35:07 +01:00

interface_emacs_readme.org	Improve test data organization and update corresponding conftests	2022-01-29 02:03:17 -05:00
main_readme.org	Use a better model for asymmetric semantic search	2022-07-18 20:27:26 +04:00

Debanjum Singh Solanky 4a90972e38 Use a better model for asymmetric semantic search

- The multi-qa-MiniLM-L6-cos-v1 model is more extensively benchmarked[1]
- It has the right mix of query speed, model size and benchmark performance
- On Hugging Face it has far more downloads and likes than the msmarco model[2]
- On a very preliminary evaluation of the model
  - It doubles the encoding speed for all entries (down from ~8 min to 4 min)
  - It returned more entries that stay relevant to the query (3/5 vs 1/5 earlier)

[1]: https://www.sbert.net/docs/pretrained_models.html
[2]: https://huggingface.co/sentence-transformers

2022-07-18 20:27:26 +04:00
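
The "3/5 vs 1/5" comparison in the commit message above reads like a precision-at-5 check on cosine-similarity rankings. The following is a minimal sketch of that kind of check, not code from the khoj repository: the function names and the toy two-dimensional vectors are hypothetical, and in practice the vectors would be embeddings produced by the model named in the commit.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, entry_vecs, k=5):
    """Return indices of the k entries most similar to the query, best first."""
    scored = [(cosine_similarity(query_vec, vec), idx)
              for idx, vec in enumerate(entry_vecs)]
    # Sort by descending score; break ties by the lower index for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:k]]


def precision_at_k(ranked_ids, relevant_ids, k=5):
    """Fraction of the top-k results that a reviewer marked as relevant."""
    hits = sum(1 for idx in ranked_ids[:k] if idx in relevant_ids)
    return hits / k


# Toy vectors standing in for real embeddings (hypothetical data).
query = [1.0, 0.0]
entries = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
ranking = top_k(query, entries, k=3)

# Hypothetical relevance judgments: 3 of 5 results relevant, then 1 of 5.
new_model_precision = precision_at_k([0, 1, 2, 3, 4], {0, 2, 4}, k=5)
old_model_precision = precision_at_k([0, 1, 2, 3, 4], {1}, k=5)
```

With judgments like these, the two precision values correspond to the 3/5 and 1/5 figures quoted in the commit message; a sample of five results per query is small, which matches the commit's own "very preliminary" caveat.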
