mirror of
https://github.com/khoj-ai/khoj.git
synced 2024-11-23 15:38:55 +01:00
7c0fd71bfd
- Evaluate khoj on random 200 questions from each of google frames and openai simpleqa benchmarks across *general*, *default* and *research* modes - Run eval with Gemini 1.5 Flash as test giver and Gemini 1.5 Pro as test evaluator models - Trigger eval workflow on release or manually - Make dataset, khoj mode and sample size configurable when triggered via manual workflow - Enable Web search, webpage read tools during evaluation |
||
---|---|---|
.. | ||
build_khoj_el.yml | ||
desktop.yml | ||
dockerize.yml | ||
dockerize_telemetry_server.yml | ||
github_pages_deploy.yml | ||
pre-commit.yml | ||
pypi.yml | ||
release.yml | ||
run_evals.yml | ||
test.yml | ||
test_khoj_el.yml |