Mirror of khoj from Github
Find a file
Debanjum Singh Solanky 44b3247869 Update logical splitting of org-mode text into entries
- Major
  - Do not split org file, entry if it fits within the max token limits
    - Recurse down org file entries, one heading level at a time until
      reach leaf node or the current parent tree fits context window
    - Update `process_single_org_file' func logic to do this recursion

  - Convert extracted org nodes with children into entries
    - Previously org node to entry code just had to handle leaf entries
    - Now it recieve list of org node trees
    - Only add ancestor path to root org-node of each tree
    - Indent each entry trees headings by +1 level from base level (=2)

- Minor
  - Stop timing org-node parsing vs org-node to entry conversion
    Just time the wrapping function for org-mode entry extraction
    This standardizes what is being timed across at md, org etc.
  - Move try/catch to `extract_org_nodes' from `parse_single_org_file'
    func to standardize this also across md, org
2024-04-04 02:41:55 +05:30
.github Update stale Khoj pypi package metadata 2024-03-29 00:06:55 +05:30
documentation Fix docs showing how to setup llama-cpp with Khoj 2024-03-31 15:36:40 +05:30
scripts Fix bump_version.sh to commit, clean-up after desktop app version bump 2023-12-22 21:42:03 +05:30
src Update logical splitting of org-mode text into entries 2024-04-04 02:41:55 +05:30
tests Update logical splitting of org-mode text into entries 2024-04-04 02:41:55 +05:30
.dockerignore Use pypi khoj to fix docker builds and dockerize github workflow 2023-02-19 01:57:01 -06:00
.gitattributes Exclude tests data file from programming stats on Github 2023-08-28 11:00:52 -07:00
.gitignore [Multi-User Part 5]: Add a production Docker file and use a gunicorn configuration with it (#514) 2023-10-26 13:15:31 -07:00
.pre-commit-config.yaml Add isort to the pre-commit configuration and apply it to the whole project (#595) 2023-12-28 18:04:02 +05:30
docker-compose.yml Set default value of KHOJ_DEBUG to False in the docker-compose file 2024-03-01 21:51:13 +05:30
Dockerfile Remove unused git dependency from Docker images 2024-02-16 17:41:09 +05:30
gunicorn-config.py Configure production setup for moving to single worker model 2024-03-30 10:35:55 +05:30
LICENSE Change license to GNU AGPLv3 from GNU GPLv3 2023-11-16 11:14:06 -08:00
manifest.json Release Khoj version 1.8.0 2024-03-31 00:06:15 +05:30
prod.Dockerfile Configure production setup for moving to single worker model 2024-03-30 10:35:55 +05:30
pyproject.toml Rebase with matser 2024-04-02 16:16:06 +05:30
pytest.ini Move the django app into the src/khoj folder for better organization and functionality 2023-11-21 10:56:04 -08:00
README.md Add num online for Discord badge 2024-03-10 17:48:30 +05:30
versions.json Release Khoj version 1.8.0 2024-03-31 00:06:15 +05:30

Khoj Logo

test dockerize pypi Discord

An AI personal assistant for your digital brain


Khoj is an AI application to search and chat with your notes and documents.
It is open-source, self-hostable and accessible on Desktop, Emacs, Obsidian, Web and Whatsapp.
It works with pdf, markdown, org-mode, notion files and github repositories.
It can paint, search the internet and understand speech.


🔎 Search 💬 Chat
Quickly retrieve relevant documents using natural language Get answers and create content from your existing knowledge base
Does not need internet Can be configured to work without internet

Contributors

Cheers to our awesome contributors! 🎉

Made with contrib.rocks.