khoj/tests/helpers.py

import os
from datetime import datetime

import factory
from django.utils.timezone import make_aware

from khoj.database.models import (
    ChatModelOptions,
    Conversation,
    KhojApiUser,
    KhojUser,
    OpenAIProcessorConversationConfig,
    ProcessLock,
    SearchModelConfig,
    Subscription,
    UserConversationConfig,
)


class UserFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = KhojUser

    username = factory.Faker("name")
    email = factory.Faker("email")
    password = factory.Faker("password")
    uuid = factory.Faker("uuid4")


class ApiUserFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = KhojApiUser

    user = None
    name = factory.Faker("name")
    token = factory.Faker("password")


class OpenAIProcessorConversationConfigFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = OpenAIProcessorConversationConfig

    api_key = os.getenv("OPENAI_API_KEY")


class ChatModelOptionsFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = ChatModelOptions

    max_prompt_size = 3500
    tokenizer = None
    chat_model = "NousResearch/Hermes-2-Pro-Mistral-7B-GGUF"
    model_type = "offline"
    openai_config = factory.SubFactory(OpenAIProcessorConversationConfigFactory)


class UserConversationProcessorConfigFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = UserConversationConfig

    user = factory.SubFactory(UserFactory)
    setting = factory.SubFactory(ChatModelOptionsFactory)


class ConversationFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = Conversation

    user = factory.SubFactory(UserFactory)


class SearchModelFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = SearchModelConfig

    name = "default"
    model_type = "text"
    bi_encoder = "thenlper/gte-small"
    cross_encoder = "mixedbread-ai/mxbai-rerank-xsmall-v1"


class SubscriptionFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = Subscription

    user = factory.SubFactory(UserFactory)
    type = "standard"
    is_recurring = False
    renewal_date = make_aware(datetime.strptime("2100-04-01", "%Y-%m-%d"))


class ProcessLockFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = ProcessLock

    name = "test_lock"
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00			`import os`
Handle subscribe renew date, langchain, pydantic & logger.warn warnings - Ensure langchain less than 0.2.0 is used, to prevent breaking ChatOpenAI, PyMuPDF usage due to their deprecation after 0.2.0 - Set subscription renewal date to a timezone aware datetime - Use logger.warning instead of logger.warn as latter is deprecated - Use `model_dump' not deprecated dict to get all configured content_types 2024-01-11 21:02:46 +01:00			`from datetime import datetime`
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00
Add isort to the pre-commit configuration and apply it to the whole project (#595) * Apply isort to the entire repository * Fix missing import issues in text_to_entries * Fix imports in migration files 2023-12-28 13:34:02 +01:00			`import factory`
Handle subscribe renew date, langchain, pydantic & logger.warn warnings - Ensure langchain less than 0.2.0 is used, to prevent breaking ChatOpenAI, PyMuPDF usage due to their deprecation after 0.2.0 - Set subscription renewal date to a timezone aware datetime - Use logger.warning instead of logger.warn as latter is deprecated - Use `model_dump' not deprecated dict to get all configured content_types 2024-01-11 21:02:46 +01:00			`from django.utils.timezone import make_aware`
Add isort to the pre-commit configuration and apply it to the whole project (#595) * Apply isort to the entire repository * Fix missing import issues in text_to_entries * Fix imports in migration files 2023-12-28 13:34:02 +01:00
Move the django app into the src/khoj folder for better organization and functionality - Our pypi package currently does not work because the django app and associated database is not included. To remedy this issue, move the app into the src/khoj folder. This has the added benefit of improved organization of the codebase, as all server related code is now in a single folder - Update associated file paths and system references 2023-11-21 19:56:04 +01:00			`from khoj.database.models import (`
[Multi-User Part 8]: Make conversation processor settings server-wide (#529) - Rather than having each individual user configure their conversation settings, allow the server admin to configure the OpenAI API key or offline model once, and let all the users re-use that code. - To configure the settings, the admin should go to the `django/admin` page and configure the relevant chat settings. To create an admin, run `python3 src/manage.py createsuperuser` and enter in the details. For simplicity, the email and username should match. - Remove deprecated/unnecessary endpoints and views for configuring per-user chat settings 2023-11-02 18:43:27 +01:00			`ChatModelOptions,`
Add isort to the pre-commit configuration and apply it to the whole project (#595) * Apply isort to the entire repository * Fix missing import issues in text_to_entries * Fix imports in migration files 2023-12-28 13:34:02 +01:00			`Conversation,`
			`KhojApiUser,`
			`KhojUser,`
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00			`OpenAIProcessorConversationConfig,`
Add tests for the db lock 2024-04-17 09:52:41 +02:00			`ProcessLock,`
Rename SearchModel to SearchModelConfig DB model, Require Cross-Encoder 2023-11-16 02:12:54 +01:00			`SearchModelConfig,`
Add default settings to let new users be subscribed on trial - Add the default user to a subscription trial - Update associated unit tests 2023-11-11 07:38:28 +01:00			`Subscription,`
Add isort to the pre-commit configuration and apply it to the whole project (#595) * Apply isort to the entire repository * Fix missing import issues in text_to_entries * Fix imports in migration files 2023-12-28 13:34:02 +01:00			`UserConversationConfig,`
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00			`)`


			`class UserFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
			`model = KhojUser`

			`username = factory.Faker("name")`
			`email = factory.Faker("email")`
			`password = factory.Faker("password")`
			`uuid = factory.Faker("uuid4")`


[Multi-User Part 4]: Authenticate using API Tokens (#513) ### ✨ New - Use API keys to authenticate from Desktop, Obsidian, Emacs clients - Create API, UI on web app config page to CRUD API Keys - Create user API keys table and functions to CRUD them in Database ### 🧪 Improve - Default to better search model, [gte-small](https://huggingface.co/thenlper/gte-small), to improve search quality - Only load chat model to GPU if enough space, throw error on load failure - Show encoding progress, truncate headings to max chars supported - Add instruction to create db in Django DB setup Readme ### ⚙️ Fix - Fix error handling when configure offline chat via Web UI - Do not warn in anon mode about Google OAuth env vars not being set - Fix path to load static files when server started from project root 2023-10-26 21:33:03 +02:00			`class ApiUserFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
			`model = KhojApiUser`

			`user = None`
			`name = factory.Faker("name")`
			`token = factory.Faker("password")`


Fix openai chat actor, director tests - Update test ChatModelOptions setup since update to it's schema - Fix stale function calls using their updated signatures 2024-06-09 03:46:55 +02:00			`class OpenAIProcessorConversationConfigFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
			`model = OpenAIProcessorConversationConfig`

			`api_key = os.getenv("OPENAI_API_KEY")`


[Multi-User Part 8]: Make conversation processor settings server-wide (#529) - Rather than having each individual user configure their conversation settings, allow the server admin to configure the OpenAI API key or offline model once, and let all the users re-use that code. - To configure the settings, the admin should go to the `django/admin` page and configure the relevant chat settings. To create an admin, run `python3 src/manage.py createsuperuser` and enter in the details. For simplicity, the email and username should match. - Remove deprecated/unnecessary endpoints and views for configuring per-user chat settings 2023-11-02 18:43:27 +01:00			`class ChatModelOptionsFactory(factory.django.DjangoModelFactory):`
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00			`class Meta:`
[Multi-User Part 8]: Make conversation processor settings server-wide (#529) - Rather than having each individual user configure their conversation settings, allow the server admin to configure the OpenAI API key or offline model once, and let all the users re-use that code. - To configure the settings, the admin should go to the `django/admin` page and configure the relevant chat settings. To create an admin, run `python3 src/manage.py createsuperuser` and enter in the details. For simplicity, the email and username should match. - Remove deprecated/unnecessary endpoints and views for configuring per-user chat settings 2023-11-02 18:43:27 +01:00			`model = ChatModelOptions`
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00
Use llama.cpp for offline chat models - Benefits of moving to llama-cpp-python from gpt4all: - Support for all GGUF format chat models - Support for AMD, Nvidia, Mac, Vulcan GPU machines (instead of just Vulcan, Mac) - Supports models with more capabilities like tools, schema enforcement, speculative ddecoding, image gen etc. - Upgrade default chat model, prompt size, tokenizer for new supported chat models - Load offline chat model when present on disk without requiring internet - Load model onto GPU if not disabled and device has GPU - Load model onto CPU if loading model onto GPU fails - Create helper function to check and load model from disk, when model glob is present on disk. `Llama.from_pretrained' needs internet to get repo info from HuggingFace. This isn't required, if the model is already downloaded Didn't find any existing HF or llama.cpp method that looked for model glob on disk without internet 2024-03-15 21:19:44 +01:00			`max_prompt_size = 3500`
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00			`tokenizer = None`
Use llama.cpp for offline chat models - Benefits of moving to llama-cpp-python from gpt4all: - Support for all GGUF format chat models - Support for AMD, Nvidia, Mac, Vulcan GPU machines (instead of just Vulcan, Mac) - Supports models with more capabilities like tools, schema enforcement, speculative ddecoding, image gen etc. - Upgrade default chat model, prompt size, tokenizer for new supported chat models - Load offline chat model when present on disk without requiring internet - Load model onto GPU if not disabled and device has GPU - Load model onto CPU if loading model onto GPU fails - Create helper function to check and load model from disk, when model glob is present on disk. `Llama.from_pretrained' needs internet to get repo info from HuggingFace. This isn't required, if the model is already downloaded Didn't find any existing HF or llama.cpp method that looked for model glob on disk without internet 2024-03-15 21:19:44 +01:00			`chat_model = "NousResearch/Hermes-2-Pro-Mistral-7B-GGUF"`
[Multi-User Part 8]: Make conversation processor settings server-wide (#529) - Rather than having each individual user configure their conversation settings, allow the server admin to configure the OpenAI API key or offline model once, and let all the users re-use that code. - To configure the settings, the admin should go to the `django/admin` page and configure the relevant chat settings. To create an admin, run `python3 src/manage.py createsuperuser` and enter in the details. For simplicity, the email and username should match. - Remove deprecated/unnecessary endpoints and views for configuring per-user chat settings 2023-11-02 18:43:27 +01:00			`model_type = "offline"`
Fix openai chat actor, director tests - Update test ChatModelOptions setup since update to it's schema - Fix stale function calls using their updated signatures 2024-06-09 03:46:55 +02:00			`openai_config = factory.SubFactory(OpenAIProcessorConversationConfigFactory)`
[Multi-User Part 8]: Make conversation processor settings server-wide (#529) - Rather than having each individual user configure their conversation settings, allow the server admin to configure the OpenAI API key or offline model once, and let all the users re-use that code. - To configure the settings, the admin should go to the `django/admin` page and configure the relevant chat settings. To create an admin, run `python3 src/manage.py createsuperuser` and enter in the details. For simplicity, the email and username should match. - Remove deprecated/unnecessary endpoints and views for configuring per-user chat settings 2023-11-02 18:43:27 +01:00

			`class UserConversationProcessorConfigFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
			`model = UserConversationConfig`

			`user = factory.SubFactory(UserFactory)`
			`setting = factory.SubFactory(ChatModelOptionsFactory)`
[Multi-User Part 3]: Separate chat sesssions based on authenticated users (#511) - Add a data model which allows us to store Conversations with users. This does a minimal lift over the current setup, where the underlying data is stored in a JSON file. This maintains parity with that configuration. - There does _seem_ to be some regression in chat quality, which is most likely attributable to search results. This will help us with #275. It should become much easier to maintain multiple Conversations in a given table in the backend now. We will have to do some thinking on the UI. 2023-10-26 20:37:41 +02:00

			`class ConversationFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
			`model = Conversation`

			`user = factory.SubFactory(UserFactory)`
Add default settings to let new users be subscribed on trial - Add the default user to a subscription trial - Update associated unit tests 2023-11-11 07:38:28 +01:00

Make search model configurable on server - Expose ability to modify search model via Django admin interface - Previously the bi_encoder and cross_encoder models to use were set in code - Now it's user configurable but with a default config generated by default 2023-11-15 01:56:26 +01:00			`class SearchModelFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
Rename SearchModel to SearchModelConfig DB model, Require Cross-Encoder 2023-11-16 02:12:54 +01:00			`model = SearchModelConfig`
Make search model configurable on server - Expose ability to modify search model via Django admin interface - Previously the bi_encoder and cross_encoder models to use were set in code - Now it's user configurable but with a default config generated by default 2023-11-15 01:56:26 +01:00
			`name = "default"`
			`model_type = "text"`
			`bi_encoder = "thenlper/gte-small"`
Upgrade default cross-encoder to mixedbread ai's mxbai-rerank-xsmall Previous cross-encoder model was a few years old, newer models should have improved in quality. Model size increases by 50% compared to previous for better performance, at least on benchmarks 2024-04-24 05:43:14 +02:00			`cross_encoder = "mixedbread-ai/mxbai-rerank-xsmall-v1"`
Make search model configurable on server - Expose ability to modify search model via Django admin interface - Previously the bi_encoder and cross_encoder models to use were set in code - Now it's user configurable but with a default config generated by default 2023-11-15 01:56:26 +01:00

Add default settings to let new users be subscribed on trial - Add the default user to a subscription trial - Update associated unit tests 2023-11-11 07:38:28 +01:00			`class SubscriptionFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
			`model = Subscription`

			`user = factory.SubFactory(UserFactory)`
Subscribe default user to standard plan with a far away renewal date Self hosted users in anonymous mode have all capabilities unlocked 2023-11-15 01:20:55 +01:00			`type = "standard"`
Add default settings to let new users be subscribed on trial - Add the default user to a subscription trial - Update associated unit tests 2023-11-11 07:38:28 +01:00			`is_recurring = False`
Handle subscribe renew date, langchain, pydantic & logger.warn warnings - Ensure langchain less than 0.2.0 is used, to prevent breaking ChatOpenAI, PyMuPDF usage due to their deprecation after 0.2.0 - Set subscription renewal date to a timezone aware datetime - Use logger.warning instead of logger.warn as latter is deprecated - Use `model_dump' not deprecated dict to get all configured content_types 2024-01-11 21:02:46 +01:00			`renewal_date = make_aware(datetime.strptime("2100-04-01", "%Y-%m-%d"))`
Add tests for the db lock 2024-04-17 09:52:41 +02:00

			`class ProcessLockFactory(factory.django.DjangoModelFactory):`
			`class Meta:`
			`model = ProcessLock`

			`name = "test_lock"`