cubre los diferentes modelos soportados por ChatBotKit, incluyendo los modelos base de OpenAI como GPT-4 y GPT-3, así como modelos propios para varios casos de uso.

ChatBotKit supports various models to create engaging conversational AI experiences. These include foundational OpenAI models such as GPT4o, GPT4, and GPT3, along with models from Anthropic, Mistral, and others. Additionally, ChatBotKit uses several of its own models, including text-algo-002 and text-algo-003, for our in-house general assistant.

Below is a table that summarizes the different models. It includes their names, short descriptions, and context sizes (the maximum number of tokens).

Model NameShort DescriptionToken RatioContext Size
gpt-4o-mini-nextGPT-4o mini is our most cost-efficient small model that’s smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.0.0333128000
gpt-4o-mini-classicGPT-4o mini is our most cost-efficient small model that’s smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.0.0333128000
gpt-4o-miniGPT-4o mini is our most cost-efficient small model that’s smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.0.0333128000
gpt-4o-siguienteGPT-4o es más rápido y barato que GPT-4 Turbo, con mayores capacidades de visión. El modelo tiene un contexto de 128K y una fecha límite de conocimiento de octubre de 2023.0.8333128000
gpt-4o-clásicoGPT-4o es más rápido y barato que GPT-4 Turbo, con mayores capacidades de visión. El modelo tiene un contexto de 128K y una fecha límite de conocimiento de octubre de 2023.0.8333128000
gpt-4oGPT-4o es más rápido y barato que GPT-4 Turbo, con mayores capacidades de visión. El modelo tiene un contexto de 128K y una fecha límite de conocimiento de octubre de 2023.0.8333128000
gpt-4-turbo-siguienteGPT-4 Turbo se ofrece en un contexto de 128K con un límite de conocimientos de abril de 2023 y soporte básico para la visión.1.6667128000
gpt-4-turbo-clásicoGPT-4 Turbo se ofrece en un contexto de 128K con un límite de conocimientos de abril de 2023 y soporte básico para la visión.1.6667128000
gpt-4-turboGPT-4 Turbo se ofrece en un contexto de 128K con un límite de conocimientos de abril de 2023 y soporte básico para la visión.1.6667128000
gpt-4-siguienteEl modelo GPT-4 se construyó con amplios conocimientos generales y experiencia en el dominio.3.33338192
gpt-4-clásicoEl modelo GPT-4 se construyó con amplios conocimientos generales y experiencia en el dominio.3.33338192
gpt-4El modelo GPT-4 se construyó con amplios conocimientos generales y experiencia en el dominio.3.33338192
gpt-3.5-turbo-siguienteGPT-3.5 Turbo es un modelo rápido y económico para tareas más sencillas.0.083316384
gpt-3.5-turbo-clásicoGPT-3.5 Turbo es un modelo rápido y económico para tareas más sencillas.0.22224096
gpt-3.5-turboGPT-3.5 Turbo es un modelo rápido y económico para tareas más sencillas.0.083316384
gpt-3.5-turbo-instrucciónGPT-3.5 Turbo es un modelo rápido y económico para tareas más sencillas.0.11114096
mistral-large-latestRazonamiento de alto nivel para tareas de gran complejidad. El modelo más potente de la familia Mistral AI.0.666732000
mistral-pequeño-últimoRazonamiento rentable para cargas de trabajo de baja latencia.0.166732000
claude-v3-opusEl modelo de IA más potente de Anthropic, con un rendimiento de alto nivel en tareas muy complejas. Es capaz de desenvolverse en escenarios abiertos e imprevistos con notable fluidez y una comprensión similar a la humana.4.1667200000
claude-v3-sonnetClaude 3 Sonnet logra el equilibrio ideal entre inteligencia y velocidad, especialmente para cargas de trabajo empresariales. Ofrece la máxima utilidad y está diseñado para ser fiable en despliegues de IA a gran escala.0.8333200000
claude-v3-haikuEl modelo más rápido y compacto de Anthropic para una capacidad de respuesta casi instantánea. Responde con rapidez a consultas y peticiones sencillas.0.0694200000
claude-v3Claude 3 Sonnet logra el equilibrio ideal entre inteligencia y velocidad, especialmente para cargas de trabajo empresariales. Ofrece la máxima utilidad y está diseñado para ser fiable en despliegues de IA a gran escala.0.8333200000
claude-v2.1Claude 2.1 es un gran modelo lingüístico (LLM) de Anthropic con una ventana de contexto de 200.000 tokens, índices de alucinación reducidos y una precisión mejorada en documentos largos.1.3333200000
claude-v2Claude 2.0 es un LLM líder de Anthropic que permite una amplia gama de tareas, desde el diálogo sofisticado y la generación creativa de contenidos hasta la instrucción detallada.1.3333100000
claude-instant-v1Claude Instant es el LLM más rápido y económico de Anthropic.0.1333100000
personalizadoCualquier modelo personalizado creado por el usuario.0.014096
texto-qaa-003Este modelo pertenece a la familia GPT-4 Turbo de modelos ChatBotKit. Está diseñado para aplicaciones de preguntas y respuestas. El modelo tiene un límite de tokens de 128000 y proporciona un equilibrio entre coste y calidad. Es un modelo personalizado basado en la arquitectura del modelo gpt.1.6667128000
texto-qaa-002Este modelo pertenece a la familia GPT-4 de modelos ChatBotKit. Está diseñado para aplicaciones de preguntas y respuestas. El modelo tiene un límite de tokens de 8 * ONE_K y proporciona un equilibrio entre coste y calidad. Es un modelo personalizado basado en la arquitectura del modelo gpt.3.33338192
texto-qaa-001Este modelo pertenece a la familia Turbo de modelos ChatBotKit. Está diseñado para aplicaciones de preguntas y respuestas. El modelo tiene un límite de tokens de 4000 y proporciona un equilibrio entre coste y calidad. Es un modelo personalizado basado en la arquitectura del modelo gpt.0.14096
texto-algo-003ste modelo pertenece a la familia GPT-4 de modelos ChatBotKit.3.33338192
texto-algo-002Este modelo pertenece a la familia Turbo de modelos ChatBotKit.0.14096
About our latest models

We will try to keep this page up-to-date. The latest and most up-to-date list of supported models and their configurations can be found here.

ChatBotKit uses the token ratio as a multiplier to calculate the actual number of tokens consumed by the model. Each model token is multiplied by the token ratio to determine the number of tokens ChatBotKit records. This ensures accurate tracking of the resources each model uses and correct user billing.

The context size refers to the maximum tokens (words or symbols) the model can consider when generating a response. A larger context size allows for more information to be taken into account, potentially leading to more accurate and relevant responses.

When choosing a model, it's essential to evaluate not just its capabilities, but also its cost and size. Larger and more expensive models aren't always the best choice for every task. Often, a smaller model can perform equally well or even better. As a rule of thumb, gpt-4o and gpt-4 are the best choices if you need the most advanced and capable model. However, if you're looking for a capable model that's also smaller, gpt-3.5-turbo might be a better fit.

Bring Your Own Model

ChatBotKit offers the unique option of bringing your own model and keys to the platform. This feature is designed for those who desire more control over their models and costs. If you have a model that you've trained and perfected over time for your specific use case or requirement, you're free to bring it to our platform. This means you can use your own keys, which allows you to handle the payment for the model usage directly. This could be beneficial, especially if you have particular budget constraints or specific cost strategies. In essence, with ChatBotKit, you're not just limited to using our pre-built models, but you can also introduce your custom-made models, providing you with more flexibility and control to meet your specific needs.

Here is an outline of the steps required to create your own custom model.

  1. Navigate to the Bot Configuration Screen

    • From the main dashboard, click on the "Bots" section in the left-hand menu.
    • Select the bot you want to configure or create a new bot.
  2. Choose the Model

    • Under the "Model" section, select "custom" from the dropdown menu as shown in the first screenshot.

    • Press the “Settings” button.

  3. Model Configuration Window

    • Enter a name for your custom model in the "Name" field. For example, "gpt-3.5-turbo."

    • Choose the provider of your custom model from the "Provider" dropdown menu. In this case, select "OpenAI."

    • Provide the necessary credentials for accessing the custom model. Click on the credentials field and enter the required information.

    • Define the maximum number of tokens the chatbot will use for each interaction in the "Max Tokens" field. The default value is 4096.

BYOK Caveats

When you opt to use your own key (BYOK) for model access, you assume full responsibility for the model's availability and operational limits. This shift occurs because you are no longer utilizing the default ChatBotKit service tiers, which may offer different capabilities and restrictions.

Customising Model Settings

To customize a model's settings, click on the settings icon next to the model name.

There are four main properties that can be customized: Max Tokens, Temperature, Interaction Max Messages, Region, Frequency Penalty, Presence Penalty, and Vision.

Max Tokens: This property determines the maximum number of tokens that the model can consume when generating a response. By default, this is set to the maximum context size for the model, but you can reduce it to limit the amount of resources used by the model. This can help save token cost but may also reduce the ability of the chatbot to keep up with the conversation.

Temperature: This property determines the level of randomness or creativity in the model's responses. A higher temperature value will result in more diverse and creative responses, while a lower value will result in more conservative and predictable responses.

Interaction Max Messages: The maximum number of messages to use per model interaction. Setting this value to low will make the model more deterministic. Increasing the value will result in more creativity. For Q&A-style conversation it is recommended to keep the value to 2.

Region: The region property allows you to specify the geographical region for the model. This can be particularly useful for services that have specific regional requirements or restrictions. However, it's important to note that the availability of certain models may vary depending on the region.

Frequency Penalty: This property determines how much the model penalizes the repetition of certain words or phrases in its responses. A higher frequency penalty value will result in responses that are more varied and less repetitive.

Presence Penalty: This property determines how much the model penalizes the use of certain words or phrases in its responses. A higher presence penalty value will result in responses that are less likely to contain specific words or phrases.

Vision: This property applies solely to vision models. It enables bots to utilize native vision capabilities as opposed to Skillset Vision Actions. While we generally recommend Skillset for cost-efficiency and control, there are situations where native vision capabilities may be preferred.

By customizing these properties, you can fine-tune the behavior of the model to better suit your specific use case and requirements. However, it's important to note that changing these properties can have a significant impact on the model's performance and accuracy, so it's recommended to experiment with different settings to find the best balance between performance and creativity.

FAQ

Can I get regional access to some models?

Yes. Some models such as Claude can be accessed within your own designated region. Please contact us for more information.

Can I bring my own model?

Our models are designed to scale no matter the circumstances. However, customers that wish to bring their own model can do so on some of our higher-tier plans such as Pro, Pro Plus and Team.