ObjectiveAI - LLM Confidence

Chat Studio Models Docs Team

Request Headers

authorizationstringoptional

The authorization token (optional).

Request Query

metadatabooleanoptional

Whether to include metadata in the response. If false, all other query fields are ignored.

mebooleanoptional

Whether to only return the user's own metadata statistics. If true, authorization is required.

fromstringoptional

An RFC 3339 start date to filter metadata statistics by.

tostringoptional

An RFC 3339 end date to filter metadata statistics by.

Request Body

idstring*required

Model ID used to generate the response.

modeenum*required

The mode of the model, which determines whether it generates a response or selects from the generated options.

Variants

Generate"generate"

The model generates a response.

Select Thinking"select_thinking"

The model selects a Generate ID. The model will output reasoning, even if the LLM is not a reasoning model. Best for non-reasoning models.

Select Non Thinking"select_non_thinking"

The model selects a Generate ID.

Select Thinking Logprobs"select_thinking_logprobs"

The model selects one or more Generate IDs as a probability distribution. The model will output reasoning, even if the LLM is not a reasoning model. Best for non-reasoning models.

Select Non Thinking Logprobs"select_non_thinking_logprobs"

The model selects one or more Generate IDs as a probability distribution.

select_top_logprobsnumbermin: 0max: 20optional

If the mode is one of the select logprobs modes, this controls how many of the top options are returned with their probabilities.

frequency_penaltynumbermin: -2max: 2optional

This setting aims to control the repetition of tokens based on how often they appear in the input. It tries to use less frequently those tokens that appear more in the input, proportional to how frequently they occur. Token penalty scales with the number of occurrences. Negative values will encourage token reuse.

logit_biasmap<string, number>optional

Accepts a JSON object that maps tokens (specified by their token ID in the tokenizer) to an associated bias value from -100 to 100. Mathematically, the bias is added to the logits generated by the model prior to sampling. The exact effect will vary per model, but values between -1 and 1 should decrease or increase likelihood of selection; values like -100 or 100 should result in a ban or exclusive selection of the relevant token.

max_completion_tokensnumbermin: 1optional

An upper bound for the number of tokens that can be generated for a completion, including visible output tokens and reasoning tokens.

presence_penaltynumbermin: -2max: 2optional

This setting aims to control the presence of tokens in the output. It tries to encourage the model to use tokens that are less present in the input, proportional to their presence in the input. Token presence scales with the number of occurrences. Negative values will encourage more diverse token usage.

reasoning_effortenumoptional

Constrains effort on reasoning for some reasoning models.

Variants

Low"low"

Medium"medium"

High"high"

stopenumoptional

Stop generation immediately if the model encounters any token specified in the stop array.

Variants

Stop Stringstring

Stop Arrayarray

Items

Stop Stringstring

temperaturenumbermin: 0max: 2optional

This setting influences the variety in the model’s responses. Lower values lead to more predictable and typical responses, while higher values encourage more diverse and less common responses. At 0, the model always gives the same response for a given input.

top_pnumbermin: 0max: 1optional

This setting limits the model’s choices to a percentage of likely tokens: only the top tokens whose probabilities add up to P. A lower value makes the model’s responses more predictable, while the default setting allows for a full range of token choices. Think of it like a dynamic Top-K.

max_tokensnumbermin: 1optional

This sets the upper limit for the number of tokens the model can generate in response. It won’t produce more than this limit. The maximum value is the context length minus the prompt length.

min_pnumbermin: 0max: 1optional

Represents the minimum probability for a token to be considered, relative to the probability of the most likely token. (The value changes depending on the confidence level of the most probable token.) If your Min-P is set to 0.1, that means it will only allow for tokens that are at least 1/10th as probable as the best possible option.

providerobjectoptional

OpenRouter provider preferences.

Properties

orderarrayoptional

List of provider slugs to try in order.

Items

Provider Slugstring

allow_fallbacksbooleanoptional

Whether to allow backup providers when the primary is unavailable.

require_parametersbooleanoptional

Only use providers that support all parameters in your request.

data_collectionenumoptional

Control whether to use providers that may store data.

Variants

Allow"allow"

Deny"deny"

onlyarrayoptional

List of provider slugs to allow for this request.

Items

Provider Slugstring

ignorearrayoptional

List of provider slugs to skip for this request.

Items

Provider Slugstring

quantizationsarrayoptional

List of quantization levels to filter by.

Items

Quantization Levelstring

sortstringoptional

Sort providers by price or throughput.

reasoningobjectoptional

OpenRouter reasoning configuration.

Properties

max_tokensnumbermin: 1optional

An upper bound for the number of tokens that can be generated for reasoning.

effortenumoptional

Constrains effort on reasoning for some reasoning models.

Variants

Low"low"

Medium"medium"

High"high"

enabledbooleanoptional

Whether reasoning is enabled for this request.

repetition_penaltynumbermin: 0max: 2optional

Helps to reduce the repetition of tokens from the input. A higher value makes the model less likely to repeat tokens, but too high a value can make the output less coherent (often with run-on sentences that lack small words). Token penalty scales based on original token’s probability.

top_anumbermin: 0max: 1optional

Consider only the top tokens with “sufficiently high” probabilities based on the probability of the most likely token. Think of it like a dynamic Top-P. A lower Top-A value focuses the choices based on the highest probability token but with a narrower scope. A higher Top-A value does not necessarily affect the creativity of the output, but rather refines the filtering process based on the maximum probability.

top_knumbermin: 1optional

This limits the model’s choice of tokens at each step, making it choose from a smaller set. A value of 1 means the model will always pick the most likely next token, leading to predictable results. By default this setting is disabled, making the model to consider all choices.

verbosityenumoptional

Controls the verbosity and length of the model response. Lower values produce more concise responses, while higher values produce more detailed and comprehensive responses.

Variants

Low"low"

Medium"medium"

High"high"

modelsarrayoptional

Fallback models. Will be tried in order if the first one fails.

Items

Model IDstring

weightenum*required

The weight of the model, which determines its influence on the Confidence Score. Must match the weight strategy of the parent Model.

Variants

Static Weightobject

A static weight value.

Properties

type"static"*required

weightnumber*required

The static weight value.

Training Table Weightobject

A dynamic weight value based on training table data.

Properties

type"training_table"*required

base_weightnumber*required

The base weight value, uninfluenced by training table data.

min_weightnumber*required

The minimum weight value. A model that never matches the correct answer will have this weight.

max_weightnumber*required

The maximum weight value. A model that always matches the correct answer will have this weight.

Response Body

idstring*required

Model ID used to generate the response.

modeenum*required

The mode of the model, which determines whether it generates a response or selects from the generated options.

Variants

Generate"generate"

The model generates a response.

Select Thinking"select_thinking"

The model selects a Generate ID. The model will output reasoning, even if the LLM is not a reasoning model. Best for non-reasoning models.

Select Non Thinking"select_non_thinking"

The model selects a Generate ID.

Select Thinking Logprobs"select_thinking_logprobs"

The model selects one or more Generate IDs as a probability distribution. The model will output reasoning, even if the LLM is not a reasoning model. Best for non-reasoning models.

Select Non Thinking Logprobs"select_non_thinking_logprobs"

The model selects one or more Generate IDs as a probability distribution.

select_top_logprobsnumbermin: 0max: 20optional

If the mode is one of the select logprobs modes, this controls how many of the top options are returned with their probabilities.

frequency_penaltynumbermin: -2max: 2optional

logit_biasmap<string, number>optional

max_completion_tokensnumbermin: 1optional

An upper bound for the number of tokens that can be generated for a completion, including visible output tokens and reasoning tokens.

presence_penaltynumbermin: -2max: 2optional

reasoning_effortenumoptional

Constrains effort on reasoning for some reasoning models.

Variants

Low"low"

Medium"medium"

High"high"

stopenumoptional

Stop generation immediately if the model encounters any token specified in the stop array.

Variants

Stop Stringstring

Stop Arrayarray

Items

Stop Stringstring

temperaturenumbermin: 0max: 2optional

top_pnumbermin: 0max: 1optional

max_tokensnumbermin: 1optional

This sets the upper limit for the number of tokens the model can generate in response. It won’t produce more than this limit. The maximum value is the context length minus the prompt length.

min_pnumbermin: 0max: 1optional

providerobjectoptional

OpenRouter provider preferences.

Properties

orderarrayoptional

List of provider slugs to try in order.

Items

Provider Slugstring

allow_fallbacksbooleanoptional

Whether to allow backup providers when the primary is unavailable.

require_parametersbooleanoptional

Only use providers that support all parameters in your request.

data_collectionenumoptional

Control whether to use providers that may store data.

Variants

Allow"allow"

Deny"deny"

onlyarrayoptional

List of provider slugs to allow for this request.

Items

Provider Slugstring

ignorearrayoptional

List of provider slugs to skip for this request.

Items

Provider Slugstring

quantizationsarrayoptional

List of quantization levels to filter by.

Items

Quantization Levelstring

sortstringoptional

Sort providers by price or throughput.

reasoningobjectoptional

OpenRouter reasoning configuration.

Properties

max_tokensnumbermin: 1optional

An upper bound for the number of tokens that can be generated for reasoning.

effortenumoptional

Constrains effort on reasoning for some reasoning models.

Variants

Low"low"

Medium"medium"

High"high"

enabledbooleanoptional

Whether reasoning is enabled for this request.

repetition_penaltynumbermin: 0max: 2optional

top_anumbermin: 0max: 1optional

top_knumbermin: 1optional

verbosityenumoptional

Controls the verbosity and length of the model response. Lower values produce more concise responses, while higher values produce more detailed and comprehensive responses.

Variants

Low"low"

Medium"medium"

High"high"

modelsarrayoptional

Fallback models. Will be tried in order if the first one fails.

Items

Model IDstring

weightenum*required

The weight of the model, which determines its influence on the Confidence Score. Must match the weight strategy of the parent Model.

Variants

Static Weightobject

A static weight value.

Properties

type"static"*required

weightnumber*required

The static weight value.

Training Table Weightobject

A dynamic weight value based on training table data.

Properties

type"training_table"*required

base_weightnumber*required

The base weight value, uninfluenced by training table data.

min_weightnumber*required

The minimum weight value. A model that never matches the correct answer will have this weight.

max_weightnumber*required

The maximum weight value. A model that always matches the correct answer will have this weight.

namestring*required

A base62 22-character unique identifier for the Query LLM. This is a hash of all parameters.

training_table_namestringoptional

A base62 22-character unique identifier for the Query LLM. This is a hash of some parameters. Only present with Training Table Weight.

user_idstringoptional

The ID of the user who created the Query LLM

createdstringoptional

The RFC 3339 timestamp when the Query LLM was created

requestsnumberoptional

The number of requests made with the Query LLM

chat_completion_tokensnumberoptional

The number of chat completion tokens generated by the Query LLM

chat_prompt_tokensnumberoptional

The number of chat prompt tokens processed by the Query LLM

chat_costnumberoptional

The total cost of chat completions generated by the Query LLM, in Credits

embedding_completion_tokensnumberoptional

The number of embedding completion tokens generated by the Query LLM

embedding_prompt_tokensnumberoptional

The number of embedding prompt tokens processed by the Query LLM

embedding_costnumberoptional

The total cost of embedding completions generated by the Query LLM, in Credits

Objective Artificial Intelligence, Inc.

	Overview
		FAQ
	Frequently Asked Questions

	Query
		Response Format
	Control the output and the Confidence Scores of your Query Completion

	POST	/query/completions
	Create a new Query Completion

	GET	/query/completions
	List your historical Query Completions

	GET	/query/completions/{id}
	Retrieve a historical Query Completion

	POST	/query/completions/{id}/publish
	Publish a historical Query Completion

	POST	/query/completions/{id}/training_table
	Mark the Correct Confidence ID for a historical Query Completion, adding it to the Query Model's training table

	DELETE	/query/completions/{id}/training_table
	Unmark the Correct Confidence ID for a historical Query Completion, removing it from the Query Model's training table

	Query Models
	GET	/query_models
	List all ObjectiveAI Query Models

	GET	/query_models/count
	Get the count of all ObjectiveAI Query Models

	GET	/query_models/{slug}
	Retrieve details about a specific ObjectiveAI Query Model (via slug)

	POST	/query_models
	Retrieve details about a specific ObjectiveAI Query Model (via JSON body)

	GET	/query_llms/{slug}
	Retrieve details about a specific ObjectiveAI Query LLM (via slug)

	POST	/query_llms
	Retrieve details about a specific ObjectiveAI Query LLM (via JSON body)

	Chat
	POST	/chat/completions
	Create a new Chat Completion

	GET	/chat/completions
	List your historical Chat Completions

	GET	/chat/completions/{id}
	Retrieve a historical Chat Completion

	POST	/chat/completions/{id}/publish
	Publish a historical Chat Completion

	Chat Models
	GET	/models
	List all LLMs provided by OpenRouter

	GET	/models/{author}/{slug}
	Retrieve details about a specific LLM provided by OpenRouter

	Auth
	GET	/auth/credits
	Retrieve your available credits

	GET	/auth/keys
	List your API keys

	POST	/auth/keys
	Create a new API key

	DELETE	/auth/keys
	Disable an API key

	GET	/auth/keys/openrouter
	Retrieve your BYOK OpenRouter API key

	POST	/auth/keys/openrouter
	Set your BYOK OpenRouter API key

	DELETE	/auth/keys/openrouter
	Remove your BYOK OpenRouter API key

	Other
		SDKs
	Use one of our Official Open-Source SDKs

	GET	/metadata
	Retrieve ObjectiveAI metadata statistics