
Model Inference Pricing Explanation

Concepts

Billing Unit

Token: A token represents a common sequence of characters. The number of tokens per word may vary. For example, a long word like "antidisestablishmentarianism" might be broken down into several tokens, while a short, common word like "word" might use just one token. Generally speaking, for typical English text, 1 token is roughly equivalent to 3-4 English characters. The exact number of tokens consumed by each call can be obtained through the Token Calculation API.
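The rule of thumb above can be sketched as a quick estimate. This is only an approximation for planning purposes; the function name and the 3.5 chars-per-token ratio are illustrative assumptions, and the authoritative count always comes from the Token Calculation API.

```python
# Rough token estimate using the "1 token ~= 3-4 English characters" rule of
# thumb from the docs. Approximation only; use the Token Calculation API for
# the exact count billed.

def estimate_tokens(text: str, chars_per_token: float = 3.5) -> int:
    """Approximate token count for typical English text."""
    return max(1, round(len(text) / chars_per_token))

short = estimate_tokens("word")                           # short word: ~1 token
long_ = estimate_tokens("antidisestablishmentarianism")   # long word: several tokens
```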

Billing Logic

Chat Completion API charges: Both Input and Output are billed based on usage. If you upload a document, extract its content, and then pass the extracted content as Input to the model, that document content is also billed as Input. File-related interfaces (file content extraction / file storage) are temporarily free. In other words, if you only upload and extract a document, that API call by itself will not incur any charges.
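The billing logic above can be sketched as a small cost calculator. The function name and the prices used in the example are hypothetical placeholders; real rates come from the pricing tables below and are quoted per 1M tokens.

```python
# Minimal sketch of the Chat Completion billing logic: Input and Output are
# each billed by usage at their own per-1M-token rate. File upload/extraction
# itself is free; extracted content is only billed once passed in as Input.
# Prices below are HYPOTHETICAL placeholders, not actual rates.

def chat_completion_cost(input_tokens: int, output_tokens: int,
                         input_price_per_1m: float,
                         output_price_per_1m: float) -> float:
    """Cost of one Chat Completion call, given per-1M-token prices."""
    return (input_tokens * input_price_per_1m
            + output_tokens * output_price_per_1m) / 1_000_000

# Example: 12k input tokens (including extracted document content) + 800 output
cost = chat_completion_cost(input_tokens=12_000, output_tokens=800,
                            input_price_per_1m=2.0, output_price_per_1m=8.0)
```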

Product Pricing

Explanation: The prices listed below are all inclusive of tax.

Multi-modal Model kimi-k2.5

  • kimi-k2.5 is Kimi's most versatile model to date, featuring a native multimodal architecture that supports both visual and text input, thinking and non-thinking modes, and dialogue and agent tasks.
  • Context length 256k, supports long thinking and deep reasoning.
  • Supports automatic context caching functionality, ToolCalls, JSON Mode, Partial Mode, and internet search functionality.

Generation Model kimi-k2

  • kimi-k2 is a Mixture-of-Experts (MoE) foundation model with exceptional coding and agent capabilities, featuring 1 trillion total parameters and 32 billion activated parameters. In benchmark evaluations covering general knowledge reasoning, programming, mathematics, and agent-related tasks, the K2 model outperforms other leading open-source models.
  • kimi-k2-0905-preview: Context length 256k. Based on kimi-k2-0711-preview, with enhanced agentic coding abilities, improved frontend code quality and practicality, and better context understanding
  • kimi-k2-turbo-preview: Context length 256k. High-speed version of kimi-k2, always aligned with the latest kimi-k2 (kimi-k2-0905-preview). Same model parameters as kimi-k2, output speed up to 60 tokens/sec (max 100 tokens/sec)
  • kimi-k2-0711-preview: Context length 128k
  • kimi-k2-thinking: Context length 256k. A thinking model with general agentic and reasoning capabilities, specializing in deep reasoning tasks
  • kimi-k2-thinking-turbo: Context length 256k. High-speed version of kimi-k2-thinking, suitable for scenarios requiring both deep reasoning and extremely fast responses
  • Supports ToolCalls, JSON Mode, Partial Mode, and internet search functionality
  • Does not support vision functionality
  • Supports automatic context caching functionality. Cached tokens are charged at the input price (cache hit) rate. You can view "context caching" type cost details in the console
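The context-caching billing described in the last bullet can be sketched as follows. Cached input tokens are charged at a separate cache-hit rate, uncached input at the normal input rate; all prices in this example are hypothetical placeholders, quoted per 1M tokens.

```python
# Sketch of context-caching billing: input tokens that hit the cache are
# charged at the (lower) cache-hit rate, the rest at the normal input rate,
# and output at the output rate. Prices are HYPOTHETICAL, per 1M tokens.

def cost_with_cache(cached_tokens: int, uncached_tokens: int,
                    output_tokens: int,
                    cache_hit_price: float, input_price: float,
                    output_price: float) -> float:
    """Cost of one call when part of the Input is served from the cache."""
    return (cached_tokens * cache_hit_price
            + uncached_tokens * input_price
            + output_tokens * output_price) / 1_000_000

# Example: 10k cached + 2k fresh input tokens, 500 output tokens
cost = cost_with_cache(cached_tokens=10_000, uncached_tokens=2_000,
                       output_tokens=500,
                       cache_hit_price=0.5, input_price=2.0, output_price=8.0)
```

The "context caching" line item visible in the console cost details corresponds to the `cached_tokens * cache_hit_price` term above.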

Generation Model Moonshot-v1

Here, 1M = 1,000,000. The prices in the table represent the cost per 1M tokens consumed.