# Model Inference Pricing Explanation

## Concepts

### Billing Unit

Token: A token represents a common sequence of characters. The number of tokens used for each English word may vary. For example, a single long word like "antidisestablishmentarianism" might be broken down into several tokens, while a short and common word like "word" might use just one token. Generally speaking, for typical English text, 1 token is roughly equivalent to 3-4 English characters. The exact number of tokens for a given call can be obtained through the [Token Calculation API](/api/estimate).
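The 3-4 characters per token rule of thumb can be turned into a rough pre-flight estimate. The sketch below is only an approximation for typical English text. The function name `estimate_token_range` and its parameters are hypothetical and not part of any Kimi SDK. For billing-accurate counts, use the Token Calculation API instead.

```python
import math


def estimate_token_range(
    text: str,
    min_chars_per_token: float = 3.0,
    max_chars_per_token: float = 4.0,
) -> tuple[int, int]:
    """Return a rough (low, high) token estimate for English text.

    Assumption: 1 token is roughly 3-4 English characters. This is a
    heuristic, not the tokenizer's actual behavior.
    """
    n_chars = len(text)
    # More characters per token means fewer tokens, so the upper
    # characters-per-token bound gives the lower token bound.
    low = math.ceil(n_chars / max_chars_per_token)
    high = math.ceil(n_chars / min_chars_per_token)
    return low, high


if __name__ == "__main__":
    low, high = estimate_token_range("Tokens are the billing unit for model inference.")
    print(f"Estimated tokens: between {low} and {high}")
```

A range is returned rather than a single number because the ratio varies with vocabulary: long or rare words tend to split into more tokens than short, common ones.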

### Billing Logic

Chat Completion API charges: Both Input and Output are billed based on token usage. If you upload a document, extract its content, and then pass the extracted content to the model as Input, that content is billed as Input tokens like any other text. File-related interfaces (file content extraction and file storage) are **temporarily free**. In other words, if you only upload and extract a document without sending its content to the model, no charges are incurred.
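The billing logic above can be sketched as simple arithmetic. The example assumes prices are quoted per 1,000,000 tokens with separate Input and Output rates. That unit, the function name `estimate_chat_cost`, and all prices and token counts below are placeholders for illustration, not actual Kimi rates. Check the per-model pricing pages for real figures.

```python
from decimal import Decimal


def estimate_chat_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: Decimal,
    output_price_per_million: Decimal,
) -> Decimal:
    """Estimate the cost of one Chat Completion call.

    Assumption: prices are quoted per 1,000,000 tokens and Input and
    Output are billed at separate rates.
    """
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) * input_price_per_million / million
    output_cost = Decimal(output_tokens) * output_price_per_million / million
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical token counts: the extracted document text counts as
    # Input once it is passed to the model.
    prompt_tokens = 200
    extracted_document_tokens = 12_000
    output_tokens = 800

    cost = estimate_chat_cost(
        input_tokens=prompt_tokens + extracted_document_tokens,
        output_tokens=output_tokens,
        input_price_per_million=Decimal("1.00"),   # placeholder price
        output_price_per_million=Decimal("4.00"),  # placeholder price
    )
    print(f"Estimated cost: {cost}")
```

The upload and extraction steps contribute nothing to the total here, since the file interfaces are temporarily free. Only the tokens actually sent to and returned by the model are counted.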

## Model Pricing

See detailed pricing for each model:

<CardGroup cols={3}>
  <Card title="Kimi K2.6" icon="star" href="/pricing/chat-k26">
    Multi-modal model with visual and text input
  </Card>

  <Card title="Kimi K2" icon="bolt" href="/pricing/chat-k2">
    MoE model with exceptional code and agent capabilities
  </Card>

  <Card title="Moonshot V1" icon="moon" href="/pricing/chat-v1">
    Generation model series
  </Card>
</CardGroup>
