Output Token

Simon BudziakCTO
An Output Token is a unit of text generated by the AI model. Because generation requires heavy computation (predicting one token at a time), output tokens are typically more expensive and slower than input tokens.
Optimizing for concise output tokens is a key strategy for reducing latency and cost in production AI apps.
Optimizing for concise output tokens is a key strategy for reducing latency and cost in production AI apps.