OpenRouter providers, for example, have little incentive to subsidize prices, yet their prices are far below the $6.37/M estimate from the article.
https://openrouter.ai/meta-llama/llama-3.3-70b-instruct
avg $0.37/M input tokens, $0.73/M output tokens (21 providers)
Llama is not even a good example, since more recent models are better optimized, using Mixture of Experts and KV cache compression.
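
To put a number on the gap, here is a quick back-of-the-envelope sketch in Python. It uses only the figures quoted above. The 50/50 input/output split is my own assumption for illustration, and since I'm not assuming whether the article's $6.37/M refers to input, output, or blended tokens, it compares against both.

```python
# Prices quoted above, in USD per million tokens.
ARTICLE_ESTIMATE = 6.37   # the article's cost estimate
OPENROUTER_INPUT = 0.37   # avg input price, Llama 3.3 70B on OpenRouter
OPENROUTER_OUTPUT = 0.73  # avg output price, Llama 3.3 70B on OpenRouter


def blended_price(input_price: float, output_price: float, input_share: float) -> float:
    """Blended $/M tokens when `input_share` of all tokens are input tokens."""
    return input_share * input_price + (1.0 - input_share) * output_price


# How many times higher the article's estimate is than each market price.
ratio_vs_output = ARTICLE_ESTIMATE / OPENROUTER_OUTPUT
ratio_vs_input = ARTICLE_ESTIMATE / OPENROUTER_INPUT

# Assumption (not from the article): half of the tokens are input, half output.
blended = blended_price(OPENROUTER_INPUT, OPENROUTER_OUTPUT, input_share=0.5)

print(f"vs output price:  {ratio_vs_output:.1f}x")
print(f"vs input price:   {ratio_vs_input:.1f}x")
print(f"blended (50/50):  ${blended:.2f}/M")
```

So the article's estimate is somewhere between roughly 9x and 17x the average market price, depending on which side you compare against.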