Feature Description
OpenAI enables automatic prompt caching, as described here: https://platform.openai.com/docs/guides/prompt-caching
The number of cached tokens for a prompt is returned in the usage structure, in the "cached_tokens" field under "prompt_tokens_details". For example:
"usage": {
"prompt_tokens": 2006,
"completion_tokens": 300,
"total_tokens": 2306,
"prompt_tokens_details": {
"cached_tokens": 1920
},
"completion_tokens_details": {
"reasoning_tokens": 0,
"accepted_prediction_tokens": 0,
"rejected_prediction_tokens": 0
}
}
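For reference, a minimal sketch of reading this field when calling OpenAI directly with the official Python SDK (prompt_tokens_details is only populated on caching-capable models and recent SDK versions, hence the defensive check):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)

usage = response.usage
print("prompt_tokens:", usage.prompt_tokens)
print("completion_tokens:", usage.completion_tokens)
print("total_tokens:", usage.total_tokens)

# prompt_tokens_details may be None or absent on older models/SDKs
details = getattr(usage, "prompt_tokens_details", None)
if details is not None:
    print("cached_tokens:", details.cached_tokens)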
The request is for the Token Counter to expose this field in addition to prompt_tokens, completion_tokens, and total_tokens.
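If implemented, usage could look something like the sketch below. The existing TokenCountingHandler counters are real; cached_llm_token_count is a hypothetical name for the requested addition:

from llama_index.core.callbacks import CallbackManager, TokenCountingHandler

token_counter = TokenCountingHandler()
callback_manager = CallbackManager([token_counter])
# ... attach callback_manager to the LLM / Settings and run some queries ...

print(token_counter.prompt_llm_token_count)
print(token_counter.completion_llm_token_count)
print(token_counter.total_llm_token_count)
# requested addition (hypothetical attribute name):
# print(token_counter.cached_llm_token_count)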
Reason
I am not aware of a way to access this field when using OpenAI indirectly through LlamaIndex.
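One partial workaround may be to read the raw provider payload off the LLM response, assuming the LlamaIndex OpenAI LLM populates ChatResponse.raw (a sketch, not verified across versions, and it does not help when the LLM is invoked deep inside a query engine):

from llama_index.core.llms import ChatMessage
from llama_index.llms.openai import OpenAI

llm = OpenAI(model="gpt-4o-mini")
chat_response = llm.chat([ChatMessage(role="user", content="Hello!")])

# ChatResponse.raw should hold the underlying OpenAI response, if populated
raw = chat_response.raw
usage = getattr(raw, "usage", None)
if usage is not None:
    details = getattr(usage, "prompt_tokens_details", None)
    if details is not None:
        print("cached_tokens:", details.cached_tokens)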
Value of Feature
As mentioned in OpenAI's docs, having access to this value makes it possible to monitor metrics such as cache hit rates, latency, and the percentage of tokens cached, and to optimize prompt and caching strategy accordingly.
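For the usage example above, the cache hit rate would be cached_tokens / prompt_tokens = 1920 / 2006, i.e. roughly 96% of the prompt tokens were served from the cache.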