Forum Discussion
Configuring semantic caching on F5 AI Gateway
The semantic caching feature is mentioned on the F5 AI Gateway introduction page, but I couldn't find any documentation on how to use it. Is there a guide available for this?
Also, I'm curious whether token-based rate limiting will be supported in the future.
3 Replies
I can't speak to the semantic caching feature, but token-based rate limiting can be done with the Authorization header on XC, as I showed in F5 XC Session tracking with User Identification Policy | DevCentral.
On NGINX you can use the njs JavaScript module to build rate limits that are not keyed on source IP, and on BIG-IP an iRule should do the trick:
GitHub - nginx/njs-examples: NGINX JavaScript examples
3.1.2. Lab 2 - HTTP Throttling
The idea is to place XC, NGINX, or BIG-IP in front of the AI Gateway: the AI Gateway handles AI-specific protections, while general API protection is done on the usual platforms such as XC, NGINX, or BIG-IP.
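To make the header-keyed idea concrete, here is a minimal sketch in plain JavaScript of a rate limiter keyed on the Authorization header value rather than the source IP. This is a generic illustration, not F5 product code: in njs (which supports a subset of JavaScript) this logic would sit inside a handler such as one registered via `js_content`, and the function names here are hypothetical.

```javascript
// Fixed-window rate limiter: allow `limit` requests per `windowMs`
// for each key (e.g. the Authorization header value).
function makeRateLimiter(limit, windowMs) {
  const counters = new Map(); // key -> { windowStart, count }
  return function allow(key, now) {
    const entry = counters.get(key);
    if (!entry || now - entry.windowStart >= windowMs) {
      // First request, or the previous window expired: start a new window.
      counters.set(key, { windowStart: now, count: 1 });
      return true;
    }
    entry.count += 1;
    return entry.count <= limit;
  };
}

// Usage: key requests by the Authorization header instead of client IP.
const allow = makeRateLimiter(2, 60000); // 2 requests per minute
const auth = "Bearer abc123"; // hypothetical client credential
console.log(allow(auth, 0));     // true  (1st request in window)
console.log(allow(auth, 1000));  // true  (2nd request in window)
console.log(allow(auth, 2000));  // false (3rd request, over the limit)
console.log(allow(auth, 61000)); // true  (window expired, counter reset)
```

Because the key is the credential rather than the IP, many clients behind one NAT are limited independently, which is the main reason to avoid source-IP-based limits here.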
- devopssong
Hi. Thanks for the reply.
Actually, I was asking about the input/output text tokens used in LLM API pricing, not JWT tokens.
For AI Gateway use cases, I believe token-based rate limiting would be more effective than traditional request-based limits.
If the token count is not in a header but in the request body, then extracting it and rate limiting on it will be a little harder. BIG-IP with iRules or NGINX with the njs JavaScript module could do it, but it will be complex.
https://clouddocs.f5.com/training/community/nginx/html/class3/module1/module12.html
https://github.com/nginx/njs-examples
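As a rough illustration of what body-based token limiting involves, here is a plain-JavaScript sketch that parses an OpenAI-style chat request body, estimates its token cost, and charges it against a per-client token budget. Everything here is an assumption for illustration: the ~4-characters-per-token estimate is a crude heuristic (not a real tokenizer), the body shape is the common `messages`/`max_tokens` format, and in njs the body would be read inside a request handler rather than passed in as a string.

```javascript
// Estimate the token cost of an OpenAI-style chat request.
// Heuristic only: roughly one token per 4 characters of message text,
// plus the requested max_tokens as the worst-case output cost.
function estimateTokens(body) {
  const req = JSON.parse(body);
  const text = (req.messages || []).map(m => m.content).join(" ");
  return Math.ceil(text.length / 4) + (req.max_tokens || 0);
}

// Fixed-window token budget: allow up to `tokensPerWindow` tokens
// per `windowMs` for each key (e.g. an API key or user id).
function makeTokenBudget(tokensPerWindow, windowMs) {
  const budgets = new Map(); // key -> { windowStart, used }
  return function allow(key, tokens, now) {
    const b = budgets.get(key);
    if (!b || now - b.windowStart >= windowMs) {
      budgets.set(key, { windowStart: now, used: tokens });
      return tokens <= tokensPerWindow;
    }
    b.used += tokens;
    return b.used <= tokensPerWindow;
  };
}

// Usage: charge each request's estimated tokens against the budget.
const body = JSON.stringify({
  messages: [{ role: "user", content: "Summarize this document please." }],
  max_tokens: 100,
});
const allow = makeTokenBudget(150, 60000); // 150 tokens per minute
const cost = estimateTokens(body);         // ~108 with this heuristic
console.log(allow("api-key-1", cost, 0)); // first request fits the budget
```

A production version would count real tokens from the model's usage response (e.g. the `usage` field many LLM APIs return) and debit the budget after the response, rather than relying on a character-count guess up front.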