This skill speeds up structured generation and agent workflows with prefix caching for JSON outputs and tool calls, boosting latency and throughput.
This skill speeds up structured generation and agent workflows with prefix caching for JSON outputs and tool calls, boosting latency and throughput.