Skip to content

Search is only available in production builds. Try building and previewing the site to test it out locally.

Throttle Detection

The Throttle Detection page identifies when API rate limits are slowing you down.

Prism runs 5 throttle detectors:

DetectorWhat it finds
Rate limit responsesHTTP 429 responses from the API
Elevated latencyResponse times significantly above baseline
Retry headersRetry-After headers in API responses
Request gapsUnusual pauses between requests suggesting backoff
Error burstsClusters of errors indicating capacity issues

The page calculates total developer hours lost to throttling:

Hours lost = Σ (throttle_duration × sessions_affected)

This is presented as:

  • Total hours lost in the selected time period
  • Per-week average — trend line showing if throttling is improving
  • Peak times — when throttling is most common (helps with scheduling)

The dashboard shows the ROI of addressing throttling:

Developer hourly rate × hours lost = cost of throttling

This helps justify infrastructure changes or plan tier upgrades.

Based on detected patterns, the page suggests:

PatternSuggestion
Frequent rate limitsUpgrade API tier or distribute requests
Peak-time throttlingShift heavy work to off-peak hours
Burst errorsImplement request queuing or backoff
Model-specific limitsUse different models to distribute load