Automatically wait for retries in `gh eval` #75

sgoedecke · 2025-07-21T22:02:07Z

This PR updates gh eval to listen for the retry-after and x-retry-timeremaining headers and automatically wait.

If we can't read either of those headers, we just wait for a minute.

@sgoedecke ➜ /workspaces/gh-models (sgoedecke/eval-retries) $ ./gh-models eval examples/sample_prompt.yml 
Running evaluation: Sample Evaluation
Description: A sample evaluation for testing the eval command
Model: deepseek-v3-0324
Test cases: 2

Running test case 1/2...
  ✓ PASSED
    ✓ string evaluator (score: 1.00)
      Expected to contain: 'hello'
    ✓ similarity check (score: 0.25)
      LLM evaluation matched choice: '2'

Running test case 2/2...
    Rate limited, waiting 49s before retry (attempt 1/4)...

Copilot

Pull Request Overview

This PR adds automatic retry functionality for rate-limited requests in the gh eval command. The implementation listens for rate limit headers and waits for the specified duration before retrying.

Introduces a new RateLimitError type with retry timing information
Adds automatic retry logic in the evaluation command with configurable wait times
Refactors model calling logic to support retry functionality

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
internal/azuremodels/azure_client.go	Adds rate limit error handling and `RateLimitError` type definition
internal/azuremodels/rate_limit_test.go	Comprehensive test coverage for rate limit error handling logic
cmd/eval/eval.go	Implements retry logic with automatic waiting and refactors model calling

internal/azuremodels/rate_limit_test.go

cmd/eval/eval.go

Eval retries

e618ed9

sgoedecke marked this pull request as ready for review July 21, 2025 22:20

Copilot AI review requested due to automatic review settings July 21, 2025 22:21

sgoedecke requested a review from a team as a code owner July 21, 2025 22:21

Copilot AI reviewed Jul 21, 2025

View reviewed changes

internal/azuremodels/rate_limit_test.go Show resolved Hide resolved

cmd/eval/eval.go Show resolved Hide resolved

cmd/eval/eval.go Outdated Show resolved Hide resolved

sgoedecke force-pushed the sgoedecke/eval-retries branch from ff4d73e to e618ed9 Compare July 21, 2025 22:27

jalafel approved these changes Jul 21, 2025

View reviewed changes

sgoedecke requested a review from jalafel July 21, 2025 22:37

Update rate limit exceeded log line

ffabf58

sgoedecke force-pushed the sgoedecke/eval-retries branch from b7ae777 to ffabf58 Compare July 21, 2025 22:40

sgoedecke merged commit b21bd7a into main Jul 21, 2025
5 checks passed

sgoedecke deleted the sgoedecke/eval-retries branch July 21, 2025 22:50

sgoedecke mentioned this pull request Jul 22, 2025

Gracefully handle 429 Too Many Request responses when rate limit is met #74

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Automatically wait for retries in `gh eval` #75

Automatically wait for retries in `gh eval` #75

Uh oh!

sgoedecke commented Jul 21, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Automatically wait for retries in gh eval #75

Automatically wait for retries in gh eval #75

Uh oh!

Conversation

sgoedecke commented Jul 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Automatically wait for retries in `gh eval` #75

Automatically wait for retries in `gh eval` #75

sgoedecke commented Jul 21, 2025 •

edited

Loading