feat: add maximum evaluation time limit with PROMPTFOO_MAX_EVAL_TIME_MS #4322
Conversation
…MS env var and maxEvalTimeMs option
…d direct action-oriented guidance
Greptile: 6 file(s) reviewed, no comment(s).
Hey @mldangelo - I've reviewed your changes and found some issues that need to be addressed.
Here's what I looked at during the review
- 🟡 General issues: 1 issue found
- 🟢 Security: all looks good
- 🟢 Testing: all looks good
- 🟢 Documentation: all looks good
Pull Request Overview

Adds a global time-limit feature for evaluations so users can cap total run time via `maxEvalTimeMs` or `PROMPTFOO_MAX_EVAL_TIME_MS`.

- Introduces a new `maxEvalTimeMs` option and corresponding environment variable for overall evaluation timeouts
- Implements global abort logic in `Evaluator.evaluate`, tracks pending steps, and records timeout results
- Updates types, schema, documentation, and tests to cover the new feature
Reviewed Changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.
| File | Description |
| --- | --- |
| test/evaluator.test.ts | Refactored mocks and added a test for aborting when maxEvalTimeMs is exceeded |
| src/types/index.ts | Added maxEvalTimeMs to EvaluateOptionsSchema and the exported type |
| src/evaluator.ts | Implemented global timeout/abort logic, tracking, and pending-result handling |
| src/envars.ts | Introduced PROMPTFOO_MAX_EVAL_TIME_MS env var and getMaxEvalTimeMs helper |
| site/static/config-schema.json | Extended JSON schema with "maxEvalTimeMs" property |
| site/docs/usage/troubleshooting.md | Updated troubleshooting guide with triage steps and examples |
| site/docs/usage/command-line.md | Documented the new PROMPTFOO_MAX_EVAL_TIME_MS and updated related tips |
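The overview describes two entry points: the `maxEvalTimeMs` option and the `PROMPTFOO_MAX_EVAL_TIME_MS` environment variable. A quick way to exercise the env-var path in a CI harness might look like the sketch below; only the variable name comes from this PR, and the spawn invocation is illustrative:

```ts
// Hypothetical CI wrapper: cap the whole eval run via the new env var.
import { spawnSync } from 'node:child_process';

const result = spawnSync('npx', ['promptfoo', 'eval'], {
  stdio: 'inherit',
  // 10-minute global cap for the entire evaluation run
  env: { ...process.env, PROMPTFOO_MAX_EVAL_TIME_MS: String(10 * 60 * 1000) },
});
process.exitCode = result.status ?? 1;
```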
```ts
 * @returns The max duration in milliseconds, or the default value if not set.
 */
export function getMaxEvalTimeMs(defaultValue: number = 0): number {
  return getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS', defaultValue);
```
The implementation calls `getEnvInt` with a `defaultValue` parameter, but `getEnvInt` only accepts one argument. This means the `defaultValue` is ignored and may cause a type error. Change to: `const val = getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS'); return val != null ? val : defaultValue;`
```diff
- return getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS', defaultValue);
+ const val = getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS');
+ return val != null ? val : defaultValue;
```
```diff
  const originalReadFileSync = fs.readFileSync;
- fs.readFileSync = jest.fn().mockImplementation((path) => {
-   if (path.includes('test_file.txt')) {
+ jest.spyOn(fs, 'readFileSync').mockImplementation((path) => {
+   if (typeof path === 'string' && path.includes('test_file.txt')) {
      return '<h1>Sample Report</h1><p>This is a test report with some data for the year 2023.</p>';
    }
    return originalReadFileSync(path);
```
[nitpick] After using `jest.spyOn` on `fs.readFileSync`, the spy is never restored and could leak into other tests. Consider adding `afterEach(() => jest.restoreAllMocks());` or calling `.mockRestore()` on the spy.
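A minimal sketch of that cleanup, with an illustrative test body rather than the PR's actual test:

```ts
import fs from 'fs';

describe('file-based fixtures', () => {
  afterEach(() => {
    jest.restoreAllMocks(); // puts the real fs.readFileSync back after each test
  });

  it('serves the stubbed report', () => {
    jest.spyOn(fs, 'readFileSync').mockReturnValue('<h1>Sample Report</h1>');
    expect(fs.readFileSync('test_file.txt', 'utf8')).toBe('<h1>Sample Report</h1>');
  });
});
```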
```ts
logger.info(
  `[${numComplete}/${serialRunEvalOptions.length}] Running ${provider} with vars: ${vars}`,
);
try {
```
[nitpick] The large try/catch block covers both serial and concurrent evaluation logic, making the method hard to follow. Consider extracting serial and concurrent processing into separate helper functions for clarity.
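One shape such an extraction could take, as a hedged sketch; the class and method names are assumptions, not the PR's actual code:

```ts
interface EvalStep {
  run(): Promise<void>;
}

class EvaluatorSketch {
  async evaluate(serial: EvalStep[], concurrent: EvalStep[]): Promise<void> {
    await this.runSerial(serial);
    await this.runConcurrent(concurrent);
  }

  // Sequential path: one provider call at a time, easy to reason about.
  private async runSerial(steps: EvalStep[]): Promise<void> {
    for (const step of steps) {
      await step.run();
    }
  }

  // Concurrent path: fire everything and let failures surface individually.
  private async runConcurrent(steps: EvalStep[]): Promise<void> {
    await Promise.allSettled(steps.map((step) => step.run()));
  }
}
```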
Co-authored-by: gru-agent[bot] <185149714+gru-agent[bot]@users.noreply.github.com>
Tusk: ⏩ No test scenarios generated (24f7e7d).
Walkthrough

This change introduces a global evaluation timeout feature across the documentation, configuration schema, environment variable utilities, evaluator logic, and tests. Two new environment variables, …

Warning: There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 ESLint: npm error Exit handler never called!
Actionable comments posted: 3
🔭 Outside diff range comments (1)

src/evaluator.ts (1), lines 1027-1030: ⚠️ Potential issue: Global timeout signal is lost inside the per-step timeout wrapper

`processEvalStepWithTimeout` overwrites `evalStep.abortSignal`, dropping the global `maxEvalTimeMs` signal. Running steps will keep going after the global abort.

```diff
- const evalStepWithSignal = {
-   ...evalStep,
-   abortSignal: signal,
- };
+ const evalStepWithSignal = {
+   ...evalStep,
+   abortSignal: evalStep.abortSignal
+     ? /* combine signals */ ((AbortSignal as any).any
+         ? (AbortSignal as any).any([evalStep.abortSignal, signal])
+         : (() => {
+             const ac = new AbortController();
+             const handler = () => ac.abort();
+             evalStep.abortSignal.addEventListener('abort', handler);
+             signal.addEventListener('abort', handler);
+             return ac.signal;
+           })())
+     : signal,
+ };
```

This ensures both per-step and global cancellations propagate to the provider.
♻️ Duplicate comments (1)

test/evaluator.test.ts (1), lines 30-44: Redundant `result` variable can be inlined. Returning the object directly keeps the mock concise and removes one unnecessary allocation.
🧹 Nitpick comments (4)

site/docs/usage/command-line.md (1), lines 517-518: Add default values for the timeout environment variables. The table entries for `PROMPTFOO_EVAL_TIMEOUT_MS` and `PROMPTFOO_MAX_EVAL_TIME_MS` lack default values. Consider adding `0` to the Default column to reflect the fallback behavior.

site/docs/usage/troubleshooting.md (1), lines 175-181: Minor wording nitpick: "…providers that get stuck" → "…providers that become stuck" for a slightly more formal tone (mirrors the LanguageTool warning below).

🪛 LanguageTool: [style] ~178: The verb "get" can be informal. Consider replacing it with a form of "to be". Context: "...ndle custom providers or providers that get stuck - Prevent runaway costs from long-runni..." (GET_USED_ARE_USED)
test/evaluator.test.ts (2)

Lines 302-308: Optional: speed up the test by switching to fake timers. This test relies on real timers; `jest.useFakeTimers()` plus `advanceTimersByTime()` would make it complete instantly and avoid flakiness on busy CI runners.
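A sketch of the fake-timer variant; the `evaluate` call shape is assumed here, not promptfoo's actual API:

```ts
declare function evaluate(suite: unknown, opts: { maxEvalTimeMs: number }): Promise<unknown>;
declare const testSuite: unknown;

it('aborts once maxEvalTimeMs elapses', async () => {
  jest.useFakeTimers();
  try {
    const pending = evaluate(testSuite, { maxEvalTimeMs: 100 }); // hypothetical call
    jest.advanceTimersByTime(101); // fires the global setTimeout without real waiting
    await pending;
  } finally {
    jest.useRealTimers();
  }
});
```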
Lines 2297-2369: Test looks good, but add `afterEach` cleanup for mocks. `slowApiProvider.cleanup` is asserted, but the spy itself isn't restored. Add `afterEach(() => jest.restoreAllMocks())` in this describe block to avoid leakage into other tests.
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📒 Files selected for processing (8)

- site/docs/usage/command-line.md (2 hunks)
- site/docs/usage/troubleshooting.md (1 hunks)
- site/static/config-schema.json (1 hunks)
- src/envars.ts (2 hunks)
- src/evaluator.ts (5 hunks)
- src/types/index.ts (1 hunks)
- test/envars.test.ts (2 hunks)
- test/evaluator.test.ts (3 hunks)
🧰 Additional context used

📓 Path-based instructions (1)

`src/**`: This is a CLI tool so errors need to be handled gracefully and logged with lots of information so the user can give us enough data to fix the issue or pass it to the developers.

Applied to:
- src/types/index.ts
- src/envars.ts
- src/evaluator.ts

🧬 Code Graph Analysis (1)

test/envars.test.ts (1) → src/envars.ts (1): `getMaxEvalTimeMs` (lines 399-401)
🪛 LanguageTool

site/docs/usage/troubleshooting.md: [style] ~178: The verb "get" can be informal. Consider replacing it with a form of "to be". Context: "...ndle custom providers or providers that get stuck - Prevent runaway costs from long-runni..." (GET_USED_ARE_USED)
⏰ Context from checks skipped due to timeout of 90000ms (17)
- GitHub Check: Share Test
- GitHub Check: Redteam Custom Enterprise Server
- GitHub Check: Redteam
- GitHub Check: Test on Node 24.x and macOS-latest
- GitHub Check: Test on Node 20.x and windows-latest
- GitHub Check: Test on Node 22.x and macOS-latest
- GitHub Check: Test on Node 24.x and ubuntu-latest
- GitHub Check: Test on Node 18.x and ubuntu-latest
- GitHub Check: Test on Node 18.x and macOS-latest
- GitHub Check: Test on Node 18.x and windows-latest
- GitHub Check: Build on Node 20.x
- GitHub Check: Style Check
- GitHub Check: Build Docs
- GitHub Check: Build on Node 24.x
- GitHub Check: Build on Node 22.x
- GitHub Check: Build on Node 18.x
- GitHub Check: Analyze (javascript-typescript)
🔇 Additional comments (4)

src/envars.ts (2)

Line 41: Added `PROMPTFOO_MAX_EVAL_TIME_MS` to `EnvVars`. This mirrors the existing `PROMPTFOO_EVAL_TIMEOUT_MS` entry and correctly expands the supported environment variables.

Lines 394-402: Implemented `getMaxEvalTimeMs` correctly. The function consistently uses `getEnvInt` with a default value and matches the pattern of `getEvalTimeoutMs`.

site/static/config-schema.json (1)

Lines 1523-1526: Schema extension for `maxEvalTimeMs` is correct. The new numeric property is appropriately added under `evaluateOptions` and aligns with `timeoutMs`.

test/envars.test.ts (1)

Lines 346-386: Questionable acceptance of negative timeouts.

The new negative-value test (lines 377-380) expects `getMaxEvalTimeMs()` to return `-1000`. A negative global timeout has no clear meaning (should the evaluation abort immediately? be unlimited?). Accepting it silently risks surprises later in `evaluator.ts`.

Consider clamping to `0` or rejecting values below 0 in `getMaxEvalTimeMs` (and adjusting the schema as suggested) instead of supporting an undefined semantic.

If negative values are intentionally supported, add documentation clarifying their effect and a unit test in `evaluator.test.ts` that demonstrates the expected behaviour.
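A minimal clamping sketch, assuming the two-argument `getEnvInt` shown earlier in this thread; treating non-positive values as "no limit" is one possible design choice here, not the PR's shipped behavior:

```ts
import { getEnvInt } from './envars'; // path is illustrative

export function getMaxEvalTimeMsClamped(defaultValue: number = 0): number {
  const raw = getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS', defaultValue);
  // Negative or non-finite values fall back to 0, i.e. "no global cap".
  return Number.isFinite(raw) && raw > 0 ? Math.floor(raw) : 0;
}
```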
```ts
  timeoutMs: z.number().optional(),
  /**
   * Maximum runtime in milliseconds for the entire evaluation. Default is 0 (no limit).
   */
  maxEvalTimeMs: z.number().optional(),
});
```
🛠️ Refactor suggestion

Make `maxEvalTimeMs` an integer ≥ 0 to prevent invalid configurations.

`timeoutMs` and `maxEvalTimeMs` serve the same purpose (time limits) and should behave identically. Using just `.number()` accepts floats and negative values even though neither makes sense for a timeout. Tighten the schema to match the runtime helper (`getMaxEvalTimeMs`, which floors the value to an int) and avoid unexpected behaviour.

```diff
-  /**
-   * Maximum runtime in milliseconds for the entire evaluation. Default is 0 (no limit).
-   */
-  maxEvalTimeMs: z.number().optional(),
+  /**
+   * Maximum runtime in milliseconds for the entire evaluation.
+   * Must be an integer ≥ 0. 0 means “no limit”.
+   */
+  maxEvalTimeMs: z.coerce.number().int().nonnegative().optional(),
```
📝 Committable suggestion (carefully review before committing: ensure it accurately replaces the highlighted code, contains no missing lines, and has no indentation issues):
```ts
  timeoutMs: z.number().optional(),
  /**
   * Maximum runtime in milliseconds for the entire evaluation.
   * Must be an integer ≥ 0. 0 means “no limit”.
   */
  maxEvalTimeMs: z.coerce.number().int().nonnegative().optional(),
});
```
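A quick behavioral check of the tightened schema (zod v3.20+ assumed, for `z.coerce`):

```ts
import { z } from 'zod';

const opts = z.object({
  maxEvalTimeMs: z.coerce.number().int().nonnegative().optional(),
});

opts.parse({});                        // ok: the field is optional
opts.parse({ maxEvalTimeMs: 5000 });   // ok
opts.parse({ maxEvalTimeMs: '5000' }); // ok: coerced to the number 5000
// opts.parse({ maxEvalTimeMs: -1 });  // throws: number must be >= 0
// opts.parse({ maxEvalTimeMs: 1.5 }); // throws: expected integer
```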
🤖 Prompt for AI Agents

In src/types/index.ts around lines 188 to 193, the maxEvalTimeMs field is currently defined as an optional number, which allows floats and negative values. To fix this, change the schema to require maxEvalTimeMs to be an integer greater than or equal to 0, matching the behavior of timeoutMs and the runtime helper getMaxEvalTimeMs. Use appropriate zod methods to enforce integer type and non-negativity to prevent invalid timeout configurations.
```ts
if (globalTimeout) {
  clearTimeout(globalTimeout);
}

if (evalTimedOut) {
  for (let i = 0; i < runEvalOptions.length; i++) {
    if (!processedIndices.has(i)) {
      const evalStep = runEvalOptions[i];
      const timeoutResult = {
        provider: {
          id: evalStep.provider.id(),
          label: evalStep.provider.label,
          config: evalStep.provider.config,
        },
        prompt: {
          raw: evalStep.prompt.raw,
          label: evalStep.prompt.label,
          config: evalStep.prompt.config,
        },
        vars: evalStep.test.vars || {},
        error: `Evaluation exceeded max duration of ${maxEvalTimeMs}ms`,
        success: false,
        failureReason: ResultFailureReason.ERROR,
        score: 0,
        namedScores: {},
        latencyMs: Date.now() - startTime,
        promptIdx: evalStep.promptIdx,
        testIdx: evalStep.testIdx,
        testCase: evalStep.test,
        promptId: evalStep.prompt.id || '',
      } as EvaluateResult;

      await this.evalRecord.addResult(timeoutResult);
      this.stats.errors++;
      const { metrics } = prompts[evalStep.promptIdx];
      if (metrics) {
        metrics.testErrorCount += 1;
        metrics.totalLatencyMs += timeoutResult.latencyMs;
      }
    }
  }
}
```
🛠️ Refactor suggestion

Unprocessed-step timeout results update prompt metrics but not top-level token usage. `this.stats.errors++` is updated, yet the aggregate `tokenUsage` isn't touched. If any tokens were spent before the global abort, totals will be under-reported. Consider adding the same token-usage accumulation you perform elsewhere.
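A sketch of what that accumulation might look like; the field names mirror common token-usage shapes and are assumptions, not necessarily promptfoo's actual TokenUsage type:

```ts
interface TokenUsageSketch {
  total: number;
  prompt: number;
  completion: number;
}

function addTokenUsage(aggregate: TokenUsageSketch, step?: Partial<TokenUsageSketch>): void {
  if (!step) {
    return; // nothing was spent before the abort
  }
  aggregate.total += step.total ?? 0;
  aggregate.prompt += step.prompt ?? 0;
  aggregate.completion += step.completion ?? 0;
}
```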
🤖 Prompt for AI Agents

In src/evaluator.ts between lines 1395 and 1436, the timeout handling code updates error counts and prompt-level metrics but does not update the top-level token usage statistics. To fix this, add the same token usage accumulation logic here as used elsewhere in the evaluation process, ensuring that any tokens consumed before the timeout are correctly added to the aggregate tokenUsage totals.
```ts
const startTime = Date.now();
const maxEvalTimeMs = options.maxEvalTimeMs ?? getMaxEvalTimeMs();
let evalTimedOut = false;
let globalTimeout: NodeJS.Timeout | undefined;
let globalAbortController: AbortController | undefined;
const processedIndices = new Set<number>();

if (maxEvalTimeMs > 0) {
  globalAbortController = new AbortController();
  options.abortSignal = options.abortSignal
    ? AbortSignal.any([options.abortSignal, globalAbortController.signal])
    : globalAbortController.signal;
  globalTimeout = setTimeout(() => {
    evalTimedOut = true;
    globalAbortController?.abort();
  }, maxEvalTimeMs);
}
```
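The snippet above pairs one AbortController with one setTimeout for the whole run. The same pattern in isolation, as a self-contained sketch rather than promptfoo code:

```ts
// Run tasks under a shared time budget: one controller, one timer, every task
// receives the shared signal and should stop work when it fires.
async function runWithBudget<T>(
  tasks: Array<(signal: AbortSignal) => Promise<T>>,
  budgetMs: number,
): Promise<PromiseSettledResult<T>[]> {
  const controller = new AbortController();
  const timer = setTimeout(() => controller.abort(), budgetMs);
  try {
    return await Promise.allSettled(tasks.map((task) => task(controller.signal)));
  } finally {
    clearTimeout(timer); // mirrors the clearTimeout(globalTimeout) cleanup in the diff
  }
}
```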
`AbortSignal.any` may break Node ≤ 18 and older TypeScript targets.

`AbortSignal.any()` is only available from Node 20 / lib.dom 2023. Building or running on older runtimes will fail at compile time and at runtime.

```diff
- options.abortSignal = options.abortSignal
-   ? AbortSignal.any([options.abortSignal, globalAbortController.signal])
-   : globalAbortController.signal;
+ options.abortSignal = options.abortSignal
+   ? // Fallback for Node <20
+     (AbortSignal as any).any
+     ? (AbortSignal as any).any([options.abortSignal, globalAbortController.signal])
+     : (() => {
+         const ac = new AbortController();
+         const handler = () => ac.abort();
+         options.abortSignal!.addEventListener('abort', handler);
+         globalAbortController.signal.addEventListener('abort', handler);
+         return ac.signal;
+       })()
+   : globalAbortController.signal;
```

Provide a polyfill or guard as above, or bump the minimum Node / TS target.
📝 Committable suggestion (carefully review before committing: ensure it accurately replaces the highlighted code, contains no missing lines, and has no indentation issues):
```ts
const startTime = Date.now();
const maxEvalTimeMs = options.maxEvalTimeMs ?? getMaxEvalTimeMs();
let evalTimedOut = false;
let globalTimeout: NodeJS.Timeout | undefined;
let globalAbortController: AbortController | undefined;
const processedIndices = new Set<number>();

if (maxEvalTimeMs > 0) {
  globalAbortController = new AbortController();
  options.abortSignal = options.abortSignal
    ? // Fallback for Node <20
      (AbortSignal as any).any
      ? (AbortSignal as any).any([options.abortSignal, globalAbortController.signal])
      : (() => {
          const ac = new AbortController();
          const handler = () => ac.abort();
          options.abortSignal!.addEventListener('abort', handler);
          globalAbortController.signal.addEventListener('abort', handler);
          return ac.signal;
        })()
    : globalAbortController.signal;
  globalTimeout = setTimeout(() => {
    evalTimedOut = true;
    globalAbortController?.abort();
  }, maxEvalTimeMs);
}
```
🤖 Prompt for AI Agents

In src/evaluator.ts around lines 531 to 547, the use of AbortSignal.any() is not supported in Node.js versions 18 and below or older TypeScript targets, causing build and runtime failures. To fix this, implement a polyfill or conditional check that only uses AbortSignal.any() if it exists; otherwise, fall back to an alternative approach such as manually combining multiple abort signals. This ensures compatibility with older Node.js versions without raising errors.
Adds a maximum evaluation time limit via a new `PROMPTFOO_MAX_EVAL_TIME_MS` environment variable and a `maxEvalTimeMs` API option. Includes comprehensive documentation and test coverage. Useful for CI/CD time limits, cost control, and preventing runaway evaluations.
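A usage sketch for the API option, based on the description above; the option name comes from this PR, while the rest of the call (prompts, providers, tests) is illustrative:

```ts
import promptfoo from 'promptfoo';

const results = await promptfoo.evaluate(
  {
    prompts: ['Summarize: {{text}}'],
    providers: ['openai:gpt-4o-mini'],
    tests: [{ vars: { text: 'The quick brown fox…' } }],
  },
  { maxEvalTimeMs: 30 * 60 * 1000 }, // hard 30-minute ceiling for the whole run
);
console.log(results);
```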