Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
`transform` property by @typpo in https://github.com/promptfoo/promptfoo/pull/696
`eval -n` arg for running the first n test cases by @typpo in https://github.com/promptfoo/promptfoo/pull/700
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.54.1...0.55.0
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.54.0...0.54.1
Answer-relevance calculation by @anthonyivn2 in https://github.com/promptfoo/promptfoo/pull/683
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.53.0...0.54.0
When using promptfoo as a node library, `Assertion` value functions are now invoked with the same args as when using the CLI. See `AssertionValueFunction` for type details.
In practice, this change means that instead of:

```javascript
function assertValue(output, testCase, assertion) { ... }
```

You can do:

```javascript
function assertValue(output, { prompt, vars, test }) { ... }
```

The reason for this change is that the CLI and the library previously accepted functions with different signatures, which was confusing, and only the CLI signature was documented.
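For illustration, here is a hypothetical standalone assertion function using the new destructured signature. The `topic` var and the GradingResult-style return shape are assumptions for the sketch, not taken from the PR:

```javascript
// Hypothetical assertion value function: passes when the model output
// mentions the test's `topic` variable.
function assertValue(output, { prompt, vars, test }) {
  const mentioned = output.includes(vars.topic);
  return {
    pass: mentioned,
    score: mentioned ? 1 : 0,
    reason: mentioned ? 'output mentions topic' : 'topic missing from output',
  };
}

// Because it is a plain function, it can be called directly with sample arguments:
const result = assertValue('All about cats', {
  prompt: 'Tell me about {{topic}}',
  vars: { topic: 'cats' },
  test: {},
});
console.log(result.pass); // true
```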
assert function call consistent with external js function call by @typpo in https://github.com/promptfoo/promptfoo/pull/674
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.52.0...0.53.0
`E2BIG` error during the execution of Python asserts by @sangwoo-joh in https://github.com/promptfoo/promptfoo/pull/660
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.51.0...0.52.0
Python assertions now expect a `get_assert` function which returns a native value, rather than parsing stdout (#594). This means instead of:
```python
print(json.dumps(result))
```
You should just return the assertion result:
```python
return result
```
Here's a full example of a `custom_assert.py`:
```python
from typing import Any, Dict, Union

def get_assert(output, context) -> Union[bool, float, Dict[str, Any]]:
    print('Prompt:', context['prompt'])
    print('Vars:', context['vars']['topic'])

    # Determine the result...
    result = test_output(output)

    # Here's an example GradingResult dict
    result = {
        'pass': True,
        'score': 0.6,
        'reason': 'Looks good to me',
    }
    return result
```
See documentation
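Since `get_assert` is a plain function, it can also be exercised outside promptfoo. A minimal sketch, where the `topic` var and the returned dict's contents are illustrative assumptions:

```python
from typing import Any, Dict, Union

def get_assert(output: str, context: Dict[str, Any]) -> Union[bool, float, Dict[str, Any]]:
    # Hypothetical check: pass when the configured topic appears in the model output.
    topic = context['vars']['topic']
    passed = topic in output
    return {
        'pass': passed,
        'score': 1.0 if passed else 0.0,
        'reason': f'topic {topic!r} {"found" if passed else "missing"}',
    }

# Called directly with sample arguments:
result = get_assert('All about cats', {'prompt': 'Tell me about {{topic}}', 'vars': {'topic': 'cats'}})
print(result['pass'])  # True
```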
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.50.1...0.51.0
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.50.0...0.50.1
`transform` by @typpo in https://github.com/promptfoo/promptfoo/pull/605
`NEXT_PUBLIC_PROMPTFOO_REMOTE_BASE_URL` by @typpo in https://github.com/promptfoo/promptfoo/pull/609
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.49.3...0.50.0
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.49.2...0.49.3
Full Changelog: https://github.com/promptfoo/promptfoo/compare/0.49.1...0.49.2