Eval Results Query
Read grouped evaluation result rows for one or more evaluations.
Documentation Index
Fetch the complete documentation index at: https://wb-21fd5541-docs-2658.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Basic authentication header of the form Basic <encoded-value>, where <encoded-value> is the base64-encoded string username:password.
Body
Evaluation root call IDs to include.
Alias for evaluation call IDs from the Evaluation Runs API.
When true, only include rows present in all requested evaluations.
When true, populate raw_data_row on each result row. Inline rows are returned as their dict value; dataset-referenced rows are returned as the ref string unless resolve_row_refs is also true.
When true (requires include_raw_data_rows=True), resolve dataset-row reference strings to actual row data via a table lookup. When false, dataset-row refs are returned as-is.
When true, include grouped row/trial data in rows and compute total_rows for the requested row-level view.
When true, include aggregated scorer/evaluation summary data in summary.
Optional intersection behavior for the summary section. When null, the value of require_intersection is used.
When true (default), fetch child calls (predict/score) of each predict_and_score call to populate predict_call_id, scorer_call_ids, and more precise latency/token data. When false, these fields are derived from the predict_and_score call itself (predict_call_id and scorer_call_ids will be null/empty).
Sort specification for result rows. Supported field prefixes: scores., inputs., outputs.. When null, rows are sorted by row_digest ASC.
Filters applied to grouped rows. Multiple filters are AND'd together.
Optional row-level page size applied after grouping and intersection.
Optional row-level page offset applied after grouping and intersection.