Skip to main content
Agents in Netra represent your external AI applications: chatbots, copilots, or any HTTP-accessible endpoint. Connect them once, then trigger automated test suite runs against your Datasets directly from the UI. No SDK integration required—Netra calls your agent’s API, collects responses, and runs Evaluators to score every interaction.

Why Agents Matter

Testing AI agents manually doesn’t scale. Connecting your agent to Netra unlocks automated, repeatable evaluation:
ChallengeHow Agents Help
Manual testing is slowRun entire datasets against your agent with a single click
Inconsistent test coverageEvery dataset item is tested systematically with the same evaluators
No multi-turn testingNetra simulates realistic users for multi-turn conversation testing
Disconnected tracingAgent responses are automatically traced and linked to evaluation results
Environment-specific bugsTest against staging, production, or any HTTP endpoint

How It Works

Netra provides two ways to interact with your connected agent:

Playground

Send individual messages to your agent in an interactive chat. Ideal for quick validation while configuring endpoints and response mappings.

Test Suite Run

Trigger a full dataset evaluation against your agent. Netra fans out every dataset item, collects responses, runs evaluators, and reports results.

Agents Dashboard

Navigate to Simulation → Agents from the left navigation panel. The dashboard displays all configured agents for the current project. Screenshot From 2026 06 12 14 52 01 Each agent card shows:
FieldDescription
NameAgent display name
EndpointThe HTTP method and base URL
CreatedWhen the agent was configured

Configuring an Agent

Click Add Agent in the top right corner to configure a new agent connection. Screenshot From 2026 06 12 14 53 07 The configuration page is split into two panels—API settings on the left and an interactive Playground on the right.
1

Name Your Agent

Enter a descriptive name (e.g., “Customer Support Bot - Staging”). This name identifies the agent when selecting it for test suite runs.
2

Set HTTP Method & Base URL

Choose the HTTP method and enter your agent’s endpoint URL.
MethodTypical Use
POSTMost common—send user messages in the request body
GETQuery-based agents with URL parameters
PUT / PATCHAgents that update state as part of the conversation
3

Configure Headers

Add any custom headers your agent requires. A Content-Type: application/json header is included by default.
Headers that contain sensitive values (API keys, tokens) are encrypted at rest and masked when viewed by project members.
4

Set Authentication

Choose your authentication method:
Auth TypeConfiguration
No AuthNo additional credentials required
Bearer TokenProvide a bearer token, sent as Authorization: Bearer <token>
API KeyProvide a key name and value, sent as a custom header
5

Define Request Body

Write a JSON body template using variable placeholders. Netra replaces these with actual values at runtime.Default template:
{
  "message": "{{userMessage}}",
  "session_id": "{{sessionId}}"
}
VariableDescription
{{userMessage}}The user message from the dataset item or playground input
{{sessionId}}A session identifier for multi-turn conversation continuity
Structure the body to match your agent’s expected request format. Any valid JSON is supported, nest variables at any depth.
6

Map Response Fields

Tell Netra where to find the agent’s response in the JSON output using JSONPath expressions.
FieldRequiredExampleDescription
Text PathYes$.responseJSONPath to the agent’s text reply
Session PathNo$.session_idJSONPath to a session ID for multi-turn continuity
The text path must start with $. and point to the field containing the agent’s response text.
7

Save

Click Create Agent to save the configuration. The agent becomes available for test suite runs and appears on the Agents dashboard.

Testing in the Playground

The Playground is an interactive chat panel on the right side of the agent configuration page. Use it to validate your endpoint configuration before running full evaluations. Screenshot From 2026 06 12 14 53 07

Sending Messages

  1. Type a message in the input field at the bottom of the Playground panel
  2. Press Enter or click the send button
  3. Netra constructs the HTTP request using your configuration, calls your agent, and displays the response

What Happens Behind the Scenes

When you send a playground message:
  1. Netra resolves your body template: replacing {{userMessage}} with your input and {{sessionId}} with the current session
  2. An async job is created and your agent’s endpoint is called via HTTP
  3. The response is parsed using your configured Text Path to extract the reply
  4. If a Session Path is configured, the session ID is stored for subsequent messages in the same conversation

Multi-Turn Conversations

The Playground maintains conversation context through sessions. Each message in the same Playground session includes the sessionId from the previous response, enabling multi-turn conversations with stateful agents. Click New Chat to reset the session and start a fresh conversation.
Validation runs before each message is sent. The agent name, base URL, and response text path are required. If the body template contains invalid JSON, you’ll see an error before the message is dispatched.

Triggering a Test Suite Run

This is the primary workflow for the Agent Trigger feature: running an entire Dataset against a configured agent from the UI.

Prerequisites

Before triggering a run, ensure you have:
  • At least one Agent configured in the current project
  • A Dataset with items (single-turn) or scenarios (multi-turn)
  • Evaluators attached to the dataset for scoring

Starting a Run

1

Open Your Dataset

Navigate to Simulation → Datasets and open the dataset you want to evaluate.
2

Click Run

In the dataset detail page toolbar, click the Run button.Screenshot From 2026 06 12 15 56 45
3

Configure the Run

The Run Test Suite modal appears with the following fields:
FieldRequiredDescription
AgentYesSelect a configured agent from the dropdown
Run NameYesAuto-generated as {Dataset Name} - {Date}, editable
The modal also displays read-only context: dataset name, turn type (single or multi), record count, and evaluator count.Screenshot From 2026 06 12 14 54 49
4

Confirm and Run

Click Run to trigger the test suite. On success, you are redirected to the Test Run detail page to monitor progress.
You can also trigger test suite runs programmatically using the SDK—call run_test_suite for single-turn datasets or run_simulation for multi-turn datasets. See the SDK Reference for details.

Monitoring Results

After triggering a run, you land on the Test Run detail page. The page updates as items complete.

Status Summary

The summary card at the top shows aggregate metrics:
MetricDescription
Passed / FailedNumber of items that passed or failed evaluation
Not AvailableItems where evaluation could not complete
Total CostAggregate cost across all agent calls
Average LatencyMean response time from your agent
DurationTotal wall-clock time for the run
Use the refresh button to update the summary while the run is in progress.

Run Statuses

StatusMeaning
RunningItems are still being processed
CompletedAll items have finished and evaluations are scored
FailedAll items failed—check agent configuration and endpoint availability
CancelledThe run was manually stopped

Per-Item Results

Each item in the results table shows:
ColumnDescription
InputThe original dataset item input
Expected OutputThe ground truth from the dataset (if provided)
Agent OutputThe response from your agent
Run StatusWhether the agent call succeeded or failed
Eval StatusWhether the item passed or failed evaluations
TraceLink to the execution trace for debugging
Click any row to view detailed evaluation results, variable comparisons, or the full multi-turn conversation transcript.
Last modified on June 12, 2026