feat: nemo gym integration by cmunley1 · Pull Request #1053 · PrimeIntellect-ai/verifiers

cmunley1 · 2026-03-23T07:26:12Z

Description

Integrates NVIDIA NeMo Gym with a few example environments. Entire multiturn rollout is offloaded to nemo gym through the rollout collection helper in nemo gym, which includes agent, model, and resources servers.
Example usage

vf-eval nemo-gym-workplace-assistant -m local -d -n 1 -r 1
vf-eval nemo-gym-structured-outputs -m local -d -n 1 -r 1    
vf-eval nemo-gym-code-gen -m local -d -n 1 -r 1    
vf-eval nemo-gym-reasoning-gym -m local -d -n 1 -r 1    
vf-eval nemo-gym-reasoning-gym-reflection -m local -d -n 1 -r 1

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Test improvement

Testing

All existing tests pass when running uv run pytest locally.
New tests have been added to cover the changes

Checklist

My code follows the style guidelines of this project as outlined in AGENTS.md
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Additional Notes

Note

Medium Risk
Introduces a new async/threaded integration that starts and manages external NeMo Gym servers and maps their rollouts into Verifiers state, plus expands local install path resolution; these changes touch execution lifecycle and environment installation behavior.

Overview
Adds NVIDIA NeMo Gym integration via a new NemoGymEnv that delegates rollouts to NeMo Gym’s rollout collection helper, manages head/policy server configuration and lifecycle, and converts NeMo Gym results into Verifiers trajectory/completion plus a reward sourced from NeMo Gym.

Adds multiple example NeMo Gym environment packages under environments/nemo_gym/* (each exposing load_environment() and pyproject.toml metadata) and updates docs/README and tests to reference the new integration and skip the parent nemo_gym directory.

Improves tooling compatibility by letting install_from_local() discover environments one directory deeper (nested under environments/*/<env_name>) and treating raw messages with role developer as system in message normalization.

^{Reviewed by Cursor Bugbot for commit bb22743. Bugbot is set up for automated code reviews on this repo. Configure here.}

Signed-off-by: cmunley1 <cmunley@nvidia.com>

…t name, use endpoint params, rename packages to hyphens, update readme, ruff Signed-off-by: cmunley1 <cmunley@nvidia.com>

Signed-off-by: cmunley1 <cmunley@nvidia.com>

cursor · 2026-04-04T09:55:05Z

+        tool_call = ToolCall(
+            id=str(item.get("call_id") or item.get("id") or uuid.uuid4().hex[:8]),
+            name=str(item.get("name", "")),
+            arguments=str(item.get("arguments", "{}")),


Tool call arguments may produce invalid JSON

Medium Severity

_nemo_item_to_assistant_message uses str() to convert the arguments field of a function call item, but ToolCall.arguments is expected to be a valid JSON string throughout the codebase. If NeMo Gym returns arguments as a dict, str() produces Python repr (e.g., "{'key': 'value'}") instead of valid JSON ('{"key": "value"}'). Multiple consumers in the codebase call json.loads(tool_call.arguments) (in tool_env.py, stateful_tool_env.py, etc.), which would fail on Python repr format. The existing _normalize_raw_tool_calls in message_utils.py handles this correctly by checking isinstance(arguments, str) first and falling back to json.dumps().

^{Reviewed by Cursor Bugbot for commit 8145b83. Configure here.}

Signed-off-by: cmunley1 <cmunley@nvidia.com>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit bb22743. Configure here.}

cursor · 2026-04-04T10:21:13Z

 - **`ReasoningGymEnv`** — wraps [reasoning-gym](https://github.com/open-thought/reasoning-gym) procedural datasets
 - **`BrowserEnv`** — unified browser automation via [Browserbase](https://browserbase.com) with DOM and CUA modes
 - **`OpenEnvEnv`** — wraps OpenEnv gym and MCP contracts using Prime Sandboxes with prebuilt images referenced from `.build.json`
+- **`NemoGymEnv`** — wraps [NVIDIA NeMo Gym](https://github.com/NVIDIA-NeMo/Gym) environments. 


NemoGymEnv install instructions missing from docs

Low Severity

NemoGymEnv is added to the integrations list but the follow-up install paragraph at line 794 only mentions extras for TextArena, BrowserEnv, and OpenEnvEnv. Since NemoGymEnv requires a different install flow (editable local clone, no pip extra defined in pyproject.toml), users reading "These require additional dependencies installed via extras" will be confused about how to install the NeMo Gym dependency.

^{Triggered by project rule: BugBot Instructions}

^{Reviewed by Cursor Bugbot for commit bb22743. Configure here.}

eligotts and others added 12 commits March 2, 2026 14:03

first nemo gym port

c8bd393

nemo gym adapter with lots of examples

25152e6

ty

cf4cc8f

reorganize

ba30931

draft

c1eadaf

Signed-off-by: cmunley1 <cmunley@nvidia.com>

updates

a8e5238

Signed-off-by: cmunley1 <cmunley@nvidia.com>

rename

682c720

Signed-off-by: cmunley1 <cmunley@nvidia.com>

tidy

ede6284

Signed-off-by: cmunley1 <cmunley@nvidia.com>

updates

73e9f0f

Signed-off-by: cmunley1 <cmunley@nvidia.com>

readme

3acac67

Signed-off-by: cmunley1 <cmunley@nvidia.com>

readme

893bbcc

Signed-off-by: cmunley1 <cmunley@nvidia.com>

drop some args

51f4bd1

Signed-off-by: cmunley1 <cmunley@nvidia.com>

cmunley1 changed the title ~~feat: nemo gym integration draft~~ feat: nemo gym integration Mar 23, 2026

cmunley1 added 3 commits March 27, 2026 21:15

rename nemo to nemo_gym

6c566fe

Signed-off-by: cmunley1 <cmunley@nvidia.com>

add more env examples

fb5c918

Signed-off-by: cmunley1 <cmunley@nvidia.com>

revert endpoints

757c2ff

Signed-off-by: cmunley1 <cmunley@nvidia.com>

cmunley1 marked this pull request as ready for review March 28, 2026 05:16

cursor bot reviewed Mar 28, 2026

View reviewed changes

Comment thread environments/README.md Outdated

Comment thread verifiers/envs/integrations/nemo_gym/env.py Outdated

cmunley1 added 2 commits April 4, 2026 02:48

skip nemo_gym parent dir in tests, remove python path war, cache agen…

7647929

…t name, use endpoint params, rename packages to hyphens, update readme, ruff Signed-off-by: cmunley1 <cmunley@nvidia.com>

import top and comment

8145b83

Signed-off-by: cmunley1 <cmunley@nvidia.com>

cursor bot reviewed Apr 4, 2026

View reviewed changes

multi endpoint support

d0cb639

Signed-off-by: cmunley1 <cmunley@nvidia.com>

cursor bot reviewed Apr 4, 2026

View reviewed changes

Comment thread verifiers/envs/integrations/nemo_gym/env.py

docs

bb22743

Signed-off-by: cmunley1 <cmunley@nvidia.com>

cursor bot reviewed Apr 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: nemo gym integration#1053

feat: nemo gym integration#1053
cmunley1 wants to merge 19 commits intoPrimeIntellect-ai:mainfrom
cmunley1:cmunley1/nemo_gym_int

cmunley1 commented Mar 23, 2026 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

Uh oh!

cursor bot Apr 4, 2026

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cmunley1 commented Mar 23, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Testing

Checklist

Additional Notes

Uh oh!

Uh oh!

Uh oh!

cursor bot Apr 4, 2026

Choose a reason for hiding this comment

Tool call arguments may produce invalid JSON

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Apr 4, 2026

Choose a reason for hiding this comment

NemoGymEnv install instructions missing from docs

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cmunley1 commented Mar 23, 2026 •

edited by cursor bot

Loading