LangSplatV2 LERF OVS Evaluation by swahtz · Pull Request #56 · openvdb/fvdb-examples

swahtz · 2026-03-04T06:06:52Z

Fixes #44

- Added `train_eval.sh` and `train_scenes.sh` for streamlined training and evaluation processes. - Introduced `download_data.py` for downloading the LERF-OVS dataset, along with `eval_lerf.py` for model evaluation on this dataset. - Created `environment.yml` to define the Conda environment for LangSplatV2, including necessary dependencies. - Updated `.gitignore` to exclude new log and result directories. - Enhanced `train_langsplatv2.py` to save final checkpoints and log paths for better tracking of training progress. - Various alterations to the training, data generation and evaluation to be more inline with the original implementation results. These changes improve the usability and functionality of the LangSplatV2 framework for open-vocabulary segmentation tasks. Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Copilot

Pull request overview

Adds a LERF-OVS evaluation workflow and updates the LangSplatV2 preprocessing/training pipeline to better match the original LangSplatV2 feature/mask generation and checkpointing conventions (Fixes #44).

Changes:

Add LERF-OVS dataset download utilities and a full evaluation script (OpenCLIP relevancy, mIoU + localization).
Add SAM1 multi-scale mask generation, shared mask post-processing utilities, and an “import original LangSplatV2 features” transform for A/B comparisons.
Improve training/preprocessing ergonomics (top-level final_checkpoint.pt, GPU-based CLIP crop/resize, codebook init from all levels, updated deps/docs).

Reviewed changes

Copilot reviewed 27 out of 27 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
open_vocabulary_segmentation/langsplatv2/scripts/train_langsplatv2.py	Saves a top-level final checkpoint for easier evaluation consumption.
open_vocabulary_segmentation/langsplatv2/pyproject.toml	Updates Python requirement and adds evaluation/preprocess dependencies.
open_vocabulary_segmentation/langsplatv2/langsplatv2/vq_utils.py	Adds helper to load CLIP features from all scales for codebook init.
open_vocabulary_segmentation/langsplatv2/langsplatv2/training/trainer.py	Enables codebook initialization from all levels via config flag.
open_vocabulary_segmentation/langsplatv2/langsplatv2/training/langsplatv2_writer.py	Adds `save_final_checkpoint()` writing `final_checkpoint.pt` at run root.
open_vocabulary_segmentation/langsplatv2/langsplatv2/training/dataset.py	Adds cache warmup diagnostics; changes GT feature-map init to zeros.
open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_transforms/multi_scale_sam_masks.py	Refactors to shared mask utils, adds diagnostics, updates cache versioning/log suppression.
open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_transforms/multi_scale_sam1_masks.py	New SAM1 multi-scale mask transform matching original LangSplatV2 settings.
open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_transforms/mask_utils.py	New shared post-processing utilities (cross-crop NMS, mask NMS, small-region cleanup).
open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_transforms/import_original_features.py	New transform to import original repo’s `_f.npy`/`_s.npy` features into cache.
open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_transforms/clip_feature_encoding.py	Moves crop/mask/pad/resize onto GPU to better match original feature extraction.
open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_transforms/init.py	Exposes new transforms (SAM1 + original-feature import).
open_vocabulary_segmentation/langsplatv2/langsplatv2/loss.py	Updates loss to match original “masked then mean over all pixels” behavior; adds valid-only metric.
open_vocabulary_segmentation/langsplatv2/langsplatv2/evaluation/openclip_relevancy.py	Adds OpenCLIP relevancy computation matching original eval logic.
open_vocabulary_segmentation/langsplatv2/langsplatv2/evaluation/datasets/util.py	Adds Google Drive download helper for evaluation datasets.
open_vocabulary_segmentation/langsplatv2/langsplatv2/evaluation/datasets/lerf.py	Adds LERF-OVS dataset download + path helpers.
open_vocabulary_segmentation/langsplatv2/langsplatv2/evaluation/datasets/init.py	Adds dataset-root management for evaluation utilities.
open_vocabulary_segmentation/langsplatv2/langsplatv2/evaluation/init.py	Initializes evaluation package.
open_vocabulary_segmentation/langsplatv2/evaluation/lerf_ovs/eval_lerf.py	New end-to-end LERF-OVS evaluation script (render, relevancy, mIoU, localization, viz).
open_vocabulary_segmentation/langsplatv2/evaluation/lerf_ovs/download_data.py	CLI to download LERF-OVS dataset via tyro.
open_vocabulary_segmentation/langsplatv2/evaluation/lerf_ovs/batch_train_eval_langsplat.sh	Batch training script for LERF scenes/levels and checkpoint collection.
open_vocabulary_segmentation/langsplatv2/evaluation/lerf_ovs/batch_reconstruct_eval_scenes.sh	Batch reconstruction script for LERF scenes.
open_vocabulary_segmentation/langsplatv2/evaluation/lerf_ovs/README.md	Documentation for dataset download, reconstruction, training, and evaluation.
open_vocabulary_segmentation/langsplatv2/environment.yml	Adds a conda env capturing dependencies for evaluation/preprocessing.
open_vocabulary_segmentation/langsplatv2/README.md	Updates top-level docs for SAM1/SAM2, scripts path, outputs, and references.
open_vocabulary_segmentation/langsplatv2/.gitignore	Ignores reconstruction logs and results directories.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

…nsforms/mask_utils.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

…nsforms/clip_feature_encoding.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

add a comment for a small logic issue found in a PR review Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

…dataset.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Copilot

Pull request overview

Copilot reviewed 29 out of 29 changed files in this pull request and generated 8 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

…n/openclip_relevancy.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

swahtz added 3 commits March 4, 2026 18:33

rename and organization

df6bd01

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

readme updates

4941f71

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

swahtz requested a review from Copilot March 4, 2026 06:06

Copilot started reviewing on behalf of swahtz March 4, 2026 06:07 View session

swahtz self-assigned this Mar 4, 2026

Copilot AI reviewed Mar 4, 2026

View reviewed changes

swahtz and others added 6 commits March 4, 2026 22:43

train and query scripts

6e71c92

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Update open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_tra…

51413cb

…nsforms/mask_utils.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Update open_vocabulary_segmentation/langsplatv2/langsplatv2/scene_tra…

bd6c57b

…nsforms/clip_feature_encoding.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

add prefiltering of zero-area masks

efbe246

add a comment for a small logic issue found in a PR review Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Update open_vocabulary_segmentation/langsplatv2/langsplatv2/training/…

5fdde04

…dataset.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

wrap the SAM2 verbose logging supression in a try/finally

90801e3

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

swahtz force-pushed the langsplatv2_lerf_eval branch from f55f66d to 90801e3 Compare March 4, 2026 23:31

swahtz requested a review from Copilot March 4, 2026 23:31

Copilot started reviewing on behalf of swahtz March 4, 2026 23:32 View session

Copilot AI reviewed Mar 4, 2026

View reviewed changes

swahtz and others added 4 commits March 5, 2026 12:46

notes

a5dd960

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

Update open_vocabulary_segmentation/langsplatv2/langsplatv2/evaluatio…

6e5a43a

…n/openclip_relevancy.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

notes fixes

91f0afb

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

setting debug prints correctly

e7bfeb7

Signed-off-by: Jonathan Swartz <jonathan@jswartz.info>

swahtz merged commit 6713dec into openvdb:main Mar 5, 2026
8 checks passed

swahtz deleted the langsplatv2_lerf_eval branch March 5, 2026 00:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LangSplatV2 LERF OVS Evaluation#56

LangSplatV2 LERF OVS Evaluation#56
swahtz merged 13 commits into
openvdb:mainfrom
swahtz:langsplatv2_lerf_eval

swahtz commented Mar 4, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

swahtz commented Mar 4, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants