Skip to content

Conversation

@kellyiss
Copy link
Collaborator

@kellyiss kellyiss commented Dec 4, 2025

✨ InternVL-N1 Dual-System Variants

  1. Added training code for internvla-n1-w-navdp
  2. Added training code for internvla-n1-dualvln
  3. Both variants support full training/inference pipeline

📊 Extended Evaluation Support

  1. Support RXR evaluation
  2. Decouple System 2 and Dual System evaluation functions
  3. Released new models checkpoints
  4. Updated README download links, performance metrics and some content reorganizations.

* update vln yaml; fix import agent

* update habitat, using evaluator and config; env and agent is WIP

* add distributed_base evaluator

* Habitat env applied, distributed evaluator applied; clean evaluator and agent created

* fix observation issues

* update new register name; tiny fix on style

* latest tested

* delete temp agent; rename default evaluator for habitat

* update slurm bash

* update readme

* fix init dist print

* fix eval config; fix local rank to rank

* update init distributed mode if condition

* update dist for dlc

* fix bug in evaluator

* update distributed vln multi, episode loader

* cma tested

* cma tested; episode loader; torchrun; local agent tested; TODO fix isaac sim gpu setting

* add vlnpe distributed script

* add grscene; add new result write and resumble load data feature based on lmdb path + rank; remove TODOs and unused files

* fix bugs in evaluator and dataset for distributed; n1 and rdp tested

* fix comm log concurrency issue, create file with exist true

* fix progress log mkdir race condition

* fix comments

* polish existing configs and bash

* update bash align with doc

* fix CI test

* rename habitat_extensions

* fix comments

* remove useless line

* [file] Update 3D Printing Files for Camera of Unitree Go2 (InternRobotics#176)

* [file] Update 3d printing files for camera of Unitree Go2.

* [Doc] Update readme for 3d printing models.

* fix visualize image size; fix unused comment code; fixed evaluator name

* bump to version v0.0.2

* update vlnmulti to VLN; update habitatVln to HabitatVLN

* :Revert "Merge branch 'main' into vlnpe_refactor"

This reverts commit 7a23b72, reversing
changes made to ea73075.

---------

Co-authored-by: Yuqiang Yang <49871992+yuqiang-yang@users.noreply.github.com>
from .rope2d import get_rope_index_2, get_rope_index_25

# Define placeholders for dataset paths
CAMBRIAN_737K = {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are these datasets used in finetuning?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Qwenvl's default placeholders for dataset paths

import copy
import itertools
import json
import os
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kew6688 Check the conflicts between this file and the refactored evaluator.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Acknowledged. Conflicts are marked. Will wait for @kew6688 's adaptation before merging.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

kellyiss#1

Adaption is ready to merge in this PR

  • Fixed vlnpe internvla_n1 agent with the updated n1 model
  • Merged refactored Habitat Evaluator with decoupled System 2 and Dual System evaluation functions
  • Downgrade diffusers version 0.33.1 -> 0.32.2

masked_loss = loss * mask
loss = masked_loss.sum() / mask.sum() / (loss.shape[1] * loss.shape[2])
elif 'navdp' in self.get_system1_type():
cur_images = traj_images.flatten(0, 1)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this model support the async navdp system1 (aka. InternVLA-N1-Preview)?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This release just supports the inference of InternVLA-N1-Preview.

input_ids.to(self.get_model().device)
input_ids = torch.cat([input_ids, torch.tensor([[TRAJ_START_TOKEN_INDEX]]).to(input_ids.device)], dim=1)
text_embeds = self.get_model().embed_tokens(input_ids)
with torch.no_grad():
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The kv cache of generat_latents has some unfixed bugs. The outputs after using kv cache are not aligned with the ones that do not use kv cache. So please make sure the past_key_values is None to disable the kv cache.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code related to kv cache has been removed. Support will be added back in future releases.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants