refact(dpmodel,pt_expt): fitting net #5207
wanghan-iapcm wants to merge 37 commits into deepmodeling:master from
Conversation
Check notice (Code scanning / CodeQL): Unused local variable (Note, test). Reported five times in the consistency tests, each on a `self.param` unpacking block of the form:

```python
(
    tebd_dim,
    tebd_input_mode,
    resnet_dt,
    excluded_types,
    precision,
    use_econf_tebd,
    use_tebd_bias,
) = self.param
return CommonTest.skip_pt_expt
```
Code Review
This pull request is a significant and well-executed refactoring. It successfully converts FittingNet to a concrete class, mirroring the previous EmbeddingNet refactoring, which is a great step towards a more consistent and maintainable codebase. The introduction of the pt_expt backend with its automatic wrapping mechanism is a clever design that will simplify future development. The changes to make the code more backend-agnostic using array_api_compat and ensuring device consistency are also excellent. The added tests are comprehensive and provide good coverage for the new features. I've found a couple of minor areas for improvement, which I've detailed in the comments.
```python
    dtype=des["dstd"].dtype,
    device=array_api_compat.device(des["dstd"]),
)
xp = array_api_compat.array_namespace(des["davg"])
```

```python
else:
    pass
```
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: e263270ca2
```python
class PyTorchExportableBackend(Backend):
    """PyTorch exportable backend."""

    name = "PyTorch Exportable"
```
Use a registered key as backend canonical name
Setting name to "PyTorch Exportable" breaks CLI backend selection because main_parser() canonicalizes --backend/--pt-expt to backend.name.lower(), yielding "pytorch exportable", but deepmd/main.py only accepts registered backend keys (e.g. pt-expt, pytorch-exportable). As a result, invoking dp --pt-expt ... or dp --backend pt-expt ... fails early with Unknown backend before dispatch.
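The mismatch can be reproduced in a few lines. This is a toy illustration: `REGISTERED_KEYS` is a hypothetical stand-in for the keys accepted by deepmd/main.py, and only the lowercasing behavior comes from the comment above:

```python
# Hypothetical set of registered backend keys (stand-in for deepmd/main.py).
REGISTERED_KEYS = {"pt-expt", "pytorch-exportable"}

def canonicalize(display_name: str) -> str:
    # mirrors the backend.name.lower() canonicalization described above
    return display_name.lower()

# "PyTorch Exportable" lowercases to a string that is not a registered key,
# so selection by display name fails.
assert canonicalize("PyTorch Exportable") == "pytorch exportable"
assert canonicalize("PyTorch Exportable") not in REGISTERED_KEYS
# A name equal to a registered key survives canonicalization.
assert canonicalize("pt-expt") in REGISTERED_KEYS
```

Using one of the registered keys as the canonical `name` avoids the mismatch entirely.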
```python
    type[DeepEvalBackend]
        The Deep Eval backend of the backend.
    """
    raise NotImplementedError
```
Stop advertising unsupported deep-eval and IO hooks
This backend is marked with DEEP_EVAL/NEIGHBOR_STAT/IO and .pte suffix support, so dispatcher paths can select it (for example .pte model detection in DeepEval), but deep_eval, neighbor_stat, and serialization hooks still raise NotImplementedError. That makes backend-selected inference/inspection flows fail at runtime instead of gracefully reporting unsupported capabilities.
```python
obj = cls(**data)
# Reinitialize layers from serialized data, using the same layer type
# that __init__ created (respects subclass overrides via MRO).
layer_type = type(obj.layers[0])
```
Handle empty embedding layers in deserialize
EmbeddingNet.deserialize() now indexes obj.layers[0] to infer layer_type, but EmbeddingNet.__init__ allows neuron=[], which creates an empty layer list. Deserializing such a serialized network now raises IndexError before loading layers, whereas the previous implementation handled empty-layer round trips by reinitializing directly from serialized layers.
📝 Walkthrough

Introduces a PyTorch exportable backend (pt_expt) with device-aware array operations in core descriptors. Refactors network factories into explicit classes with serialization support. Adds wrappers converting dpmodel components to PyTorch modules via registry-based mapping.

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant DP as DPModel Descriptor
    participant Reg as Registry (common.py)
    participant Conv as Converter Function
    participant PT as PT Expt Module
    participant PyTorch as PyTorch Framework
    DP->>Reg: register_dpmodel_mapping(DP_class, converter)
    Note over Reg: Store mapping: DP_class → converter
    DP->>Reg: try_convert_module(dp_instance)
    Reg->>Reg: Check if dp_instance type in registry
    Reg->>Conv: Found converter, call it
    Conv->>PT: Create PT_expt wrapper with dual inheritance
    PT->>PT: Initialize DP base + torch.nn.Module
    PT->>PT: Register buffers/parameters via dpmodel_setattr
    Reg-->>Conv: PT_expt module
    Conv-->>Reg: Converted module
    Reg-->>DP: PT Expt module returned
    DP->>PT: Assign to attribute (via dpmodel_setattr)
    PT->>PyTorch: Synchronize with PyTorch state_dict
    PyTorch-->>PT: Module registered for export/device movement
```

```mermaid
sequenceDiagram
    participant App as Application
    participant PT as PT Expt Descriptor
    participant DP as DPModel Compute
    participant XP as Array API Backend
    App->>PT: forward(extended_coord, extended_atype, nlist, ...)
    PT->>PT: __call__ routes through nn.Module.__call__
    PT->>DP: self.call(extended_coord, extended_atype, nlist, mapping)
    DP->>XP: xp.asarray(array, device=target_device, dtype=target_dtype)
    XP-->>DP: Device-aware array
    DP->>XP: xp.take_along_axis(..., indices.astype(xp.int64))
    XP-->>DP: Gathered values
    DP-->>PT: (descrpt, rot_mat, g2, h2, sw)
    PT-->>App: Return 5-tuple of torch.Tensor
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~75 minutes

🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 3
Caution: some comments are outside the diff and can't be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
deepmd/dpmodel/descriptor/dpa1.py (1)
921-927: ⚠️ Potential issue | 🟡 Minor

Initialize `self.stats = None` in `DescrptBlockSeAtten.__init__` to prevent `AttributeError`. `self.stats` is only assigned in `compute_input_stats` (line 909) and is never initialized in `__init__`. If `get_stats` is called before `compute_input_stats`, the check `if self.stats is None:` on line 923 will raise `AttributeError` instead of the intended `RuntimeError`.

Proposed fix: add `self.stats = None` in `DescrptBlockSeAtten.__init__` (after line 845):

```diff
 self.orig_sel = self.sel
+self.stats = None
```
🧹 Nitpick comments (9)
source/tests/pt/test_env_threads.py (1)
21-28: Consider using pytest's `caplog` fixture instead of manually patching `logging.Logger.warning`. The manual monkey-patch of `logging.Logger.warning` works but is fragile and non-idiomatic. pytest's built-in `caplog` fixture is designed for this:

```python
def test_env_threads_guard_handles_runtimeerror(monkeypatch, caplog) -> None:
    ...
    with caplog.at_level(logging.WARNING):
        importlib.reload(env)
    assert any("Could not set torch interop threads" in r.message for r in caplog.records)
    assert any("Could not set torch intra threads" in r.message for r in caplog.records)
```

deepmd/dpmodel/utils/network.py (1)
816-826: Mutable default argument for `neuron`. Both `EmbeddingNet.__init__` (line 819) and `FittingNet.__init__` (line 1042) use a mutable list as a default argument (`neuron: list[int] = [24, 48, 96]`). This is a well-known Python gotcha (Ruff B006). It's a pre-existing pattern inherited from the factory functions (lines 714, 930), so this may be deferred, but worth noting.

♻️ Idiomatic fix (applied to both classes)

```diff
 def __init__(
     self,
     in_dim: int,
-    neuron: list[int] = [24, 48, 96],
+    neuron: list[int] | None = None,
     ...
 ) -> None:
+    if neuron is None:
+        neuron = [24, 48, 96]
```

Also applies to: 1038-1048
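The B006 pitfall the nitpick describes can be demonstrated with toy functions (these are illustrations, not the deepmd code):

```python
def make_neuron_bad(neuron=[24, 48, 96]):
    # the default list object is created once and shared across calls
    neuron.append(0)
    return neuron

def make_neuron_good(neuron=None):
    if neuron is None:
        neuron = [24, 48, 96]  # fresh list on every call
    neuron.append(0)
    return neuron

assert make_neuron_bad() == [24, 48, 96, 0]
assert make_neuron_bad() == [24, 48, 96, 0, 0]  # mutation leaked into the default
assert make_neuron_good() == [24, 48, 96, 0]
assert make_neuron_good() == [24, 48, 96, 0]  # unaffected by previous calls
```

The deepmd classes never mutate `neuron`, so the pattern is latent rather than an active bug, which is why the nitpick marks it as deferrable.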
deepmd/pt_expt/descriptor/__init__.py (1)
3-3: Unused `noqa` directive (Ruff RUF100). Ruff reports that F401 is not enabled, so the `# noqa: F401` directive is unnecessary. That said, it does document the intent that this is a side-effect import. You could remove it to silence the linter or suppress RUF100 instead; either way is fine.

deepmd/pt_expt/utils/type_embed.py (1)
14-15: Side-effect import is necessary for registration ordering — good. The comment clearly explains why `network` must be imported before `TypeEmbedNet` is used. Same Ruff RUF100 `noqa` nit as noted in `__init__.py`: the directive is technically unnecessary if F401 isn't enabled, but it documents intent.

source/tests/pt_expt/descriptor/test_se_t.py (1)
65-65: Nit: prefix unused unpacked variables with `_`. Static analysis flags `gr1` (line 65) and `gr2` (line 85) as unused. Since `se_t` returns `None` for these, prefixing with `_` silences the warning.

```diff
-        rd1, gr1, _, _, sw1 = dd1(
+        rd1, _gr1, _, _, sw1 = dd1(
```

```diff
-        rd2, gr2, _, _, sw2 = dd2.call(
+        rd2, _gr2, _, _, sw2 = dd2.call(
```

source/tests/pt_expt/utils/test_network.py (1)
286-404: `TestFittingNetRefactor` has less coverage than `TestEmbeddingNetRefactor`. `TestEmbeddingNetRefactor` includes `test_cross_backend_consistency`, `test_deserialize_preserves_layer_type`, `test_auto_conversion_in_setattr`, and `test_trainable_parameter_handling`, none of which have counterparts here. Consider adding at least `test_cross_backend_consistency` and `test_trainable_parameter_handling` for FittingNet to ensure parity.

Also, `FittingNet` is imported locally in each method while `EmbeddingNet` is imported at the top of the file; consider hoisting the `FittingNet` import to the top for consistency.

deepmd/pt_expt/utils/env.py (1)
19-20: Minor: `import torch` placed after logger setup. The `torch` import on line 20 sits between the logger initialization and the code that uses it (line 22 onwards). Consider moving it up to the import block (after line 7) for standard import ordering.

deepmd/pt_expt/common.py (1)
45-82: No duplicate-registration guard. `register_dpmodel_mapping` silently overwrites if called with the same `dpmodel_cls` twice. This is probably fine for the current use case (registrations happen at module import time), but a debug-level log or assertion on duplicate keys could help catch accidental re-registrations during development.

deepmd/pt_expt/utils/network.py (1)
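A minimal sketch of such a type-keyed conversion registry with a duplicate-registration log: the names `register_dpmodel_mapping` and `try_convert_module` come from this PR, but the signatures and the debug log here are assumptions, not the actual implementation.

```python
import logging

_MAPPING: dict = {}  # maps dpmodel class -> converter callable

def register_dpmodel_mapping(dp_cls, converter):
    if dp_cls in _MAPPING:
        # hypothetical guard the nitpick suggests: surface re-registrations
        logging.getLogger(__name__).debug("re-registering %s", dp_cls.__name__)
    _MAPPING[dp_cls] = converter

def try_convert_module(obj):
    # exact-type lookup: unregistered objects pass through unchanged
    converter = _MAPPING.get(type(obj))
    return converter(obj) if converter is not None else obj

class DPNet: ...
class PTNet(DPNet): ...

register_dpmodel_mapping(DPNet, lambda dp: PTNet())
assert isinstance(try_convert_module(DPNet()), PTNet)
assert try_convert_module("not registered") == "not registered"
```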
27-37: Remove unused `noqa` directive. Ruff reports PYI034 is not enabled, so the `# noqa: PYI034` suppression on line 28 is unnecessary.

🧹 Proposed fix

```diff
-    def __new__(  # noqa: PYI034
+    def __new__(
         cls, data: Any = None, requires_grad: bool = True
     ) -> "TorchArrayParam":
```
```python
features: ClassVar[Backend.Feature] = (
    Backend.Feature.ENTRY_POINT
    | Backend.Feature.DEEP_EVAL
    | Backend.Feature.NEIGHBOR_STAT
    | Backend.Feature.IO
)
```
Remove unimplemented feature declarations or implement the methods.
The pt-expt backend declares DEEP_EVAL, NEIGHBOR_STAT, and IO features, but the corresponding properties (deep_eval, neighbor_stat, serialize_hook, deserialize_hook) all raise NotImplementedError. This creates real runtime failures:
- The `neighbor_stat` entrypoint (deepmd/entrypoints/neighbor_stat.py:75-80) calls `get_backends_by_feature(Backend.Feature.NEIGHBOR_STAT)`, which includes pt-expt. If a user specifies `--backend pt-expt`, the call to `backend_obj.neighbor_stat` will crash.
- The `convert_backend` entrypoint (deepmd/entrypoints/convert_backend.py:28-29) detects backends by file suffix. Since pt-expt has suffix `.pte`, loading or saving `.pte` files will crash when accessing `serialize_hook` or `deserialize_hook`.
Either remove these feature flags until the methods are implemented, or provide implementations.
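The failure mode is the standard flag-advertising pattern: dispatchers select backends by flags alone and trust the methods to exist. A toy illustration (the `Feature` enum and `FakeBackend` here are made up and only mirror the `enum.Flag` pattern discussed above):

```python
import enum

class Feature(enum.Flag):
    ENTRY_POINT = enum.auto()
    DEEP_EVAL = enum.auto()
    NEIGHBOR_STAT = enum.auto()
    IO = enum.auto()

class FakeBackend:
    # advertising only what is actually implemented keeps dispatchers honest:
    # a backend not listing NEIGHBOR_STAT is simply never selected for it
    features = Feature.ENTRY_POINT

def get_backends_by_feature(backends, feature):
    return [b for b in backends if b.features & feature]

assert get_backends_by_feature([FakeBackend], Feature.NEIGHBOR_STAT) == []
assert get_backends_by_feature([FakeBackend], Feature.ENTRY_POINT) == [FakeBackend]
```

Dropping the three flags turns a runtime crash into a clean "no backend supports this" path, which is why removal is the lighter of the two fixes.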
```python
obj = cls(**data)
# Reinitialize layers from serialized data, using the same layer type
# that __init__ created (respects subclass overrides via MRO).
layer_type = type(obj.layers[0])
obj.layers = type(obj.layers)(
    [layer_type.deserialize(layer) for layer in layers]
)
return obj
```
IndexError when neuron is empty in EmbeddingNet.deserialize.
If `neuron=[]`, `obj.layers` will be an empty list after `cls(**data)`, so `type(obj.layers[0])` on line 892 raises `IndexError`. The same pattern in `FittingNet.deserialize` is safe because the output layer always exists.
🐛 Proposed fix: guard against empty layers
```diff
 obj = cls(**data)
 # Reinitialize layers from serialized data, using the same layer type
 # that __init__ created (respects subclass overrides via MRO).
-layer_type = type(obj.layers[0])
-obj.layers = type(obj.layers)(
-    [layer_type.deserialize(layer) for layer in layers]
-)
+if obj.layers:
+    layer_type = type(obj.layers[0])
+    obj.layers = type(obj.layers)(
+        [layer_type.deserialize(layer) for layer in layers]
+    )
 return obj
```
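The guarded pattern can be exercised standalone; `Layer` and `Net` below are stand-ins for the actual `NativeLayer`/`EmbeddingNet` classes, not the deepmd code:

```python
class Layer:
    @classmethod
    def deserialize(cls, data):
        obj = cls()
        obj.data = data
        return obj

class Net:
    def __init__(self, neuron=()):
        # neuron=[] legitimately produces an empty layer list
        self.layers = [Layer() for _ in neuron]

    @classmethod
    def deserialize(cls, neuron, layers):
        obj = cls(neuron)
        if obj.layers:  # guard: with neuron=[] there is nothing to index
            layer_type = type(obj.layers[0])
            obj.layers = [layer_type.deserialize(d) for d in layers]
        return obj

assert Net.deserialize([], []).layers == []          # empty round trip is safe
assert len(Net.deserialize([8, 8], [{}, {}]).layers) == 2
```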
```python
def eval_pt_expt(self, pt_expt_obj: Any) -> Any:
    pt_expt_obj.compute_input_stats(
        [
            {
                "r0": None,
                "coord": torch.from_numpy(self.coords)
                .reshape(-1, self.natoms[0], 3)
                .to(env.DEVICE),
                "atype": torch.from_numpy(self.atype.reshape(1, -1)).to(env.DEVICE),
                "box": torch.from_numpy(self.box.reshape(1, 3, 3)).to(env.DEVICE),
                "natoms": self.natoms[0],
            }
        ]
    )
    return self.eval_pt_expt_descriptor(
        pt_expt_obj,
        self.natoms,
        self.coords,
        self.atype,
        self.box,
    )
```
eval_pt_expt in TestSeAStat depends on env.DEVICE which is only imported under INSTALLED_PT.
env is imported at line 33 inside if INSTALLED_PT:, but eval_pt_expt (line 553) references env.DEVICE. If INSTALLED_PT_EXPT is True while INSTALLED_PT is False, this will raise a NameError. In practice this is unlikely since both backends require torch, but it's a latent coupling.
Consider importing PT_DEVICE from the common descriptor test helper (which guards on INSTALLED_PT or INSTALLED_PT_EXPT) or adding a local fallback:
```diff
 if INSTALLED_PT_EXPT:
     from deepmd.pt_expt.descriptor.se_e2_a import DescrptSeA as DescrptSeAPTExpt
+    if not INSTALLED_PT:
+        from deepmd.pt.utils import env
 else:
     DescrptSeAPTExpt = None
```
Actionable comments posted: 1
🧹 Nitpick comments (2)
deepmd/dpmodel/utils/network.py (2)
813-850: Mutable default argument for `neuron`. Ruff B006 flags `neuron: list[int] = [24, 48, 96]`. While the list isn't mutated here (a new list is built each time), the same pattern exists in the old factory function, so this is pre-existing. Consider switching to `None` with an internal default if you want to clean it up.

♻️ Suggested fix

```diff
 def __init__(
     self,
     in_dim: int,
-    neuron: list[int] = [24, 48, 96],
+    neuron: list[int] | None = None,
     activation_function: str = "tanh",
     resnet_dt: bool = False,
     precision: str = DEFAULT_PRECISION,
     seed: int | list[int] | None = None,
     bias: bool = True,
     trainable: bool | list[bool] = True,
 ) -> None:
+    if neuron is None:
+        neuron = [24, 48, 96]
     layers = []
```
1035-1078: Same mutable default `neuron` (B006); the same optional cleanup applies. Same Ruff B006 as `EmbeddingNet`. The `else: pass` on lines 1051-1052 is a no-op and could be dropped for clarity, but this mirrors the existing factory code.
```diff
 # Create random params with same dtype and device as first_layer_matrix
 extend_type_params = np.random.default_rng().random(
     [len(type_map), first_layer_matrix.shape[-1]],
     dtype=PRECISION_DICT[self.precision],
 )
 extend_type_params = xp.asarray(
     extend_type_params,
     dtype=first_layer_matrix.dtype,
     device=array_api_compat.device(first_layer_matrix),
 )
-first_layer_matrix = np.concatenate(
+first_layer_matrix = xp.concat(
     [first_layer_matrix, extend_type_params], axis=0
 )
```
Fix dtype handling in random generation to avoid runtime errors with float16 precision.
- `np.random.default_rng().random(..., dtype=...)` only supports `float32` and `float64`. Passing `PRECISION_DICT[self.precision]` will raise a `TypeError` if precision is `"float16"` or `"half"`, since those map to `np.float16`, which is unsupported. Generate as `float32`/`float64` and cast if needed, or use `first_layer_matrix.dtype` directly (which the code already converts to afterward).
- `np.random.default_rng()` creates an unseeded RNG on every call. If reproducibility is needed during type-map changes, consider passing a seed or using a seeded generator instance.
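The workaround is a generate-then-cast, sketched below (the seed value is arbitrary and only illustrates the reproducibility point):

```python
import numpy as np

# Generator.random only accepts float32/float64, so generate in a supported
# dtype and cast afterwards instead of passing np.float16 directly.
rng = np.random.default_rng(12345)  # seeded, unlike the inline default_rng() call
x = rng.random((4, 8), dtype=np.float32).astype(np.float16)

assert x.dtype == np.float16
assert x.shape == (4, 8)
assert bool(((x >= 0.0) & (x < 1.0)).all())  # still uniform in [0, 1)
```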
Codecov Report

❌ Patch coverage is

Additional details and impacted files

```
@@            Coverage Diff             @@
##           master    #5207      +/-   ##
==========================================
+ Coverage   81.99%   82.03%   +0.03%
==========================================
  Files         724      728       +4
  Lines       73807    73943     +136
  Branches     3616     3615       -1
==========================================
+ Hits        60519    60659     +140
+ Misses      12124    12121       -3
+ Partials     1164     1163       -1
```

☔ View full report in Codecov by Sentry.
FittingNet Refactoring: Factory Function to Concrete Class
Summary
This refactoring converts `FittingNet` from a factory-generated dynamic class to a concrete class in the dpmodel backend, following the same pattern as the EmbeddingNet refactoring. This enables the auto-detection registry mechanism in pt_expt to work seamlessly with FittingNet.

This PR is considered after #5194 and #5204.
Motivation
Before: `FittingNet` was created by a factory function `make_fitting_network(EmbeddingNet, NativeNet, NativeLayer)`, producing a dynamically-typed class. This caused a problem: each call to `make_fitting_network` creates a new class type, so registry lookup by type fails.

After: `FittingNet` is now a concrete class that can be registered in the pt_expt auto-conversion registry.
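The class-identity problem behind this motivation is easy to demonstrate; the snippet below is an illustration, not the deepmd code:

```python
def make_fitting_class():
    class Fitting:  # a brand-new class object is created on every call
        pass
    return Fitting

A = make_fitting_class()
B = make_fitting_class()
assert A is not B                 # two distinct types despite identical bodies

registry = {A: "converter"}
assert type(B()) not in registry  # lookup by the "same" factory type misses

class ConcreteFitting:            # one module-level identity
    pass

registry = {ConcreteFitting: "converter"}
assert type(ConcreteFitting()) in registry
```

A concrete, module-level class has a single stable identity, so a type-keyed registry can always find it.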
1. dpmodel: Concrete
FittingNetclassFile:
deepmd/dpmodel/utils/network.pyFittingNet(EmbeddingNet)class__init__deserializeto usetype(obj.layers[0])instead of hardcodingT_Network.__init__(obj, layers), allowing pt_expt subclass to preserve its converted torch layersmake_fitting_networkfactory for backwards compatibility (for pt/pd backends)2. pt_expt: Wrapper and registration
File:
deepmd/pt_expt/utils/network.pyfrom deepmd.dpmodel.utils.network import FittingNet as FittingNetDPFittingNet(FittingNetDP, torch.nn.Module)wrapperNativeLayer(torch modules) in__init__Tests
dpmodel tests
File:
source/tests/common/dpmodel/test_network.pyAdded to
TestFittingNetclass:test_fitting_net: Original roundtrip serialization test (already existed)test_is_concrete_class: VerifiesFittingNetis now a concrete class, not factory outputtest_forward_pass: Tests dpmodel forward pass produces correct output shapes (single and batch)test_trainable_parameter_variants: Tests different trainable configurations (all trainable, all frozen, mixed)pt_expt integration tests
File:
source/tests/pt_expt/utils/test_network.pyCreated
TestFittingNetRefactortest suite with 4 tests:test_pt_expt_fitting_net_wraps_dpmodel: Verifies pt_expt wrapper inherits correctly and converts layerstest_pt_expt_fitting_net_forward: Tests pt_expt forward pass returns torch.Tensor with correct shapetest_serialization_round_trip_pt_expt: Tests pt_expt serialize/deserialize round-triptest_registry_converts_dpmodel_to_pt_expt: Teststry_convert_moduleauto-converts dpmodel to pt_exptVerification
All tests pass:
Benefits
Backward Compatibility
make_fitting_networkfactory kept for pt/pd backendsFiles Changed
Modified
deepmd/dpmodel/utils/network.py: Concrete FittingNet class + deserialize fixdeepmd/pt_expt/utils/network.py: FittingNet wrapper + registrationsource/tests/common/dpmodel/test_network.py: Added dpmodel FittingNet tests (3 new tests)source/tests/pt_expt/utils/test_network.py: Added pt_expt integration tests (4 new tests)Pattern
This refactoring follows the exact same pattern as
EMBEDDING_NET_REFACTOR.md:deserializeto usetype(obj.layers[0])__init__register_dpmodel_mappingSummary by CodeRabbit
New Features
Bug Fixes
Refactor
Tests