[Feature Request] Framework-independent DP model format #2982

Closed
24 of 37 tasks
Tracked by #3122
njzjz opened this issue Nov 8, 2023 · 1 comment
njzjz commented Nov 8, 2023

Summary

Implement a framework-independent DP model format.

Detailed Description

Background

Currently, the DP model file depends on the deep learning framework: the TensorFlow model is stored in ProtoBuf format (.pb), while the PyTorch model under development uses the .pt format. The two formats are difficult to convert between. The ONNX project attempts such conversion at the OP level, but it is of limited use here, since both TensorFlow and PyTorch have many unsupported OPs, and DP models may contain customized OPs.

DeePMD-kit needs to implement a framework-independent DP model format in order to support multiple backends, as described below. Different frameworks are expected to behave identically for the same model data.

Data structure

  1. The model data is based on the current input parameters, which are aligned across frameworks. Unimplemented parameters should also be aligned; a framework that does not support a parameter should raise a NotImplementedError at runtime.

  2. Add a @variables key, of type dict[str, np.ndarray], to each layer's dictionary to store the network parameters, corresponding to what needs to be restored in the current init_frz_model (which ensures complete restoration). Since @variables contains the special character @, names beginning with @ should be treated as reserved and avoided for regular arguments in the future. The keys of @variables should be aligned across all frameworks. The type embedding should be written out explicitly, not hidden.

{
    "argument1": ...,
    "@variables": { 
        "variable1": ..., 
    }
}
  3. Add the following meta-information at the top level: (1) the software, version, and module used to generate the model file; (2) the generation time; (3) a unified model definition version shared by all frameworks.
{
    "model": ...,
    "software": ...,
    "software_version": ...,
    "time": ...,
    "model_version": ...,
}
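The layout above can be sketched in plain Python. This is a hypothetical example of the proposed model data; key names other than `@variables` and the top-level meta fields are illustrative, not DeePMD-kit's actual schema:

```python
import datetime
import numpy as np

# A single layer's dictionary: input parameters plus the reserved
# "@variables" key holding framework-independent NumPy arrays.
layer = {
    "activation_function": "tanh",   # illustrative argument names
    "resnet_dt": False,
    "@variables": {
        # network parameters stored as plain NumPy arrays, so no
        # deep-learning framework is needed to read them back
        "w": np.zeros((10, 10), dtype=np.float64),
        "b": np.zeros((10,), dtype=np.float64),
    },
}

# Top-level document with the proposed meta-information.
model_data = {
    "model": {"descriptor": {"layers": [layer]}},
    "software": "deepmd-kit",
    "software_version": "3.0.0",                   # version that generated the file
    "time": datetime.datetime.now().isoformat(),   # generation time
    "model_version": "1.0",                        # unified across frameworks
}
```

Because every array is a plain `np.ndarray`, any backend can load the same document and reconstruct its own native parameters from it.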

Data storage

An HDF5 file is used to store the data. h5py is already a dependency of TensorFlow, PyTorch, and the existing DeePMD-kit, so this brings no extra dependencies.

  1. All variables are stored in the HDF5 file under unique paths. The json path is reserved and should not be used for variables.
  2. The JSON document is stored at the json path, with @variables converted to type dict[str, str]. Each value of the @variables dict is the HDF5 path to the corresponding variable, which may differ between platforms.
  3. Convert dict[str, np.ndarray] to dict[str, str] when saving the model, and convert it back when restoring it.
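A minimal sketch of the saving side of this scheme, assuming h5py; the function names and path layout are illustrative, not DeePMD-kit's actual implementation:

```python
import json

import h5py
import numpy as np


def traverse_save(f, data, path=""):
    """Recursively replace each "@variables" dict[str, np.ndarray] with a
    dict[str, str] of HDF5 paths, writing the arrays into the open file f."""
    out = {}
    for key, value in data.items():
        if key == "@variables":
            out[key] = {}
            for name, arr in value.items():
                hdf5_path = f"{path}/@variables/{name}"
                f[hdf5_path] = arr           # store the array itself
                out[key][name] = hdf5_path   # record its path in the JSON
        elif isinstance(value, dict):
            out[key] = traverse_save(f, value, f"{path}/{key}")
        else:
            out[key] = value
    return out


def save_model(filename, model_data):
    with h5py.File(filename, "w") as f:
        data = traverse_save(f, model_data)
        # the "json" path is reserved for the serialized JSON document
        f["json"] = json.dumps(data)
```

Restoring reverses the conversion: load the JSON document from the `json` path, then replace each path string in `@variables` with the array read from that HDF5 path.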

Binding with class

Add deserialize (a classmethod) and serialize methods to each class. The parent class's deserialize should dispatch to the appropriate subclass. The implementation should follow dpdispatcher:

https://github.com/deepmodeling/dpdispatcher/blob/065731a60be3b58979b54f1d33562ef189800158/dpdispatcher/submission.py#L97-L166

The deserialize (classmethod) and serialize of the top-level class can be called by external modules.
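The dispatch pattern borrowed from dpdispatcher can be sketched as follows; the class names, registry, and dictionary keys here are illustrative, not DeePMD-kit's actual API:

```python
import numpy as np

# Hypothetical registry mapping a "type" string in the serialized data
# to the subclass that knows how to deserialize it.
_REGISTRY = {}


class Descriptor:
    @classmethod
    def register(cls, name):
        def decorator(subclass):
            _REGISTRY[name] = subclass
            return subclass
        return decorator

    @classmethod
    def deserialize(cls, data):
        # The parent class dispatches to the subclass recorded in the
        # serialized data; the subclass overrides deserialize itself.
        return _REGISTRY[data["type"]].deserialize(data)


@Descriptor.register("se_e2_a")
class DescrptSeA(Descriptor):
    def __init__(self, w):
        self.w = w

    def serialize(self):
        return {"type": "se_e2_a", "@variables": {"w": self.w}}

    @classmethod
    def deserialize(cls, data):
        return cls(w=data["@variables"]["w"])
```

External code only ever calls the top-level pair, e.g. `Descriptor.deserialize(data)`, and gets back an instance of the right subclass without knowing which framework or class produced the data.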

Progress

Further Information, Files, and Links

No response


njzjz commented May 24, 2024

Closing, as the framework has been set up.
