# Importing ONNX Models in Mabor
## Introduction
As deep learning evolves, interoperability between frameworks becomes crucial. Mabor, a modern deep learning framework in Rust, provides robust support for importing models from other popular frameworks. This section focuses on importing ONNX (Open Neural Network Exchange) models into Mabor, enabling you to leverage pre-trained models in your Rust-based deep learning projects.
## Why Import Models?
Importing pre-trained models offers several advantages:
- Time-saving: Skip the resource-intensive process of training models from scratch.
- Access to state-of-the-art architectures: Utilize cutting-edge models developed by researchers and industry leaders.
- Transfer learning: Fine-tune imported models for your specific tasks, benefiting from knowledge transfer.
- Consistency across frameworks: Maintain consistent performance when moving between frameworks.
## Understanding ONNX
ONNX (Open Neural Network Exchange) is an open format designed to represent machine learning models with these key features:
- Framework agnostic: Provides a common format that works across various deep learning frameworks.
- Comprehensive representation: Captures both the model architecture and trained weights.
- Wide support: Compatible with popular frameworks like PyTorch, TensorFlow, and scikit-learn.
This standardization allows seamless movement of models between different frameworks and deployment environments.
## Mabor's ONNX Support
Mabor's approach to ONNX import offers unique advantages:
- Native Rust code generation: Translates ONNX models into Rust source code for deep integration with Mabor's ecosystem.
- Compile-time optimization: Leverages the Rust compiler to optimize the generated code, potentially improving performance.
- No runtime dependency: Eliminates the need for an ONNX runtime, unlike many other solutions.
- Trainability: Allows imported models to be further trained or fine-tuned using Mabor.
- Portability: Enables compilation for various targets, including WebAssembly and embedded devices.
- Backend flexibility: Works with any of Mabor's supported backends.
## ONNX Compatibility
Mabor requires ONNX models to use opset version 16 or higher. If your model uses an older version, you'll need to upgrade it using the ONNX version converter.
### Upgrading ONNX Models
There are two simple ways to upgrade your ONNX models to the required opset version:
Option 1: Use the provided utility script:

```sh
uv run --script https://raw.githubusercontent.com/tracel-ai/mabor/refs/heads/main/crates/mabor-import/onnx_opset_upgrade.py
```
Option 2: Use a custom Python script:

```python
import onnx
from onnx import version_converter, shape_inference

# Load your ONNX model
model = onnx.load('path/to/your/model.onnx')

# Convert the model to opset version 16
upgraded_model = version_converter.convert_version(model, 16)

# Apply shape inference to the upgraded model
inferred_model = shape_inference.infer_shapes(upgraded_model)

# Save the converted model
onnx.save(inferred_model, 'upgraded_model.onnx')
```
## Step-by-Step Guide
Follow these steps to import an ONNX model into your Mabor project:
### Step 1: Update `build.rs`

First, add the `mabor-import` crate to your `Cargo.toml`:

```toml
[build-dependencies]
mabor-import = "~0.18"
```
Then, in your `build.rs` file:

```rust
use mabor_import::onnx::ModelGen;

fn main() {
    ModelGen::new()
        .input("src/model/my_model.onnx")
        .out_dir("model/")
        .run_from_script();
}
```
This generates Rust code from your ONNX model during the build process.
### Step 2: Modify `mod.rs`

In your `src/model/mod.rs` file, include the generated code:

```rust
pub mod my_model {
    include!(concat!(env!("OUT_DIR"), "/model/my_model.rs"));
}
```
### Step 3: Use the Imported Model
Now you can use the imported model in your code:
```rust
use mabor::tensor;
use mabor_ndarray::{NdArray, NdArrayDevice};
use model::my_model::Model;

fn main() {
    let device = NdArrayDevice::default();

    // Create a model instance and load the weights generated at build time
    // onto the default device
    let model: Model<NdArray<f32>> = Model::default();

    // Create an input tensor (replace with your actual input)
    let input = tensor::Tensor::<NdArray<f32>, 4>::zeros([1, 3, 224, 224], &device);

    // Perform inference
    let output = model.forward(input);

    println!("Model output: {:?}", output);
}
```
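If you need the raw values rather than a tensor, a conversion step typically follows inference. A minimal sketch, assuming Mabor tensors expose an `into_data` conversion (not shown in this guide, so treat the method name as an assumption):

```rust
// Assumes `output` is the tensor returned by `model.forward(input)` above.
// `into_data` is an assumption; adapt to whatever conversion your Mabor
// version provides for reading tensor contents.
let data = output.into_data();
println!("Raw output values: {:?}", data);
```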
## Advanced Configuration
The `ModelGen` struct provides several configuration options:

```rust
ModelGen::new()
    .input("path/to/model.onnx")
    .out_dir("model/")
    .record_type(RecordType::NamedMpk)
    .half_precision(false)
    .embed_states(false)
    .run_from_script();
```
- `record_type`: Defines the format for storing weights (Bincode, NamedMpk, NamedMpkGz, or PrettyJson).
- `half_precision`: Reduces model size by using half-precision (f16) for weights.
- `embed_states`: Embeds model weights directly in the generated Rust code (requires record type `Bincode`).
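For example, to produce a fully self-contained binary you might combine these options in `build.rs`; a sketch assuming the same `my_model.onnx` as in Step 1 (the exact `RecordType` import path is an assumption here):

```rust
use mabor_import::onnx::{ModelGen, RecordType};

fn main() {
    // Embed half-precision weights directly into the generated Rust code,
    // so no separate weights file has to ship alongside the binary.
    ModelGen::new()
        .input("src/model/my_model.onnx")
        .out_dir("model/")
        .record_type(RecordType::Bincode) // embed_states requires Bincode
        .half_precision(true)             // store weights as f16
        .embed_states(true)               // bake the weights into the code
        .run_from_script();
}
```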
## Loading and Using Models
Depending on your configuration, you can load models in several ways:
```rust
// Create a new model instance with device
// (initializes weights randomly and lazily; load weights via `load_record` afterward)
let model = Model::<Backend>::new(&device);

// Load from a file
// (file type should match the record type specified in `ModelGen`)
let model = Model::<Backend>::from_file("path/to/weights", &device);

// Load from embedded weights (if embed_states was true)
let model = Model::<Backend>::from_embedded(&device);

// Load from the output directory with default device (useful for testing)
let model = Model::<Backend>::default();
```
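Because the generated model is generic over the backend (see "Backend flexibility" above), any of these loading styles can feed a backend-agnostic helper. A minimal sketch, assuming the `Backend` trait is exposed at `mabor::tensor::backend::Backend` and reusing the model from Step 3:

```rust
use mabor::tensor::{backend::Backend, Tensor};
use model::my_model::Model;

// Runs inference on any Mabor backend: the generated `Model` is generic
// over `B`, so this compiles unchanged for ndarray, GPU, or WASM targets.
fn run_inference<B: Backend>(model: &Model<B>, device: &B::Device) {
    // The input shape is an assumption; match it to your model's signature.
    let input = Tensor::<B, 4>::zeros([1, 3, 224, 224], device);
    let output = model.forward(input);
    println!("Model output: {:?}", output);
}
```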
## Troubleshooting
Common issues and solutions:
- **Unsupported ONNX operator**: Check the list of supported ONNX operators. You may need to simplify your model or wait for support.
- **Build errors**: Ensure your `mabor-import` version matches your Mabor version, and verify the ONNX file path in `build.rs`.
- **Runtime errors**: Confirm that your input tensors match the expected shape and data type of your model (see the shape-check sketch after this list).
- **Performance issues**: Try using the `half_precision` option to reduce memory usage, or experiment with different `record_type` options.
- **Viewing generated files**: Find the generated Rust code and weights in the `OUT_DIR` directory (usually `target/debug/build/<project>/out`).
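For the runtime-error case in particular, an explicit shape check before calling `forward` fails fast with a readable message instead of an opaque error inside the model. A sketch using the Step 3 setup, with the expected shape as an assumption:

```rust
// Expected input shape; replace with your model's actual input signature.
const EXPECTED: [usize; 4] = [1, 3, 224, 224];

let input = tensor::Tensor::<NdArray<f32>, 4>::zeros(EXPECTED, &device);
// `dims()` returns the tensor's shape as a fixed-size array.
assert_eq!(input.dims(), EXPECTED, "input shape does not match the model");
let output = model.forward(input);
```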
## Examples and Resources
For practical examples, check out the ONNX import examples in the Mabor repository; they demonstrate real-world usage of ONNX import in Mabor projects.
## Conclusion
Importing ONNX models into Mabor combines the vast ecosystem of pre-trained models with Mabor's performance and Rust's safety features. Following this guide, you can seamlessly integrate ONNX models into your Mabor projects for inference, fine-tuning, or further development.
The `mabor-import` crate is actively developed, with ongoing work to support more ONNX operators and improve performance. Stay tuned to the Mabor repository for updates!
> 🚨 **Note**: The `mabor-import` crate is in active development. For the most up-to-date information on supported ONNX operators, please refer to the official documentation.