RustODotnet 0.1.2
dotnet add package RustODotnet --version 0.1.2
NuGet\Install-Package RustODotnet -Version 0.1.2
<PackageReference Include="RustODotnet" Version="0.1.2" />
<PackageVersion Include="RustODotnet" Version="0.1.2" />
<PackageReference Include="RustODotnet" />
paket add RustODotnet --version 0.1.2
#r "nuget: RustODotnet, 0.1.2"
#:package RustODotnet@0.1.2
#addin nuget:?package=RustODotnet&version=0.1.2
#tool nuget:?package=RustODotnet&version=0.1.2
RustO! ๐ฆ
Pure Rust OCR Library - Fast, Safe, and Cross-Platform
RustO! is a high-performance OCR (Optical Character Recognition) library written in pure Rust, based on RapidOCR and powered by PaddleOCR models with MNN inference engine.
๐ฏ Why RustO!?
- ๐ Pure Rust - Zero OpenCV dependency, optional OpenCV backend available
- ๐ฏ High Accuracy - 99.3% parity with OpenCV-based implementations
- โก Fast Performance - Optimized with LTO, single codegen unit compilation
- ๐ Memory Safe - Leverages Rust's safety guarantees
- ๐ Cross-Platform - Linux, macOS, Windows, iOS, Android support
- ๐ง FFI Ready - C FFI bindings for integration with other languages
- ๐ฆ Easy to Use - Simple API, modern CLI with JSON/Text/TSV output
๐๏ธ Architecture
RustO! is built on top of proven OCR technology:
- Based on: RapidOCR architecture
- Models: PaddleOCR PPOCRv4/v5 models
- Inference: MNN inference engine for high-performance cross-platform execution
- Image Processing: Pure Rust implementation (image + imageproc crates)
- Contour Detection: Custom Rust implementation matching OpenCV behavior
๐ Project Structure
rusto-rs/
โโโ src/
โ โโโ lib.rs # Public API
โ โโโ main.rs # CLI application
โ โโโ ffi.rs # C FFI bindings (optional)
โ โโโ det.rs # Text detection
โ โโโ rec.rs # Text recognition
โ โโโ layout.rs # Layout detection
โ โโโ doc_pipeline.rs # Document pipeline (layout + OCR)
โ โโโ preprocess.rs # Image preprocessing
โ โโโ postprocess.rs # Result postprocessing
โ โโโ contours.rs # Pure Rust contour detection
โ โโโ geometry.rs # Geometric transformations + NMS
โ โโโ image_impl.rs # Image abstraction layer
โ โโโ ...
โโโ Cargo.toml # Dependencies & optimization
โโโ docs/ # Documentation
โโโ examples/ # Example applications
โ โโโ doc_pipeline_demo.rs # Document pipeline example
โ โโโ ...
โโโ packages/ # Additional packages
Model Conversion
RustO! uses MNN inference engine. You need to convert PaddleOCR models to MNN format:
# Install required tools
pip install paddle2onnx
# Download and build MNN from https://github.com/alibaba/MNN
# Convert models using the provided script
python convert_paddle_to_mnn.py --ocr-dir ./models
See MODEL_CONVERSION.md for detailed conversion instructions.
Quick Start
1. Build the Library
# Pure Rust build (default)
cargo build --release
# With FFI bindings
cargo build --release --features ffi
# With OpenCV backend (optional)
cargo build --release --features use-opencv
2. Run CLI Application
# JSON output (default)
cargo run --release -- \
--det-model path/to/det.mnn \
--rec-model path/to/rec.mnn \
--dict path/to/dict.txt \
image.jpg
# Plain text output
cargo run --release -- \
--det-model path/to/det.mnn \
--rec-model path/to/rec.mnn \
--dict path/to/dict.txt \
--format text \
image.jpg
# TSV output
cargo run --release -- \
--det-model path/to/det.mnn \
--rec-model path/to/rec.mnn \
--dict path/to/dict.txt \
--format tsv \
image.jpg
3. Use as a Library
Add to your Cargo.toml:
[dependencies]
rusto = "0.1"
Then in your code:
use rusto::{RapidOCR, RapidOCRConfig};
fn main() -> Result<(), Box<dyn std::error::Error>> {
// Configure OCR
let config = RapidOCRConfig {
det_model_path: "models/det.mnn".to_string(),
rec_model_path: "models/rec.mnn".to_string(),
dict_path: "models/dict.txt".to_string(),
};
// Create OCR instance
let ocr = RapidOCR::new(config)?;
// Run OCR on an image
let results = ocr.ocr("image.jpg")?;
// Process results
for result in results {
println!("Text: {}, Score: {:.3}", result.text, result.score);
println!("Box: {:?}", result.box_points);
}
Ok(())
}
4. Document Pipeline (Layout + OCR)
RustO! now supports document layout analysis combined with OCR for structured document processing:
use rusto::{DocPipeline, DocPipelineConfig, LayoutConfig, RustOConfig};
fn main() -> Result<(), Box<dyn std::error::Error>> {
// Configure layout detection
let layout_config = LayoutConfig::default("models/DocOCR/layout.mnn".into());
// Configure OCR
let ocr_config = RustOConfig::new_ppv5(
"models/det.mnn".into(),
"models/rec.mnn".into(),
"models/dict.txt".into(),
);
// Create document pipeline
let config = DocPipelineConfig {
layout: layout_config,
ocr: ocr_config,
};
let mut pipeline = DocPipeline::new(config)?;
let result = pipeline.run("document.jpg")?;
// Generate markdown output
println!("{}", result.to_markdown());
Ok(())
}
Supported Layout Elements:
- Text, Title, Header, Footer
- Figure, Figure Caption
- Table, Table Caption
- Reference, Equation
Example:
cargo run --example doc_pipeline_demo -- \
--image document.jpg \
--layout-model models/DocOCR/layout.mnn \
--det-model models/det.mnn \
--rec-model models/rec.mnn \
--keys-path models/dict.txt
5. iOS Integration
Install via CocoaPods:
pod 'RustO', '~> 0.1'
Then in Swift:
import RustO
let ocr = try RapidOCR(
detModelPath: Bundle.main.path(forResource: "det", ofType: "mnn")!,
recModelPath: Bundle.main.path(forResource: "rec", ofType: "mnn")!,
dictPath: Bundle.main.path(forResource: "dict", ofType: "txt")!
)
let results = try ocr.recognizeFile("image.jpg")
for result in results {
print("\(result.text): \(result.score)")
}
API Reference
RapidOCRConfig
Configuration structure for initializing the OCR engine.
pub struct RapidOCRConfig {
pub det_model_path: String, // Path to detection MNN model
pub rec_model_path: String, // Path to recognition MNN model
pub dict_path: String, // Path to character dictionary
}
TextResult
OCR result for a single detected text region.
pub struct TextResult {
pub text: String, // Recognized text
pub score: f32, // Confidence score (0.0-1.0)
pub box_points: [(f32, f32); 4], // Bounding box corners
}
RapidOCR
Main OCR engine.
impl RapidOCR {
// Create a new OCR instance
pub fn new(config: RapidOCRConfig) -> Result<Self, EngineError>;
// Run OCR on an image file
pub fn ocr<P: AsRef<Path>>(&self, image_path: P) -> Result<Vec<TextResult>, EngineError>;
// Run OCR on image data in memory
pub fn ocr_from_bytes(&self, image_data: &[u8]) -> Result<Vec<TextResult>, EngineError>;
}
FFI Bindings
The library includes C FFI bindings for integration with other languages. Enable with the ffi feature:
cargo build --release --features ffi
This produces:
- Linux:
librusto.so - macOS:
librusto.dylib - Windows:
rusto.dll
See src/ffi.rs for the complete FFI API documentation.
๐ฆ Models
RustO! uses PaddleOCR models converted to ONNX format:
Supported Models
- PPOCRv4 - PaddleOCR version 4 models
- PPOCRv5 - PaddleOCR version 5 models (recommended)
Model Components
- Detection Model (
det.onnx) - Detects text regions in images - Recognition Model (
rec.onnx) - Recognizes text within detected regions - Dictionary (
dict.txt) - Character dictionary for text recognition
Download Models
# Example: Download PPOCRv5 models
wget https://github.com/RapidAI/RapidOCR/releases/download/v1.3.0/det.onnx
wget https://github.com/RapidAI/RapidOCR/releases/download/v1.3.0/rec.onnx
wget https://github.com/RapidAI/RapidOCR/releases/download/v1.3.0/dict.txt
โก Performance
Benchmarks
Tested on typical document images:
| Metric | Value |
|---|---|
| Detection | ~80ms |
| Recognition (per box) | ~120ms |
| Total (28 boxes) | ~3.5s |
| Memory Peak | ~200MB |
Comparison with OpenCV-based implementations
| Aspect | RustO! | OpenCV-based |
|---|---|---|
| Speed | โ Similar (ยฑ10%) | Baseline |
| Accuracy | โ 99.3% parity | 100% |
| Binary Size | โ Smaller | Larger (OpenCV deps) |
| Memory Usage | โ Lower | Higher (OpenCV overhead) |
| Dependencies | โ Minimal | OpenCV required |
| Safety | โ Memory safe | Manual memory management |
Configuration
Cargo Features
[features]
default = [] # Pure Rust mode
use-opencv = ["opencv"] # Use OpenCV backend
ffi = [] # Enable C FFI bindings
Build Profiles
[profile.release]
opt-level = 3 # Maximum optimization
lto = "fat" # Link-time optimization
codegen-units = 1 # Single codegen unit for better optimization
strip = true # Strip symbols
panic = "abort" # Smaller binary
Development
Run Tests
cd rapidocr
cargo test
cargo test --features use-opencv # Test OpenCV backend
Run Benchmarks
cargo bench
Check Code
cargo clippy
cargo fmt --check
Known Issues
Rust Library (contours.rs)
- โ ๏ธ Unused functions (400+ lines) - cleanup pending
- โ ๏ธ Minor lint warnings - non-blocking
Remaining Parity Gap (0.7%)
- 2 minor text differences out of 28 boxes
- Caused by: Spacing (
"Gol. Darah:"vs"Gol. Darah :") - Impact: Negligible for production use
License
MIT (or your license)
Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Run tests:
cargo test - Submit a pull request
Support
- ๐ง Email: support@rapidocr.com
- ๐ฌ Discussions: GitHub Discussions
- ๐ Issues: GitHub Issues
๐ Acknowledgments
RustO! builds upon the excellent work of:
- RapidOCR - Architecture and design inspiration
- PaddleOCR - State-of-the-art OCR models (PPOCRv4/v5)
- ONNX Runtime - Cross-platform inference engine
- Rust Community - Excellent tooling and libraries (image, imageproc, nalgebra)
๐ Citation
If you use RustO! in your research or project, please cite:
@software{rusto2024,
title = {RustO! - Pure Rust OCR Library},
author = {byrizki},
year = {2024},
url = {https://github.com/byrizki/rusto-rs},
note = {Based on RapidOCR and powered by PaddleOCR models}
}
Also consider citing the underlying technologies:
- PaddleOCR: https://github.com/PaddlePaddle/PaddleOCR
- RapidOCR: https://github.com/RapidAI/RapidOCR
<div align="center">
Status: Production Ready ๐
Version: 0.1.2
License: MIT
Made with โค๏ธ and ๐ฆ Rust
Report Bug ยท Request Feature ยท Contribute
</div>
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net5.0 was computed. net5.0-windows was computed. net6.0 is compatible. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
| .NET Core | netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
| .NET Standard | netstandard2.1 is compatible. |
| MonoAndroid | monoandroid was computed. |
| MonoMac | monomac was computed. |
| MonoTouch | monotouch was computed. |
| Tizen | tizen60 was computed. |
| Xamarin.iOS | xamarinios was computed. |
| Xamarin.Mac | xamarinmac was computed. |
| Xamarin.TVOS | xamarintvos was computed. |
| Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.1
- No dependencies.
-
net6.0
- No dependencies.
-
net8.0
- No dependencies.
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.