NOTEARS: Rust Implementation

Production-grade Rust implementation of the NO TEARS algorithm for learning directed acyclic graph (DAG) structures from observational data using continuous optimization.

Paper: Zheng et al. (2018) — DAGs with NO TEARS

Overview

NOTEARS learns sparse DAG structures by solving:

$$\min_{W} \frac{1}{2n}|X - XW|_F^2 + \lambda|W|_1 \quad \text{subject to} \quad h(W) = \text{tr}(e^{W \odot W}) - d = 0$$

where:

$W$ is the $d \times d$ weight matrix defining the DAG
$h(W)$ is the differentiable acyclicity constraint
$\odot$ denotes element-wise product (Hadamard)
$e^{(\cdot)}$ is the matrix exponential

Key Features

✅ Differentiable Acyclicity Constraint — Enable gradient-based optimization
✅ O(d³) per-iteration Complexity — Via efficient matrix exponential
✅ L-BFGS + Augmented Lagrangian — State-of-the-art constrained optimization
✅ Production-Grade Error Handling — Comprehensive validation & descriptive errors
✅ Numerical Stability — Across varying data regimes (n, d, λ)
✅ Comprehensive Benchmarks — Performance profiling suite included

Quick Start

Installation

Add to your Cargo.toml:

[dependencies]
notears = "0.1"
ndarray = "0.15"

Minimal Example

use notears::optimization::solve;
use notears::utils::standardize_data;
use ndarray::Array2;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Load your observational data (n samples × d variables)
    let data = Array2::zeros((1000, 20));
    
    // Standardize (recommended)
    let standardized = standardize_data(&data)?;
    
    // Learn DAG structure (default config, λ=0.1)
    let result = solve(&standardized, 0.1)?;
    
    // Extract learned structure
    let w_estimated = result.weight_matrix;
    let edges = result.edges();
    let acyclicity = result.constraint_violation;
    
    println!("Learned {} edges", edges.len());
    println!("Constraint violation: {:.2e}", acyclicity);
    
    Ok(())
}

Advanced Usage with Custom Configuration

use notears::types::{OptimizationConfig, RegularizationConfig};
use notears::optimization::solve_with_config;
use ndarray::Array2;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let data = Array2::zeros((1000, 50));
    let standardized = notears::utils::standardize_data(&data)?;
    
    // Custom configuration for large-scale problems
    let opt_config = OptimizationConfig {
        max_outer_iterations: 20,
        max_lbfgs_iterations: 200,
        lbfgs_memory: 15,
        constraint_tolerance: 1e-8,
        penalty_rho_init: 0.1,
        progress_rate: 0.1,
        edge_threshold: 0.3,
    };
    
    let reg_config = RegularizationConfig::new(0.05, false)?;
    let result = solve_with_config(&standardized, opt_config, reg_config)?;
    
    Ok(())
}

Documentation

📚 Comprehensive Documentation Suite (~29,000 words)

Start here: NOTEARS Documentation Master Index — Navigation guide with 5 reading paths by role

For Different Audiences:

🚀 Quick Reference Guide — Practical cheat sheet
- Algorithm comparison, hyperparameter tuning, troubleshooting (5 common issues), validation checklist, 10 pitfalls
- Best for: Practitioners needing fast answers
🛠️ Rust Implementation Guide — Complete technical reference
- 7-phase implementation roadmap, mathematical foundations, code examples, production checklist
- Best for: Software engineers implementing NOTEARS
📊 Algorithm Analysis & Comparison — Deep dive for researchers
- Detailed comparison vs. PC/GES/LiNGAM/GOBNILP, 8-dimensional evaluation rubric, 4 real-world case studies
- Best for: Data scientists and researchers

Technical References:

API Reference — Complete type and function documentation with examples
Configuration Guide — Tuning for different data regimes (underdetermined, balanced, overdetermined)
Troubleshooting Guide — Common issues, diagnostics, and solutions (20+ topics)
Benchmarking Suite — Performance profiling, flamegraph, regression testing
Deployment Guide — Production setup, version management, CI/CD
Tutorial Notebooks — Jupyter notebooks with worked examples and best practices

Examples

Example 1: Synthetic Data with Known Structure

// Generate data from known DAG
let w_true = create_dag(10, 0.3);  // 10 nodes, 30% density
let data = sample_from_dag(&w_true, 1000)?;

// Learn structure
let result = solve(&data, 0.1)?;

// Evaluate: compare learned structure to ground truth
let accuracy = evaluate_structure(&w_true, &result.weight_matrix);
println!("Structure accuracy: {:.2%}", accuracy);

Example 2: Real-World Data Analysis

// Load real data (e.g., from CSV)
let data = load_data_from_file("data.csv")?;

// Standardize
let standardized = standardize_data(&data)?;

// Learn with lambda selection via cross-validation
let lambda = select_lambda(&standardized)?;
let result = solve(&standardized, lambda)?;

// Visualize DAG
visualize_dag(&result.weight_matrix, "learned_dag.svg")?;

Example 3: Sensitivity Analysis

let data = Array2::zeros((1000, 20));
let standardized = standardize_data(&data)?;

// Vary regularization strength
for lambda in [0.01, 0.05, 0.1, 0.2, 0.5] {
    let result = solve(&standardized, lambda)?;
    println!("λ={}: {} edges", lambda, result.edges().len());
}

Performance Targets

Problem	Rust (target)	Paper (reference)	Safety Margin
d=20, n=1000	1-2 sec	1-2 sec	3×
d=50, n=1000	5-10 sec	5-10 sec	5×
d=100, n=1000	30-60 sec	30-60 sec	10×

See BENCHMARKING.md for full performance analysis.

Building from Source

# Clone repository
git clone https://github.com/pristley/notears
cd notears

# Build library
cargo build --release

# Run tests
cargo test --release

# Run benchmarks
cargo bench --bench bench_end_to_end

# Generate documentation
cargo doc --open

Minimum Supported Rust Version (MSRV)

NOTEARS requires Rust 1.56+ and works with:

✅ Stable channel
✅ Beta channel
✅ Nightly channel (tested on latest)

Older Rust versions may work but are not officially supported.

Project Structure

notears/
├── src/                           # Core library
│   ├── lib.rs                    # Library root
│   ├── types.rs                  # Type definitions & configuration
│   ├── optimization.rs           # L-BFGS + Augmented Lagrangian solver
│   ├── acyclicity.rs             # Differentiable acyclicity constraint
│   ├── scoring.rs                # Loss functions & gradients
│   └── utils.rs                  # Matrix operations & utilities
├── tests/                         # Integration test suite
│   ├── test_acyclicity.rs        # Constraint tests
│   ├── test_optimization.rs      # Solver tests
│   ├── test_scoring.rs           # Loss function tests
│   ├── test_integration.rs       # End-to-end workflows
│   └── common.rs                 # Test utilities
├── benches/                       # Performance benchmarks
│   ├── bench_matrix_ops.rs       # Low-level operations (matrix exp, etc.)
│   ├── bench_optimization.rs     # Intermediate solver components
│   ├── bench_end_to_end.rs       # Full algorithm end-to-end
│   └── profiling_utils.rs        # Benchmark utilities & data generation
├── examples/                      # Tutorial Jupyter notebooks
│   ├── 01_quick_start.ipynb      # Getting started guide
│   └── 02_configuration_best_practices.ipynb  # Configuration by regime
├── docs/                          # Detailed documentation
│   ├── API.md                    # Function & type reference
│   └── CONFIGURATION.md          # Hyperparameter tuning guide
├── .github/workflows/             # GitHub Actions CI/CD
│   ├── tests.yml                 # Multi-version testing (1.56+, stable, beta, nightly)
│   ├── benchmarks.yml            # Performance benchmarking with regression detection
│   └── docs.yml                  # Documentation generation & deployment
├── README.md                      # Project overview (you are here)
├── CHANGELOG.md                   # Version history & release notes
├── BENCHMARKING.md                # Performance profiling suite guide
├── TROUBLESHOOTING.md             # Common issues & debugging
├── DEPLOYMENT.md                  # Production deployment guide
├── Cargo.toml                     # Project manifest (MSRV: 1.56+)
└── LICENSE                        # MIT license

Configuration Guide

Three preset configurations for common data regimes:

Small n, Large d (Underdetermined)

OptimizationConfig {
    max_outer_iterations: 20,
    max_lbfgs_iterations: 200,
    lbfgs_memory: 10,
    constraint_tolerance: 1e-7,
    penalty_rho_init: 1.0,      // Higher penalty for faster DAG feasibility
    progress_rate: 0.25,
    edge_threshold: 0.3,
}

Large n, Small d (Overdetermined)

OptimizationConfig {
    max_outer_iterations: 15,
    max_lbfgs_iterations: 100,
    lbfgs_memory: 20,
    constraint_tolerance: 1e-8,
    penalty_rho_init: 0.1,      // Lower penalty for fine-tuning
    progress_rate: 0.1,         // Stricter progress criterion
    edge_threshold: 0.3,
}

Balanced (Default)

OptimizationConfig::default()  // See types.rs for values

Testing

# Unit tests
cargo test --lib

# Integration tests
cargo test --test '*'

# All tests with logging
RUST_LOG=debug cargo test -- --nocapture

# Specific test
cargo test test_acyclicity_constraint -- --nocapture

Performance Profiling

# Run benchmarks
cargo bench --bench bench_matrix_ops
cargo bench --bench bench_optimization
cargo bench --bench bench_end_to_end

# Generate baseline for regression testing
cargo bench -- --save-baseline initial

# Compare against baseline
cargo bench -- --baseline initial

# Flame graph profiling (Linux)
cargo flamegraph --bench bench_end_to_end

See BENCHMARKING.md for detailed profiling guide.

CI/CD Status

Contributing

Contributions welcome! Please:

Fork the repository
Create a feature branch (git checkout -b feature/your-feature)
Add tests for new functionality
Ensure all tests pass: cargo test
Run benchmarks: cargo bench
Submit a pull request

References

NOTEARS Paper: Zheng et al. (2018) — DAGs with NO TEARS: Continuous Optimization for Learning Acyclic Graphs
Matrix Exponential: Higham (2008) — Functions of Matrices: Theory and Computation
Augmented Lagrangian: Boyd & Parikh (2011) — Distributed Optimization and Statistical Learning

License

Licensed under the MIT License — see LICENSE file for details.

Citation

If you use NOTEARS in your research, please cite:

@inproceedings{zheng2018dags,
  title={DAGs with NO TEARS: Continuous Optimization for Learning Acyclic Graphs},
  author={Zheng, Xun and Aragam, Bryon and Ravikumar, Pradeep K and Xing, Eric P},
  booktitle={Advances in Neural Information Processing Systems},
  pages={9472--9483},
  year={2018}
}

Acknowledgments

Original algorithm by Zheng et al. (2018)
Built with ndarray, nalgebra, and rayon

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github/workflows		.github/workflows
benches		benches
docs		docs
examples		examples
src		src
target		target
tests		tests
.gitignore		.gitignore
BENCHMARKING.md		BENCHMARKING.md
CHANGELOG.md		CHANGELOG.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
DEPLOYMENT.md		DEPLOYMENT.md
README.md		README.md
TROUBLESHOOTING.md		TROUBLESHOOTING.md

Folders and files

Latest commit

History

Repository files navigation

NOTEARS: Rust Implementation

Overview

Key Features

Quick Start

Installation

Minimal Example

Advanced Usage with Custom Configuration

Documentation

📚 Comprehensive Documentation Suite (~29,000 words)

For Different Audiences:

Technical References:

Examples

Example 1: Synthetic Data with Known Structure

Example 2: Real-World Data Analysis

Example 3: Sensitivity Analysis

Performance Targets

Building from Source

Minimum Supported Rust Version (MSRV)

Project Structure

Configuration Guide

Small n, Large d (Underdetermined)

Large n, Small d (Overdetermined)

Balanced (Default)

Testing

Performance Profiling

CI/CD Status

Contributing

References

License

Citation

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages