Testing & Integration Improvements¶

A comprehensive set of improvements to the testing infrastructure and development workflow, addressing critical issues with test reliability and linting consistency.

🎯 Overview¶

This release focuses on infrastructure stability and developer experience improvements, ensuring that tests run reliably across all environments and that linting behavior is consistent between local development and CI/CD pipelines.

✨ Key Improvements¶

🧪 Test Import Reliability¶

Problem Solved: Tests were failing with ModuleNotFoundError when optional dependencies (numpy, pandas) weren't available, causing CI failures and development friction.

Solution Implemented: - Safe Import Pattern: Replaced direct imports with pytest.importorskip() for all optional dependencies - Test Isolation: Tests now gracefully skip when dependencies are unavailable rather than failing - Robust Error Handling: Improved error messages and fallback behavior

Files Improved: - tests/test_auto_detection_and_metadata.py - Fixed 23 critical auto-detection tests - tests/test_dataframe_orientation_regression.py - Fixed 15 DataFrame orientation tests
- tests/conftest.py - Enhanced dependency checking with importlib.util.find_spec()

🔧 Linting Configuration Alignment¶

Problem Solved: Local development and CI environments were using different linting rules, causing "passing locally, failing in CI" scenarios that frustrated developers.

Root Cause Analysis:

# Local (inconsistent)
ruff check .                    # Only current directory

# CI (comprehensive)
ruff check datason/ tests/ examples/  # All relevant directories

Solution Implemented: - Unified Configuration: Added comprehensive ignore rules to pyproject.toml - Developer Tooling: Added ruff-ci alias for local testing that matches CI exactly - Documentation: Clear guidance on running the same checks locally as CI

📋 Code Quality Improvements¶

Linting Issues Resolved:

Issue Type	Count	Description	Solution
F401	8	Unused imports	Scoped imports to function level
B007	3	Unused loop variables	Renamed to underscore prefix
SIM108	2	If-else to ternary	Simplified conditional expressions
B904	1	Exception chaining	Added `from None`
SIM117	12	Nested with statements	Added to global ignore (style preference)

🛠️ Technical Implementation¶

Safe Import Pattern¶

Before (Fragile):

import numpy as np
import pandas as pd

def test_numpy_functionality():
    # Test fails if numpy not installed
    array = np.array([1, 2, 3])
    assert len(array) == 3

After (Robust):

def test_numpy_functionality():
    np = pytest.importorskip("numpy")
    # Test skips gracefully if numpy not installed
    array = np.array([1, 2, 3])
    assert len(array) == 3

Configuration Alignment¶

Enhanced pyproject.toml:

[tool.ruff.lint]
ignore = [
    "E501",    # line too long (handled by formatter)
    "B008",    # do not perform function calls in argument defaults
    "C901",    # too complex (let developers decide)
    "SIM105",  # Use contextlib.suppress - too pedantic for import error handling
    "SIM117",  # Use single with statement with multiple contexts - style preference
    "B017",    # assert raises - sometimes needed for testing exception types
    "B007",    # Loop control variable not used - sometimes needed for counting
]

[tool.ruff.lint.per-file-ignores]
"examples/*.py" = ["T201"]  # Allow print statements in examples
"tests/test_benchmarks.py" = ["T201"]  # Allow print statements in benchmarks
"tests/test_*.py" = ["E402"]  # Allow imports after pytest.importorskip for optional dependencies

Developer Workflow Improvements¶

New Shell Alias (added to ~/.zshrc):

alias ruff-ci="ruff check datason/ tests/ examples/"

Pre-commit Hook Alignment:

# Now runs the same checks as CI
pre-commit run --all-files

📊 Results & Impact¶

Test Reliability¶

✅ 38 Critical Tests Fixed: All auto-detection and DataFrame regression tests now pass consistently
✅ Zero Import Failures: Tests skip gracefully when optional dependencies unavailable
✅ CI Stability: No more random failures due to missing dependencies

Developer Experience¶

✅ Local/CI Parity: Identical linting behavior in all environments
✅ Faster Feedback: Catch issues locally before pushing to CI
✅ Clear Documentation: Developers know exactly how to run the same checks as CI

Code Quality¶

✅ 26 Linting Issues Resolved: Clean codebase with consistent style
✅ Maintainable Patterns: Established best practices for optional dependency handling
✅ Security Improvements: Better exception handling with proper chaining

🚀 Usage Examples¶

Running Tests with Optional Dependencies¶

# Install minimal dependencies
pip install -e .

# Run core tests (no optional deps required)
pytest tests/test_core.py -v

# Install optional dependencies and run full suite
pip install -e ".[pandas,numpy,ml]"
pytest tests/ -v

# Tests automatically skip when dependencies unavailable
pytest tests/test_auto_detection_and_metadata.py -v
# SKIPPED [1] tests/test_auto_detection_and_metadata.py:15: could not import 'numpy'

Local Development Workflow¶

# Check exactly what CI will check
ruff-ci

# Or run manually to match CI
ruff check datason/ tests/ examples/

# Run pre-commit hooks (now aligned with CI)
pre-commit run --all-files

# Run tests with coverage
pytest tests/ -v --cov=datason

Optional Dependency Patterns in Tests¶

def test_pandas_integration():
    """Test pandas functionality when available."""
    pd = pytest.importorskip("pandas")

    # Now safe to use pandas
    df = pd.DataFrame({"a": [1, 2, 3]})
    result = datason.serialize(df)
    assert isinstance(result, list)

def test_numpy_arrays():
    """Test numpy array serialization when available."""
    np = pytest.importorskip("numpy")

    # Safe to use numpy
    arr = np.array([[1, 2], [3, 4]])
    result = datason.serialize(arr)
    assert result["_type"] == "numpy.ndarray"

🔍 Configuration Reference¶

Ruff Settings¶

# Key ignore rules for datason development
ignore = [
    "SIM117",  # Style preference: nested with statements OK
    "B007",    # Unused loop vars OK for counting iterations  
    "E402",    # Imports after pytest.importorskip OK in tests
]

Pytest Configuration¶

[tool.pytest.ini_options]
markers = [
    "numpy: marks tests requiring numpy",
    "pandas: marks tests requiring pandas",
    "ml: marks tests requiring ML libraries",
    "optional: marks tests for optional dependency functionality",
]

🎉 Developer Benefits¶

Before This Release¶

# Frustrating local/CI differences
✅ Local: ruff check .           # Passes
❌ CI: ruff check datason/ tests/ examples/  # Fails

# Flaky test failures  
❌ ModuleNotFoundError: No module named 'numpy'
❌ ImportError: Missing optional dependency 'pandas'

After This Release¶

# Consistent everywhere
✅ Local: ruff-ci                # Same as CI
✅ CI: ruff check datason/ tests/ examples/  # Same rules

# Reliable test behavior
✅ SKIPPED: could not import 'numpy' (expected)
✅ 38/38 auto-detection tests passing

Contributing Guide - Development setup and guidelines
CI Pipeline Guide - Complete CI/CD documentation
Security - Security best practices in testing

Release Commit: 2a61260 - Fix test import issues and align ruff configuration
GPG Verified: ✅ Signed commit with comprehensive test coverage and documentation