Roadmap to v1.0.0#

Current Version: 0.4.0-dev (Beta Track) Target: v1.0.0 Stable Release

This roadmap outlines the development path for TorchFX from the current beta state to a production-ready v1.0.0 release. The plan is organized into major epics, each containing specific deliverables and tasks.

Vision#

TorchFX v1.0.0 will be a production-ready, GPU-accelerated audio DSP library with:

Real-time processing capabilities for live audio (microphone/instrument input)
Modern CLI tool combining sox compatibility with GPU acceleration
Optimized performance through custom CUDA kernels
Professional documentation with comprehensive tutorials and API reference
>90% test coverage with integration and audio quality tests
Semantic versioning with backward compatibility guarantees

Current State#

Strengths#

✅ Solid core DSP architecture (~2000 LOC)
✅ GPU acceleration working
✅ 393 tests with >90% coverage
✅ Published research paper (arXiv:2504.08624)
✅ Clean API with pipe operator support
✅ Professional Sphinx documentation with tutorials
✅ Real-time audio processing with circular buffers
✅ Full-featured CLI with sox compatibility
✅ Interactive REPL with live performance mode
✅ Complete validation and logging infrastructure
✅ API stability guarantees with deprecation system

Gaps#

❌ No custom CUDA kernels yet
❌ Limited ML integration examples
❌ Missing some advanced effects (compressor, phaser, pitch shift)
❌ No VST3 wrapper

Estimated Completion: ~85% ready for v1.0.0

Implementation Phases#

Phase 1: Foundation ✅ COMPLETED#

Priority: Critical

Epic 1: Core Library Stabilization ✅
- ✅ Complete missing features
- ✅ API stabilization
- ✅ Error handling
Epic 6: Testing Infrastructure ✅
- ✅ Expand unit tests (393 tests, >90% coverage)
- ✅ CI improvements

Phase 2: Major Features ✅ COMPLETED#

Priority: Critical

Epic 2: Real-Time Audio Processing ✅
- ✅ Audio backends (SoundDevice)
- ✅ Real-time pipeline with circular buffers
- ✅ Thread-safe parameter updates
- ✅ Stream processor for large files
Epic 3: CLI Application ✅
- ✅ Core CLI with 11 commands
- ✅ Pipeline processing (batch, pipes, watch)
- ✅ Interactive mode with live performance
- ✅ Preset management
- ✅ Sox-compatible commands
Epic 5: Documentation ✅
- ✅ Complete API reference
- ✅ CLI guide
- ✅ Tutorials and examples
- ✅ Migration guide and API stability docs

Phase 3: Optimization & Polish (v1.0)#

Priority: Medium — NEXT

Epic 4: CUDA Kernels (can start early)
- IIR kernels (priority)
- Effect kernels
Epic 7: Additional Effects
- Can be added incrementally in v1.1+

Success Metrics for v1.0.0#

✅ API Stability: No breaking changes after v1.0.0 without major version bump
- ✅ Implemented deprecation system
- ✅ API stability guarantees documented
- ✅ Migration guide template created
✅ Test Coverage: >90% code coverage
- ✅ 393 tests across all modules
- ✅ Unit, integration, and CLI tests
✅ Documentation: 100% of public API documented with examples
- ✅ Complete API reference
- ✅ CLI guide with comprehensive examples
- ✅ Tutorials and how-to guides
✅ Performance:
- ✅ Real-time: 48kHz, 2048 buffer, ~46ms latency (tested in REPL)
- ⚠️ Batch: >100x real-time on modern GPU (needs CUDA kernels for further optimization)
✅ Platform Support: Linux, macOS, Windows with Python 3.10-3.13
- ✅ CI testing on multiple platforms
✅ CLI Functionality: All core commands working
- ✅ 11 commands implemented (process, info, play, record, convert, trim, concat, stats, preset, interactive, watch)
- ✅ Batch processing, pipes, TOML config, presets
✅ Community: Contributing guide, issue templates, active CI
- ✅ Style guide documented
- ✅ Roadmap maintained

Status: 6.5/7 metrics achieved — ready for v1.0.0 RC

Code Quality Standards#

TorchFX follows SOLID and DRY principles:

Single Responsibility: Each class has one clear purpose
Open/Closed: Extensible through inheritance and composition
Liskov Substitution: Consistent interfaces across similar classes
Interface Segregation: Narrow, focused interfaces
Dependency Inversion: Depend on abstractions, not implementations
Don’t Repeat Yourself: Shared utilities, reusable components

Future Considerations (Post-v1.0)#

Plugin system for third-party effects
ONNX export for deployment
Model Hub integration (HuggingFace)
Audio ML integration helpers
VST3 plugin wrapper (complex, long-term)

Contributing#

We welcome contributions! See the style guide for guidelines.

Current focus: Phase 3 (CUDA Optimization) & additional effects
Phase 1 & 2: ✅ COMPLETED
Good first issues: Check GitHub issues tagged good-first-issue
CLI Extension Ideas: Real-time visualization, AB comparison mode, spectrum analyzer
Questions: Open a discussion on GitHub

Roadmap to v1.0.0#

Vision#

Current State#

Strengths#

Gaps#

Epic 1: Core Library Stabilization#

1.1 Complete Missing Core Features#

1.2 API Stabilization#

1.3 Error Handling & Validation#

Epic 2: Real-Time Audio Processing#

2.1 Audio Backend Integration#

2.2 Real-Time Processing Pipeline#

2.3 Real-Time Effect Adaptations#

Epic 3: CLI Application#

3.1 Core CLI Architecture ✅#

3.2 Pipeline Processing & Sox Compatibility ✅#

3.3 Interactive Mode (REPL) ✅#

3.4 Watch Mode & Automation ✅#

Epic 4: Performance Optimization & CUDA#

4.1 CUDA Development Infrastructure#

4.2 IIR Filter CUDA Kernels (Priority 1)#

4.3 Time-Domain Effects CUDA Kernels (Priority 2)#

4.4 Batch Processing Optimizations (Priority 3)#

4.5 Performance Benchmarking#

Epic 5: Comprehensive Documentation#

5.1 API Reference Completion#

5.2 Tutorial & Guide Documentation#

5.3 Example Gallery#

5.4 Project Documentation#

Epic 6: Testing & Quality Assurance#

6.1 Expand Unit Test Coverage#

6.2 Integration Tests#

6.3 Audio Quality Tests#

6.4 Performance & Memory Tests#

6.5 CI/CD Improvements#

Epic 7: Additional Effects#

7.1 Dynamics Processing#

7.2 Modulation Effects#

7.3 Distortion & Saturation#

7.4 Pitch & Time Manipulation#

7.5 Spatial Audio#