Skip to main content
Ctrl+K

TorchFX

  • Guides
  • API Reference
  • Blog
  • Glossary
  • GitHub
  • PyPI
  • Guides
  • API Reference
  • Blog
  • Glossary
  • GitHub
  • PyPI
  • Posts tagged precision

Posts tagged precision

FP32 on the GPU: 3–3.6× and the End of the Consumer-GPU Penalty

  • 04 June 2026
  • Matteo Spanio
  • features
  • cuda fp32 performance precision kernels

This is the GPU half of the promise we made in 0.5.4: “retuning the CUDA SOS kernel for mixed precision so float32 gets the same fast path on GPU that it now has on CPU.” TorchFX 0.6.0 delivers it.

Read more ...


© Copyright 2026, Matteo Spanio.

Created using Sphinx 8.1.3.

Built with the PyData Sphinx Theme 0.17.0.