Matteo Spanio

Centro di Sonologia Computazionale.

Department of Information Engineering.

iome_bn.jpg

CSC lab, building DEI/S

Via Gardenigo 6/A

Padua, Italy

Hi there, I’m Matteo Spanio, PhD student at the University of Padua. I’m interested in:

  • 🤖 machine learning,
  • 📡 signal processing,
  • 🎹 music.

My Ph. D. project focuses on studying deep learning methods for music generation based on cross modal perception.

I’m member of the MPAI group, the international, unaffiliated, no-profit organisation developing standards for AI-based data coding.

I’m also a musician, and I play the clarinet on a regular basis in many orchestras and ensembles (Orchestra di Padova e del Veneto, Orchestra del Friuli Venezia Giulia, Orchestra San Marco, Concordia Chamber Orchestra, Rossini Ensemble and many others).

To see my work, check out my projects and publications. Time permitting, I also write blog posts.

news

May 28, 2026 I am happy to announce that I will be speaking at the Ital-IA 2026 conference in Rome on June 18th. I will present two papers accepted for publication in the conference proceedings: a review on the AI related studies we do at CSC “Preserving Memory, Expanding Creativity: Human-Centered AI Trajectory in Engineering and Music Research at the CSC of Padua University” and a follow up work on Lilybert: “Can LLMs understand LilyPond? A benchmark for symbolic music generation and understanding”. See you there!
Apr 12, 2026 We release Lilybert, a new model for music understanding. It introduces lilypond notation format to DL. It is available for download on Hugging Face, alongside an open dataset available on zenodo. Learn more about it in our preprint paper. Enjoy!
Aug 28, 2025 Next week I’ll present torchfx, a new python library for GPU accelerated audio effects, at DAFx 2025 in Ancona, Italy. The conference will take place from 2 to 6 September 2025. Hope to see you there!
Mar 05, 2025 I am happy to announce the release of a our new deep learning model for synesthetic music generation. It is available for download on Hugging Face. Learn more about it in our preprint paper. Enjoy!
Nov 03, 2024 My two new papers Towards Emotionally Aware AI: Challenges and Opportunities in the Evolution of Multimodal Generative Models and Filming the sound: Anomaly Detection on Audio Tape Recordings using Computer Vision Algorithms have been accepted for publication at the 23rd International Conference of the Italian Association for Artificial Intelligence that will take place in Bozen between 25 and 28 november. See you there!

latest posts

selected publications

  1. Enhancing Preservation and Restoration of Open Reel Audio Tapes Through Computer Vision
    Alessandro Russo, Matteo Spanio, and Sergio Canazza
    In Image Analysis and Processing - ICIAP 2023 Workshops, 2024
  2. AES
    aes2024.jpg
    A novel derivative-based approach for the automatic detection of time-reversed audio in the MPAI/IEEE-CAE ARP international standard
    Marina Bosi, Fabio Zanini, Matteo Spanio, and 2 more authors
    Journal of the Audio Engineering Society, 2024
  3. Frontiers in CS
    frontiers2025.webp
    A multimodal symphony: integrating taste and sound through generative AI
    Matteo Spanio, Massimiliano Zampini, Antonio Rodà, and 1 more author
    Frontiers in Computer Science, 2025