TR2025-116

Audio Signal Processing in the Artificial Intelligence Era: Challenges and Directions


    •  Steinmetz, C., Uhle, C., Everardo, F., Mitcheltree, C., McElveen, J.K., Jot, J.-M., Wichern, G., "Audio Signal Processing in the Artificial Intelligence Era: Challenges and Directions", Journal of the Audio Engineering Society, DOI: 10.17743/​jaes.2022.0209, Vol. 73, No. 7/8, pp. 406-428, August 2025.
      BibTeX TR2025-116 PDF
      • @article{Steinmetz2025aug,
      • author = {Steinmetz, Christian and Uhle, Christian and Everardo, Flavio and Mitcheltree, Christopher and McElveen, J. Keith and Jot, Jean-Marc and Wichern, Gordon},
      • title = {{Audio Signal Processing in the Artificial Intelligence Era: Challenges and Directions}},
      • journal = {Journal of the Audio Engineering Society},
      • year = 2025,
      • volume = 73,
      • number = {7/8},
      • pages = {406--428},
      • month = aug,
      • doi = {10.17743/jaes.2022.0209},
      • url = {https://www.merl.com/publications/TR2025-116}
      • }
  • MERL Contact:
  • Research Areas:

    Artificial Intelligence, Machine Learning, Speech & Audio

Abstract:

Artificial intelligence ( AI ) has seen significant advancement in recent years, leading to increasing interest in integrating these techniques to solve both existing and emerging problems in audio engineering. In this paper, we investigate current trends in the application of AI for audio engineering, outlining open problems and applications in the research field. We begin by providing an overview of AI-based algorithm development in the context of audio, discussing problem selection and taxonomy. We then explore human-centric AI challenges and how they relate to audio engineering, including ethics, trustworthiness, explainability, and interaction, emphasizing the need for ethically sound and human-centered AI systems. Subsequently, we examine technical challenges that arise when applying modern AI techniques to audio, including robust generalization, audio quality, high sample rates, and real-time processing with low latency. Finally, we outline applications of AI in audio engineering, covering the development of machine learning-powered audio effects, synthesizers, and automated mixing systems, as well as spatial audio, speech enhancement, dialog separation and music generation. We emphasize the need for a balanced approach that integrates human-centric concerns with technological advancements, advocating for responsible and effective application of AI.

 

  • Related News & Events

    •  NEWS    MERL Researcher Diego Romeres Collaborates with Mitsubishi Electric and University of Padua to Advance Physics-Embedded AI for Predictive Equipment Maintenance
      Date: December 10, 2025
      MERL Contact: Diego Romeres
      Research Areas: Artificial Intelligence, Machine Learning, Robotics
      Brief
      • Mitsubishi Electric Research Laboratories (MERL) researchers, together with collaborators at Mitsubishi Electric’s Information Technology R&D Center in Kamakura, Kanagawa Prefecture, Japan, and the Department of Information Engineering at the University of Padua, have developed a cutting-edge physics-embedded AI technology that substantially improves the accuracy of equipment degradation estimation using minimal training data. This collaborative effort has culminated in a press release by Mitsubishi Electric Corporation announcing the new AI technology as part of its Neuro-Physical AI initiative under the Maisart program.

        The interdisciplinary team, including MERL Senior Principal Research Scientist and Team Leader Diego Romeres and University of Padua researchers Alberto Dalla Libera and Giulio Giacomuzzo, combined expertise in machine learning, physical modeling, and real-world industrial systems to embed physics-based models directly into AI frameworks. By training AI with theoretical physical laws and real operational data, the resulting system delivers reliable degradation estimates on the torque of robotic arms even with limited datasets. This result addresses key challenges in preventive maintenance for complex manufacturing environments and supports reduced downtime, maintained quality, and lower lifecycle costs.

        The successful integration of these foundational research efforts into Mitsubishi Electric’s business-scale AI solutions exemplifies MERL’s commitment to translating fundamental innovation into real-world impact.
    •