WaveSplit: A Multi-Stage Framework for Audio Enhancement and Audio Denoising – Combining Deep Learning with Psychoacoustic Principles and Adaptive Noise Processing

Authors

  • Parth Gandhi Department of Computer Engineering, Sardar Patel Institute of Technology, Mumbai, India
  • Kevin Doshi Department of Computer Engineering, Sardar Patel Institute of Technology, Mumbai, India
  • Anand Godbole Department of Computer Engineering, Sardar Patel Institute of Technology, Mumbai, India

DOI:

https://doi.org/10.65138/ijresm.v9i2.3413

Abstract

In the field of audio processing, noise interference poses a significant challenge, affecting speech intelligibility and communication quality across multiple domains. Current audio denoising methods often struggle with the delicate balance be- tween noise removal and speech preservation. This paper presents WaveSplit, a novel multi-stage framework for audio enhancement and denoising that addresses these limitations by combining deep learning techniques with psychoacoustic principles and adaptive noise processing. Building upon the CleanUNet architecture, our approach introduces several innovative components: adaptive SNR-based processing, harmonic enhancement that preserves critical speech components, vocal clarity enhancement, and perceptual processing leveraging human hearing characteristics. Evaluations demonstrate that our framework achieves superior performance compared to baseline models, with significant improvements in SNR (76.36 dB compared to 7.20-8.10 dB in baseline models), PESQ scores (1.05 improvement versus 0.77- 0.91), and STOI metrics (0.15 versus 0.09-0.13) while reducing the “robotic” artifacts common in traditional methods. This research has significant implications for applications including telecommunications, hearing assistive technologies, content production, and speech recognition systems. By addressing both objective quality metrics and perceptual factors, WaveSplit represents an advancement toward more effective, natural-sounding audio enhancement solutions for real-world environments.

Downloads

Download data is not yet available.

Downloads

Published

14-02-2026

Issue

Section

Articles

How to Cite

[1]
P. Gandhi, K. Doshi, and A. Godbole, “WaveSplit: A Multi-Stage Framework for Audio Enhancement and Audio Denoising – Combining Deep Learning with Psychoacoustic Principles and Adaptive Noise Processing”, IJRESM, vol. 9, no. 2, pp. 16–22, Feb. 2026, doi: 10.65138/ijresm.v9i2.3413.

Most read articles by the same author(s)