This is an accompanying repository for the article Regularized autoregressive modeling and its application to audio signal reconstruction, which is to be submitted to IEEE Transactions on Audio, Speech, and Language Processing.
Autoregressive (AR) modeling is invaluable in signal processing, in particular in speech and audio fields. Attempts in the literature can be found that regularize or constrain either the time-domain signal values or the AR coefficients, which is done for various reasons, including the incorporation of prior information or numerical stabilization. Although these attempts are appealing, an encompassing and generic modeling framework is still missing. We propose such a framework and the related optimization problem and algorithm. We discuss the computational demands of the algorithm and explore the effects of various improvements on its convergence speed. In the experimental part, we demonstrate the usefulness of our approach on the audio declipping and dequantization problems. We compare its performance against state-of-the-art methods and demonstrate the competitiveness of the proposed method in declipping musical signals, and its superiority in declipping speech. The evaluation includes a heuristic algorithm of generalized linear prediction (GLP), a strong competitor which has only been presented as a patent and is new in the scientific community.
The submitted manuscript is available at arXiv.
Accompanying webpage with examples for listening is available through GitHub pages.
The repository contains MATLAB implementation of all the methods and experiments described in the article.
It is organized as follows:
- dequantization toolbox – clone of the repository audio_dequantization
- docs – source files for the accompanying webpage
- results – numerical results of the experiments presented in the paper, as well as the scripts used to plot the results
- signals – music audio signals used in the experiments which do not use the full set from survey toolbox
- speech – speech signals used in the experiments
- survey toolbox – clone of selected parts of the repository declipping2020_codes, which is used for comparison of the proposed method with the state-of-the-art optimization-based audio declipping methods
- utils – all the functions implementing the proposed framework and functions used by the plotting scripts in results
acceleration_test.mtests different acceleration options for the DRA and ACSconsistency_test.manalyzes the results in terms of performance and consistency; note that this code cannot be run without first runningsurvey_test.mand generating all the declipped waveformsdemo.mis a demonstrative script which runs a single instance of the declipping experiment using GLP and the proposed ACS approachdemo_quant.mis a demonstrative script which runs a single instance of the dequantization experiment using the proposed ACS approachdemo_speech.mis a demonstrative script which runs a single instance of the speech declipping experiment using the proposed ACS approachdequantization_test.mperforms the dequantization experiment from the article Audio Dequantization Using (Co)Sparse (Non)Convex Methods using the proposed ACS approachiteration_tradeoff.mtests the proposed method for different combinations of the ACS (outer) and DRA (inner) iterationsmain_csl1_speech.m,main_Social_Sparsity_speech.m,main_spade_speech.mare minor modifications of the sripts from survey toolbox for the sake of the speech experimentoracle_test.mtests the inpainting / declipping using Janssen algorithm or GLP and compares the progression of AR coefficients to the coefficients of the ground truth signalspeech_test.mperforms the speech declipping experiment inspired by the article A Survey and an Extensive Evaluation of Popular Audio Declipping Methods using the proposed ACS approach and GLPspeech_test_add_metrics.mcomputes the STOI, MOS and NSIM metrics for the declipped speech signals; note that this code cannot be run without first runningspeech_test.m,main_csl1_speech.m,main_Social_Sparsity_speech.m, andmain_spade_speech.mto generate all the declipped waveformssurvey_test.mperforms the declipping experiment from the article A Survey and an Extensive Evaluation of Popular Audio Declipping Methods using the proposed ACS approach and GLPsurvey_test_add_CR.mperforms the post-processing of the results fromsurvey_test.mas described in the article Audio Declipping Performance Enhancement via Crossfading; note that this code cannot be run without first runningsurvey_test.mto generate all the declipped waveforms
The codes were tested in MATLAB R2025a. They depend on the following toolboxes:
- Parallel Computing Toolbox,
- Signal Processing Toolbox,
- Statistics and Machine Learning Toolbox,
- The Large Time-Frequency Analysis Toolbox (LTFAT).
Special thank you goes to:
- the authors of the declipping survey [1], the follow-up article [2] and the dequantization contribution [3] for sharing publically the code and numerical results, which allowed to build on their work,
- the authors of [4] and the Audio Inpainting Toolbox for sharing the implementation of the basic Janssen algorithm,
- İlker Bayram for sharing the codes for [5].
[1] P. Záviška, P. Rajmic, A. Ozerov and L. Rencker, “A Survey and an Extensive Evaluation of Popular Audio Declipping Methods,” IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 1, pp. 5–24, 2021, doi: 10.1109/JSTSP.2020.3042071.
[2] P. Záviška, P. Rajmic and O. Mokrý, “Audio declipping performance enhancement via crossfading,” Signal Processing, vol. 192, 2022, doi: 10.1016/j.sigpro.2021.108365.
[3] P. Záviška, P. Rajmic and O. Mokrý, “Audio Dequantization Using (Co)Sparse (Non)Convex Methods,” 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 2021, pp. 701–705, doi: 10.1109/ICASSP39728.2021.9414637.
[4] A. Adler, V. Emiya, M. G. Jafari, M. Elad, R. Gribonval and M. D. Plumbley, “Audio Inpainting,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 922-932, 2012, doi: 10.1109/TASL.2011.2168211.
[5] İ. Bayram, “Proximal Mappings Involving Almost Structured Matrices,” IEEE Signal Processing Letters, vol. 22, no. 12, pp. 2264-2268, 2015, doi: 10.1109/LSP.2015.2476381.