Volume 20 No 10 (2022)
 Download PDF
IMPROVE THE PERFORMANCE OF NON-INTRUSIVE SPEECH QUALITY ASSESSMENT USING MACHINE LEARNING ALGORITHMS
Dr.Sandeep U. Kadam, Dr.Anand katri, Dr.Vajid N Khan, Dr. Avaneesh Singh, Dr. Dattatray G. Takale, Dr. Dattatray S. Galhe
Abstract
The Sensing tasks, such as evaluating voice clarity, are difficult for computers. (SQA). Traditional approaches to SQA have favored objective techniques that require the existence of a separate reference pair. In practice, however, where reliable information about the world is often hard to come by, these approaches unavoidably fall short. Recent months have seen a surge in interest in non-invasive techniques that use neural networks to forecast evaluations or scores; however, these approaches suffer from inconsistency and require labelled data for training. In this piece, we advocate for a fresh approach to evaluating the quality of a speaker's voice. Taking cues from humans' innate ability to compare and evaluate the quality of speech signals even when they have non-matching contents, we introduce a novel framework that predicts a subjective relative quality score for a given speech signal with respect to any given reference without using any subjective data. This score's reliability depends on how well the incoming speech sample matches the standard. We use a convolutive and additive noise-exposed speech dataset that has been assessed using crowd-sourced QoE labels, Pearson association with MOS labels, and mean-squared error of the estimated MOS. The finest Pearson association (0.87) and the lowest mean squared error of any of the techniques we suggest are obtained by a completely connected deep neural network trained on Mel-frequency features. (0:15)
Keywords
Non-intrusive speech assessment models, deep learning, multi-objective learning, speech enhancement.
Copyright
Copyright © Neuroquantology

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Articles published in the Neuroquantology are available under Creative Commons Attribution Non-Commercial No Derivatives Licence (CC BY-NC-ND 4.0). Authors retain copyright in their work and grant IJECSE right of first publication under CC BY-NC-ND 4.0. Users have the right to read, download, copy, distribute, print, search, or link to the full texts of articles in this journal, and to use them for any other lawful purpose.