Volume 20 No 13 (2022)
Multi Feature Sound Classification using Deep Learning
Vrushali K. Bongirwar, Samir N. Ajani, Archana V. Potnurwar
Abstract
Artificial intelligence plays an important role in acoustic recognition. In particular, the ability to identify environmental sounds automatically and accurately opens up a broad range of applications. Deep learning techniques can assist in recognizing the sounds we encounter in day-to-day life. Most previous work in environmental sound classification trains a model on a single set of features. A convolutional neural network (ConvNet) is a class of deep feed-forward neural network that exploits the strong spatially local correlation in natural images, and it performs well in visual analysis tasks. This paper has two key aims: the first is to build a multilabel classification system, and the second is to develop a stacked bidirectional Long Short-Term Memory (LSTM) network with two hidden layers to categorize multiple UAV sounds. Environmental classification is performed in three stages. First, the input signal is converted into a spectrogram image, a time-frequency representation, using the short-time Fourier transform. Second, features are extracted from this spectrogram using local binary patterns with three different radii and neighborhood sizes. The three distinct feature sets obtained from the spectrogram are then concatenated and used as one feature vector.
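The spectrogram and local binary pattern (LBP) stages described above can be sketched as follows. This is a minimal illustration, not the authors' implementation. It uses only the Python standard library, and the frame length, hop size, log compression, the rotation-invariant uniform LBP variant, and the (neighbors, radius) pairs (8, 1), (16, 2) and (24, 3) are all assumptions, since the abstract does not specify them.

```python
import cmath
import math


def stft_magnitude(signal, frame_len=64, hop=32):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform.

    Returns rows indexed by frequency bin and columns indexed by frame.
    frame_len and hop are assumed values, not taken from the paper.
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1
    # Precompute the DFT basis for the non-negative frequencies.
    basis = [[cmath.exp(-2j * math.pi * k * n / frame_len)
              for n in range(frame_len)] for k in range(n_bins)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        frames.append([abs(sum(x * b for x, b in zip(frame, basis[k])))
                       for k in range(n_bins)])
    return [list(row) for row in zip(*frames)]  # transpose to bins x frames


def bilinear(img, y, x):
    """Bilinearly interpolate img at the fractional position (y, x)."""
    y0, x0 = max(0, math.floor(y)), max(0, math.floor(x))
    y1, x1 = min(y0 + 1, len(img) - 1), min(x0 + 1, len(img[0]) - 1)
    dy, dx = y - y0, x - x0
    top = img[y0][x0] * (1 - dx) + img[y0][x1] * dx
    bottom = img[y1][x0] * (1 - dx) + img[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def lbp_histogram(img, points, radius):
    """Normalized histogram of rotation-invariant uniform LBP labels.

    Each interior pixel is compared with `points` neighbors on a circle of
    the given radius. Uniform patterns (at most two circular 0/1
    transitions) are labeled by their number of ones; all other patterns
    share the label points + 1, giving points + 2 bins.
    """
    offsets = [(-radius * math.sin(2 * math.pi * p / points),
                radius * math.cos(2 * math.pi * p / points))
               for p in range(points)]
    hist = [0] * (points + 2)
    for y in range(radius, len(img) - radius):
        for x in range(radius, len(img[0]) - radius):
            center = img[y][x]
            bits = [1 if bilinear(img, y + dy, x + dx) >= center else 0
                    for dy, dx in offsets]
            # bits[i - 1] wraps around at i == 0, so the count is circular.
            transitions = sum(bits[i] != bits[i - 1] for i in range(points))
            hist[sum(bits) if transitions <= 2 else points + 1] += 1
    total = sum(hist)
    return [h / total for h in hist] if total else [0.0] * len(hist)


def multi_scale_lbp_features(signal, settings=((8, 1), (16, 2), (24, 3))):
    """Concatenate LBP histograms computed at several (points, radius) scales."""
    spectrogram = stft_magnitude(signal)
    # Log compression is an assumed preprocessing step.
    log_spec = [[math.log1p(v) for v in row] for row in spectrogram]
    features = []
    for points, radius in settings:
        features.extend(lbp_histogram(log_spec, points, radius))
    return features


# Hypothetical input: a 440 Hz tone sampled at 8 kHz.
tone = [math.sin(2 * math.pi * 440 * n / 8000) for n in range(512)]
features = multi_scale_lbp_features(tone)
print(len(features))  # 10 + 18 + 26 = 54
```

The resulting vector is what a downstream classifier would consume. In practice a numerical library would replace the hand-written transform and interpolation, which are spelled out here only to keep the sketch self-contained.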
Keywords
Deep learning, Long Short-Term Memory, Convolutional neural network.
Copyright
Copyright © Neuroquantology

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Articles published in Neuroquantology are available under the Creative Commons Attribution Non-Commercial No Derivatives Licence (CC BY-NC-ND 4.0). Authors retain copyright in their work and grant IJECSE right of first publication under CC BY-NC-ND 4.0. Users have the right to read, download, copy, distribute, print, search, or link to the full texts of articles in this journal, and to use them for any other lawful purpose.