An Improved Audio Classification Method Based on Parameter-Free Attention Combined with Self-Supervision

Gong Xuchao; and Li Zongmin

doi:10.3724/SP.J.1089.2023.19353

Gong Xuchao, and Li Zongmin. An Improved Audio Classification Method Based on Parameter-Free Attention Combined with Self-Supervision[J]. Journal of Computer-Aided Design & Computer Graphics, 2023, 35(3): 434-440. DOI: 10.3724/SP.J.1089.2023.19353

Citation:

An Improved Audio Classification Method Based on Parameter-Free Attention Combined with Self-Supervision

Graphical Abstract

Graphical Abstract

Abstract

Abstract

The end-to-end audio classification method based on transformer is proved to be better than two-dimensional convolution in multiple scenes. In view of the current popular audio recognition method based on serialization learning transformer, which focuses on the importance of current features in time sequence, and the lack of feature description of simultaneous sequence, a method of parameter-free attention combined with self supervised feature construction is proposed to further improve audio classification. In this method, the parameter-free attention mechanism is constructed in the simultaneous order feature to fit the multi-local extreme value distribution. At the same time, in the process of model learning, the input spectrum is randomly masked in time domain and frequency domain, and self-supervision information is added to effectively learn the audio spectrum details and classification information. The experimental results using audio set, esc50 and Speech Command show that the accuracy of algorithm in this paper improves by 0.46%~1.20%, compared with the current state of the art method.

FullText(HTML)

References (29)

Cited By

Turn off MathJax

Article Contents

An Improved Audio Classification Method Based on Parameter-Free Attention Combined with Self-Supervision

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content