This website uses cookies.
We use cookies to improve your online experience. By continuing to use our website we assume you agree to the placement of these cookies.
To learn more, you can find in our Privacy Policy.
Original paper

HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection

Pages: 646 - 650
Published: Apr 27, 2022
Abstract
Audio classification is an important task of mapping audio samples into their corresponding labels. Recently, the transformer model with self-attention mechanisms has been adopted in this field. However, existing audio transformers require large GPU memories and long training time, meanwhile relying on pretrained vision models to achieve high performance, which limits the model's scalability in audio tasks. To combat these problems, we introduce...
Paper Details
Title
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Published Date
Apr 27, 2022
Pages
646 - 650
© 2025 Pluto Labs All rights reserved.
Step 1. Scroll down for details & analytics related to the paper.
Discover a range of citation analytics, paper references, a list of cited papers, and more.