Audio Spectrogram Transformers in The Digital Realm
Basic use of AI/LLMs to visualize research papers, with prompts
WTF is an Audio Spectrogram Transformer?
First of all, the seed of my tree of thoughts:
AST: Audio Spectrogram Transformer (https://arxiv.org/pdf/2104.01778.pdf)
Yuan Gong, Yu-An Chung, James Glass, MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA
Prompt: “usage of AST: Audio Spectrogram Transformer”
A spectrogram is a visual representation of the spectrum of frequencies in a sound or other signal as they vary with time.
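To make that definition concrete, here is a minimal sketch of how a spectrogram can be computed with NumPy alone. The function name `spectrogram`, the frame length, the hop size, and the 1 kHz test tone are all assumptions chosen for illustration; this is a plain magnitude spectrogram, not the exact preprocessing used in the AST paper.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram, shape (n_freq_bins, n_frames).

    Hypothetical helper: slices the signal into overlapping frames,
    applies a Hann window, and takes the FFT magnitude of each frame.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # rfft keeps only the non-negative frequencies of a real signal
    return np.abs(np.fft.rfft(frames, axis=1)).T

# Illustrative input: one second of a 1 kHz sine sampled at 8 kHz
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

spec = spectrogram(tone)

# Map FFT bins back to frequencies in Hz and find the strongest one
freqs = np.fft.rfftfreq(256, d=1 / sample_rate)
peak_hz = freqs[spec.mean(axis=1).argmax()]

print(spec.shape)  # (frequency bins, time frames)
print(peak_hz)     # strongest frequency, in Hz
```

Each column of `spec` is one moment in time and each row is one frequency band, so plotting it as an image gives the familiar time-versus-frequency picture. A model like AST treats that image as its input.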
For Visual Learners ♥
vGPT4: How AST Works
For Our Children ♥
The Listener