Audio Spectrogram Transformers in The Metaverse

Basic usage of AI/LLM to visualize research papers — with Prompts

Romesh Niriella

--

WTF is a Audio Spectrogram Transformer?

First of all the seed of my tree of thoughts:

AST: Audio Spectrogram Transformer(https://arxiv.org/pdf/2104.01778.pdf)

Yuan Gong, Yu-An Chung, James Glass MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA

--

--

Romesh Niriella

{ 🇱🇰 | 🇦🇺 } — Ǟutomation, Дeep Space, €rypto