This repo is the implementation of a research project aimed at enhancing Acoustic Side-Channel Attacks (ASCAs) using a novel combination of Vision Transformers (VTs) and Large Language Models (LLMs).
Abstract: Audio feature selection and neural network architecture play crucial roles in speech recognition performance. This paper presents a comparative analysis of Artificial Neural Networks (ANNs) ...
Abstract: This paper investigates the zero-shot capabilities of pre-trained large language models (LLMs) for music genre classification. The proposed approach splits audio signals into 20 ms chunks ...