PERSA+: A Deep Learning Front-End for Context-Agnostic Audio Classification

Lazaros Vrysis, Iordanis Thoidis, Charalampos Dimoulas, George Papanikolaou

Deep learning has been applied to diverse audio semantics tasks, enabling the construction of models that learn hierarchical levels of features from high-dimensional raw data and deliver state-of-the-art performance. But do these algorithms perform similarly in real-world conditions, or only on benchmarks, where their high learning capacity allows them to memorize the employed datasets completely? This work presents a deep learning front-end that aims to discard detrimental information before it enters the modeling stage, so that the learning process focuses on the relevant content, with a view to developing robust and context-agnostic classification algorithms.
