Data Augmentation in Automatic Classification of Voice Quality

Authors

  • Mario Alejandro García, Doctorando Grupo de Inteligencia Artificial (GIA), Facultad Regional Córdoba, Universidad Tecnológica Nacional - Argentina
  • Eduardo Atilio Destéfanis Director

DOI:

https://doi.org/10.33414/ajea.5.748.2020

Keywords:

Vocal quality, Deep learning, Data augmentation

Abstract

The status of the thesis plan "Vocal quality assessment through deep scattering spectrum and machine learning" is presented. Three transformations are proposed in order to increase the amount of training data and reduce overfitting. These transformations perform a frequency shift, time segmentation and flipping. It results in a dataset 18 times larger than the original dataset. An experiment consisting of training a deep neural network is run to evaluate performance with the augmented data. It is concluded that the proposed transformations reduce the overfitting, improve the classification error and it could be useful for the thesis plan scope, classification of vocal quality from sustained vowels.

Downloads

Download data is not yet available.

Published

2020-10-05

How to Cite

García, M. A., & Destéfanis, E. A. (2020). Data Augmentation in Automatic Classification of Voice Quality. AJEA (Proceedings of UTN Academic Conferences and Events), (5). https://doi.org/10.33414/ajea.5.748.2020