Data Augmentation in Automatic Classification of Voice Quality

Authors

  • Mario Alejandro García, Doctorando Grupo de Inteligencia Artificial (GIA), Facultad Regional Córdoba, Universidad Tecnológica Nacional - Argentina
  • Eduardo Atilio Destéfanis Director

DOI:

https://doi.org/10.33414/ajea.5.748.2020

Keywords:

Vocal quality, Deep learning, Data augmentation

Abstract

The status of the thesis plan "Vocal quality assessment through deep scattering spectrum and machine learning" is presented. Three transformations are proposed in order to increase the amount of training data and reduce overfitting. These transformations perform a frequency shift, time segmentation and flipping. It results in a dataset 18 times larger than the original dataset. An experiment consisting of training a deep neural network is run to evaluate performance with the augmented data. It is concluded that the proposed transformations reduce the overfitting, improve the classification error and it could be useful for the thesis plan scope, classification of vocal quality from sustained vowels.

Downloads

Metrics

Visualizaciones del PDF
431
Oct 07 '20Oct 10 '20Oct 13 '20Oct 16 '20Oct 19 '20Oct 22 '20Oct 25 '20Oct 28 '20Oct 31 '20Nov 01 '203.0
| |

Published

2020-10-05

How to Cite

García, M. A., & Destéfanis, E. A. (2020). Data Augmentation in Automatic Classification of Voice Quality. AJEA (Proceedings of UTN Academic Conferences and Events), (5). https://doi.org/10.33414/ajea.5.748.2020