Compute the GeMAPS openSMILE feature set
GeMAPS.Rd
This function applies the "The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing" (the Geneva Minimalistic Standard Parameter Set, GeMAPS v0.1b) (Eyben et al. 2015) to a portion of a recording.
Arguments
- listOfFiles
The full path to the sound file.
- beginTime
The starting time of the section of the sound files that should be analysed.
- endTime
The end time of the section of the sound files that should be analysed.
- explicitExt
The file extension of the slice file where the results should be stored.
Value
A list of 62 acoustic values, with the names as reported by openSMILE. The minimalistic acoustic parameter set contains the following compact set of 18 low-level descriptors (LLD), sorted by parameter groups:
Frequency related parameters:
Pitch, logarithmic f0 on a semitone frequency scale,starting at 27.5 Hz (semitone 0).
Jitter, deviations in individual consecutive f0 period lengths.
Formant 1, 2, and 3 frequency, centre frequency of first, second, and third formant
Formant 1, bandwidth of first formant.Energy/Amplitude related parameters:
Shimmer, difference of the peak amplitudes of consecutive f0 periods.
Loudness, estimate of perceived signal intensity from an auditory spectrum.
Harmonics-to-noise ratio (HNR), relation of energy in harmonic components to energy in noise-like components.
Spectral (balance) parameters:
Alpha Ratio, ratio of the summed energy from50-1000 Hz and 1-5 kHz
Hammarberg Index, ratio of the strongest energy peak in the 0-2 kHz region to the strongest peak in the 2–5 kHz region.
Spectral Slope 0-500 Hz and 500-1500 Hz, linear regression slope of the logarithmic power spectrum within the two given bands.
Formant 1, 2, and 3 relative energy, as well as the ratio of the energy of the spectral harmonic peak at the first, second, third formant’s centre frequency to the energy of the spectral peak atF0.
Harmonic difference H1-H2, ratio of energy of the first f0 harmonic (H1) to the energy of the second f0 harmonic (H2).
Harmonic difference H1-A3, ratio of energy of the first f0harmonic (H1) to the energy of the highest harmonic in the third formant range (A3).
which are analysed in terms of mean and coefficient of variation, as well as 20th, median (50th), and 80th percentile (pitch and loudness), and the arithmetic mean of the Alpha Ratio, the Hammarberg Index, and the spectral slopes from 0-500 Hz and 500-1500 Hz over all unvoiced segments.
Temporal features:
the rate of loudness peaks, i.e., the number of loudness peaks per second,
the mean length and the standard deviation of continuously voiced regions(f0>0),
the mean length and the standard deviation of unvoiced regions (f0 == 0; approximating pauses),
the number of continuous voiced regions per second(pseudo syllable rate).
Please consult the (Eyben et al. 2015) for a description of the features.
Details
The GeMAPS feature set consists of of 62 static acoustic features resulting from the computation of various functionals over low-level descriptor features, and is applied by this function using the openSMILE. (Eyben et al. 2010; Jaimes et al. 2013) acoustic feature extraction library.
References
Eyben F, Scherer KR, Schuller BW, Sundberg J, Andre E, Busso C, Devillers LY, Epps J, Laukka P, Narayanan SS, Truong KP (2015).
“The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.”
IEEE Transactions on Affective Computing, 7(2), 190--202.
ISSN 1949-3045, doi:10.1109/taffc.2015.2457417
.
Eyben F, Wöllmer M, Schuller B (2010).
Opensmile: the munich versatile and fast open-source audio feature extractor, the international conference.
ACM.
ISBN 978-1-60558-933-6, doi:10.1145/1873951.1874246
, http://dl.acm.org/citation.cfm?id=1874246.
Jaimes A(, Sebe N, Boujemaa N, Gatica-Perez D, Shamma DA, Worring M, Zimmermann R, Eyben F, Weninger F, Gross F, Schuller B (2013).
“Recent developments in openSMILE, the munich open-source multimedia feature extractor.”
Proceedings of the 21st ACM international conference on Multimedia, 835--838.
doi:10.1145/2502081.2502224
.