Corrigendum to “Harmonic Differences Method for Robust Fundamental Frequency Detection in Wideband and Narrowband Speech Signals”
C. Parlak, Y. Altun, “Harmonic Differences Method for Robust Fundamental Frequency Detection in Wideband and Narrowband Speech Signals, (2021): 1–17,” https://doi.org/10.1155/2021/6658951.
In reference 96, there is a typographical error, where “algoritm” was spelled incorrectly. This has been changed to “algorithm”, and the corrected reference is shown below:
[96] H. Yedla, R. R. Kishore, and M. N. Yadav, “Hybrid high noise resiliency pitch detection algorithm,” International Journal of Current Engineering and Scientific Research (IJCESR), vol. 3, pp. 1–4, 2015.
In reference 113, there is a typographical error, where “Insterspeech” was spelled incorrectly. This has been changed to “Interspeech”, and the correct reference is shown below:
[113] L. Ardaillon and A. Roebel, “Fully-convolutional network for pitch estimation of speech signals,” in Proceedings of the Interspeech 2019, Graz, Austria, 2019.
In reference 156, the year “2007” is incorrect. The correct year is “2008”, and the correct reference is shown below:
[156] J. Benesty, M. M. Sondhi, and Y. Huang, Springer Handbook of Speech Processing, 2008, Springer, Berlin, Germany.
In the first paragraph of Section 2.4. Datasets, the in-text citation for reference 152 is missing. The correct text is shown below:
2.4. Datasets
As usual with the other experiments, creation and use of proper datasets is essential in pitch tracking experiments. There are numerous datasets used in this field and among them Keele Studio, PTDB-TUG [135], Keele Telephone [136], TIMIT [137], NTIMIT [138], CSTR [139], FARSDAT [140], Mocha TIMIT [141], RWC Music Database [142], MedleyDB [143], Vowel-CVC [144], NOIZEUS [145], and SPEECON [146] datasets. Some authors used CMU ARCTIC, KED TIMIT [147], APLAWD [148], BACH10 [149], SyncRWC60, Saarland Music Data [150], Mazurka [151], and MIREX Dataset [152]. Vowel datasets can also be used for pitch determination. Because f0 tracking is specifically important in music transcription, many music datasets are available in ISMIR (https://www.ismir.net/resources/) web pages.
In Table 1, there was an error in the data Texas row. 972 should display in the Boy And Girl columns and 1232 should appear under the Male column. The corrected Table 1 is shown below:
Boy | Girl | Male | Female | |
---|---|---|---|---|
Hillenbrand | 324 | 228 | 540 | 576 |
Texas | 972 | 1232 | 1110 | |
TIMIT | — | 54,357 | 24,017 |
The caption for Figure 2 has been updated to include permissions information. The correct caption is shown below:
Figure 2: Laryngogram and differentiated laryngogram of an example voiced speech signal [156]. Reproduced from: W. J. Hess, Pitch, and Voicing Determination of Speech with an Extension Toward Music Signals, in Springer Handbook of Speech Processing, p. 198, Springer, with permission from Springer Nature. Not covered by the article’s Creative Commons license.
We apologize for these errors.