Volume 2025, Issue 1 9856294
Corrigendum
Open Access

Corrigendum to “Harmonic Differences Method for Robust Fundamental Frequency Detection in Wideband and Narrowband Speech Signals”

First published: 10 June 2025

C. Parlak, Y. Altun, “Harmonic Differences Method for Robust Fundamental Frequency Detection in Wideband and Narrowband Speech Signals, (2021): 1–17,” https://doi.org/10.1155/2021/6658951.

In reference 96, there is a typographical error, where “algoritm” was spelled incorrectly. This has been changed to “algorithm”, and the corrected reference is shown below:

[96] H. Yedla, R. R. Kishore, and M. N. Yadav, “Hybrid high noise resiliency pitch detection algorithm,” International Journal of Current Engineering and Scientific Research (IJCESR), vol. 3, pp. 1–4, 2015.

In reference 113, there is a typographical error, where “Insterspeech” was spelled incorrectly. This has been changed to “Interspeech”, and the correct reference is shown below:

[113] L. Ardaillon and A. Roebel, “Fully-convolutional network for pitch estimation of speech signals,” in Proceedings of the Interspeech 2019, Graz, Austria, 2019.

In reference 156, the year “2007” is incorrect. The correct year is “2008”, and the correct reference is shown below:

[156] J. Benesty, M. M. Sondhi, and Y. Huang, Springer Handbook of Speech Processing, 2008, Springer, Berlin, Germany.

In the first paragraph of Section 2.4. Datasets, the in-text citation for reference 152 is missing. The correct text is shown below:

2.4. Datasets

As usual with the other experiments, creation and use of proper datasets is essential in pitch tracking experiments. There are numerous datasets used in this field and among them Keele Studio, PTDB-TUG [135], Keele Telephone [136], TIMIT [137], NTIMIT [138], CSTR [139], FARSDAT [140], Mocha TIMIT [141], RWC Music Database [142], MedleyDB [143], Vowel-CVC [144], NOIZEUS [145], and SPEECON [146] datasets. Some authors used CMU ARCTIC, KED TIMIT [147], APLAWD [148], BACH10 [149], SyncRWC60, Saarland Music Data [150], Mazurka [151], and MIREX Dataset [152]. Vowel datasets can also be used for pitch determination. Because f0 tracking is specifically important in music transcription, many music datasets are available in ISMIR (https://www.ismir.net/resources/) web pages.

In Table 1, there was an error in the data Texas row. 972 should display in the Boy And Girl columns and 1232 should appear under the Male column. The corrected Table 1 is shown below:

Table 1. Datasets used in this work.
Boy Girl Male Female
Hillenbrand 324 228 540 576
Texas 972 1232 1110
TIMIT 54,357 24,017

The caption for Figure 2 has been updated to include permissions information. The correct caption is shown below:

Figure 2: Laryngogram and differentiated laryngogram of an example voiced speech signal [156]. Reproduced from: W. J. Hess, Pitch, and Voicing Determination of Speech with an Extension Toward Music Signals, in Springer Handbook of Speech Processing, p. 198, Springer, with permission from Springer Nature. Not covered by the article’s Creative Commons license.

We apologize for these errors.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.