Search:
Computing and Library Services - delivering an inspiring information environment

Minimising latency of pitch detection algorithms for live vocals on low-cost hardware

Firth, Matthew (2016) Minimising latency of pitch detection algorithms for live vocals on low-cost hardware. Fields: journal of Huddersfield student research, 2 (1). e25. ISSN 2057-0163

[img]
Preview
PDF - Published Version
Available under License Creative Commons Attribution.

Download (986kB) | Preview

Abstract

A pitch estimation device was proposed for live vocals to output appropriate pitch data through the musical instrument digital interface (MIDI). The intention was to ideally achieve unnoticeable latency while maintaining estimation accuracy. The projected target platform was low-cost, standalone hardware based around a microcontroller such as the Microchip PIC series. This study investigated, optimised and compared the performance of suitable algorithms for this application.

Performance was determined by two key factors: accuracy and latency. Many papers have been published over the past six decades assessing and comparing the accuracy of pitch detection algorithms on various signals, including vocals. However, very little information is available concerning the latency of pitch detection algorithms and methods with which this can be minimised. Real-time audio introduces a further latency challenge that is sparsely studied, minimising the length of sampled audio required by the algorithms in order to reduce overall total latency.

Thorough testing was undertaken in order to determine the best-performing algorithm and optimal parameter combination. Software modifications were implemented to facilitate accurate, repeatable, automated testing in order to build a comprehensive set of results encompassing a wide range of test conditions.

The results revealed that the infinite-peak-clipping autocorrelation function (IACF) performed better than the other autocorrelation functions tested and also identified ideal parameter values or value ranges to provide the optimal latency/accuracy balance.

Although the results were encouraging, testing highlighted some fundamental issues with vocal pitch detection. Potential solutions are proposed for further development.

Item Type: Article
Subjects: T Technology > T Technology (General)
Depositing User: Cherry Edmunds
Date Deposited: 15 Jan 2016 12:41
Last Modified: 18 Jan 2016 20:43
URI: http://eprints.hud.ac.uk/id/eprint/26731

Downloads

Downloads per month over past year

Repository Staff Only: item control page

View Item View Item

University of Huddersfield, Queensgate, Huddersfield, HD1 3DH Copyright and Disclaimer All rights reserved ©