Publications

Calvo-Zaragoza, J.; Valero-Mas, J.J.; Pertusa, A.
"End-To-End Optical Music Recognition using Neural Networks"
Proc. of International Society for Music Information Retrieval Conference (ISMIR), Suzhou, China (2017)
Abstract:
This work addresses the Optical Music Recognition (OMR) task in an end-to-end fashion using neural networks. The proposed architecture is based on a Recurrent Convolutional Neural Network topology that takes as input an image of a monophonic score and retrieves a sequence of music symbols as output. In the first stage, a series of convolutional filters are trained to extract meaningful features of the input image, and then a recurrent block models the sequential nature of music. The system is trained using a Connectionist Temporal Classification loss function, which avoids the need for a frame-by-frame alignment between the image and the ground-truth music symbols. Experimentation has been carried out on a set of 90,000 synthetic monophonic music scores with more than 50 different possible labels. Results obtained depict classification error rates around 2% at symbol level, thus proving the potential of the proposed end-to-end architecture for OMR. The source code, dataset, and trained models are publicly released for reproducible research and future comparison purposes.
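
To illustrate why a Connectionist Temporal Classification output needs no frame-by-frame alignment, the following is a minimal sketch of best-path (greedy) CTC decoding together with a symbol-level error rate. It is not the authors' released code. The function names, the choice of index 0 for the blank label, and the use of greedy decoding are assumptions made for illustration; the abstract does not state how the network output is decoded.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank label (hypothetical convention)


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]],
                      blank: int = BLANK) -> List[int]:
    """Best-path decoding: arg-max label per frame, merge repeats, drop blanks."""
    decoded: List[int] = []
    previous = blank
    for scores in frame_scores:
        best = max(range(len(scores)), key=lambda k: scores[k])
        # Emit a label only when it differs from the previous frame's label
        # and is not the blank; a blank between two equal labels keeps both.
        if best != previous and best != blank:
            decoded.append(best)
        previous = best
    return decoded


def edit_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Levenshtein distance between two label sequences (single-row DP)."""
    row = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        diag, row[0] = row[0], i
        for j, y in enumerate(b, start=1):
            diag, row[j] = row[j], min(row[j] + 1,       # deletion
                                       row[j - 1] + 1,   # insertion
                                       diag + (x != y))  # substitution or match
    return row[-1]


def symbol_error_rate(predictions: Sequence[Sequence[int]],
                      references: Sequence[Sequence[int]]) -> float:
    """Total edit distance divided by the total number of reference symbols."""
    errors = sum(edit_distance(p, r) for p, r in zip(predictions, references))
    total = sum(len(r) for r in references)
    return errors / total


# Seven frames of per-label scores over three labels (0 is the assumed blank).
frames = [
    [0.1, 0.8, 0.1], [0.1, 0.7, 0.2], [0.9, 0.05, 0.05], [0.2, 0.7, 0.1],
    [0.1, 0.1, 0.8], [0.1, 0.2, 0.7], [0.8, 0.1, 0.1],
]
decoded = ctc_greedy_decode(frames)
```

Here the seven frames collapse to a three-symbol sequence without any per-frame ground truth, and `symbol_error_rate` compares decoded sequences against reference sequences at the symbol level, the granularity at which the abstract reports its error rate.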

@inproceedings{calvozaragoza2017endtoend,
  author    = "Calvo-Zaragoza, J. and Valero-Mas, J.J. and Pertusa, A.",
  title     = "End-To-End Optical Music Recognition using Neural Networks",
  address   = "Suzhou, China",
  booktitle = "Proc. of International Society for Music Information Retrieval Conference (ISMIR)",
  month     = "October",
  year      = "2017"
}

Resources associated with this publication