


15th workshop gRFIA

9th Music Encoding Conference (MEI 2021)


Alicante, July 19-23

13th international workshop on Machine Learning and Music


(online) September 18, 2020


Aligned audio-symbolic flute corpus
  • Description: Labeled flute corpora for audio-to-score music transcription.
  • Size: 28 minutes of recordings with 2246 manually annotated notes.
  • Date: 4-9-2018
  • Download
MASATI: MAritime SATellite Imagery dataset
  • Description: Maritime scenes in optical aerial images from the visible spectrum.
  • Date: 14-4-2019
  • Download
Capitan dataset
  • Description: Data collection from Early music manuscripts [ICDAR 2017]
  • Date: 2-10-2017
  • Download
Isolated handwritten music symbols
  • Description: Four corpora of isolated handwritten music symbols [ICDAR 2017]
  • Date: 28-7-2017
  • Download
MirBot
  • Description: Dataset containing photographs and metadata gathered from smartphones for object recognition tasks.
  • Date: continuously updated
  • More info: http://www.mirbot.com/research
Bimodal music symbols from Early notation
  • Description: Corpus collected with an electronic pen while tracing isolated music symbols from Early manuscripts. The dataset contains both the sequence followed by the pen and the patch of the source image under the tracing.
  • Size: 10230 symbols
  • Date: 2016
  • Download
HOMUS (Handwritten Online Music Symbols)
  • Description: Corpus of handwritten music symbols drawn by an electronic pen. It contains data from 100 different musicians spread over 32 classes.
  • Size: 15200 symbols
  • Date: 2014
  • More info
Bach vs. Shostakovich (BvS)
  • Description: Corpus of fugues from Bach and Shostakovich in MIDI format, originally used for composer recognition.
  • Size: 59 MIDI files
  • Date: 2013
  • More info
Fugues (Bach, Krebs, & Kellner)
  • Description: Corpus of fugues from J.S. Bach, W.F. Bach, J.L. Krebs, J.P. Kellner, and some disputed fugues originally attributed to J.S. Bach.
  • Size: 39 MIDI files
  • Date: 2013
  • More info
9GDB (9 genres database)
  • Description: Corpus of chord progressions from nine different genres taken from three "domains": popular, jazz, and academic music.
  • Size: 856 pieces
  • Date: 2009
  • More info
ODB (onset detection database)
  • Description: ODB is an onset detection test database built using a set of real recordings.
  • Size: 19 real recordings in WAV format, with their onset positions in text format.
  • Date: 2009
  • More info
JvC (music genre recognition)
  • Description: This corpus contains melodies from jazz and classical music pieces encoded as MIDI file tracks.
  • Size: 150 MIDI files
  • Date: 2009
  • More info
Melody track recognition
  • Description: This corpus contains samples in Weka ARFF format extracted from multitrack MIDI files. Each track is described as a vector of 34 features, stored together with the class label and four other tags (the first four attributes are metadata, and the last one is the boolean label 'IsMelody').
  • Size: 3140 samples
  • Date: 2009
  • More info
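The row layout described for the melody track corpus (four metadata attributes first, then the feature vector, then the 'IsMelody' label last) can be sketched as below. This is a minimal illustration only, not the corpus's official loader: the attribute names in `SAMPLE`, the two-feature vector (the real files have 34 features), and the `true`/`false` label encoding are all assumptions made for the example. The parser handles only simple dense ARFF with attribute names that contain no spaces.

```python
import csv

def split_arff_rows(arff_text, n_meta=4):
    """Parse simple dense ARFF text and split each data row into
    (metadata dict, feature list, is_melody flag).

    Assumes: the first n_meta attributes are metadata, the last attribute
    is the boolean label, and everything in between is numeric.
    """
    names, rows, in_data = [], [], False
    for raw in arff_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blanks and ARFF comments
            continue
        if not in_data:
            lower = line.lower()
            if lower.startswith("@attribute"):
                # Second token is the attribute name (no spaces assumed).
                names.append(line.split()[1].strip("'\""))
            elif lower.startswith("@data"):
                in_data = True
            continue
        # ARFF strings may be single-quoted; let csv handle the quoting.
        values = next(csv.reader([line], quotechar="'", skipinitialspace=True))
        meta = dict(zip(names[:n_meta], values[:n_meta]))
        features = [float(v) for v in values[n_meta:-1]]
        # Label encoding is an assumption; adjust to the actual files.
        is_melody = values[-1].strip().lower() in ("true", "yes", "1")
        rows.append((meta, features, is_melody))
    return names, rows

# Hypothetical miniature file: 4 metadata tags, 2 features, 1 label.
SAMPLE = """\
@relation tracks
@attribute file string
@attribute track numeric
@attribute name string
@attribute genre string
@attribute f1 numeric
@attribute f2 numeric
@attribute IsMelody {true,false}
@data
'a.mid',1,'Piano','jazz',0.5,12,true
'a.mid',2,'Bass','jazz',0.1,3,false
"""

names, rows = split_arff_rows(SAMPLE)
for meta, features, is_melody in rows:
    print(meta["file"], meta["track"], features, is_melody)
```

Keeping the metadata separate from the feature vector makes it harder to feed file or track identifiers to a classifier by accident.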
External datasets: