GithubHelp home page GithubHelp logo

standardgalactic / mastmelody_dataset Goto Github PK

View Code? Open in Web Editor NEW

This project forked from barisbozkurt/mastmelody_dataset

0.0 1.0 0.0 84.56 MB

A dataset of pitch curves for music performance assessment

License: Other

Julia 100.00%

mastmelody_dataset's Introduction

License: CC BY-NC-SA 4.0

MASTmelody_dataset

This repository contains a dataset of f0-series data computed from signing and piano samples and a baseline system for assessment (comparing a singing sample to a reference piano recording), both of which are described in the scientific paper:

Bozkurt, B., Baysal, O., Yuret, D. A Dataset and Baseline System for Singing Voice Assessment, 13th Int. Symposium on Computer Music Multidisciplinary Research, Porto, Sept. 25-28, 2017.

@inproceedings{inproceedings,
  author={Bozkurt, B., Baysal, O., Yuret, D.},
  title={A Dataset and Baseline System for Singing Voice Assessment},
  year={2017},
  booktitle={13th Int. Symposium on Computer Music Multidisciplinary Research, CMMR 2017}
}

Please cite the publication if you use this dataset and/or the baseline system in your work.

The directory structure is:

  • baseline: Code for the baseline model. See baseline/README.md for documentation.
  • f0data: MASTmelody dataset.
  • wavSamples: Audio samples.

Introduction

The MASTmelody dataset is designed and shared to facilitate comparison of algorithms in the field of automatic music performance assessment.

The dataset includes pitch (f0) data extracted from audio data recorded during conservatory entrance examinations. Audio data could not be included due to the difficulties involved in completely anonymizing audio files (recognition of the singer is possible via listening). Only a few samples are provided in the 'wavSamples' directory.

There are two broad categories for the data: f0 data of the reference recording files (melodies played on the piano as reference) and f0 data of the performance recording files (recordings of the candidate in singing the melodies).

Each recording was subject to f0-detection using a variant of Melodia Melody Extraction tool. The results were saved as text files containing two columns: time-stamp and estimated f0 information in Hz.

The performances have been graded by three jury members, who are teaching staff members of the conservatory. Grades are binary: pass, fail. This dataset includes only the samples for which all jury members agreed in grading with the same score. Hence, there are basically two categories for the performance files: i) performances which was graded as 'fail' by all the jury members , ii) performances which was graded as 'pass' by all the jury members.

Naming convention:

The dataset is just composed of a list of text files. All other information is coded in the file names: 'ref': reference recording on the piano 'per': performance recording 'fail': performance graded as 'fail' 'pass': performance graded as 'pass'

There are basically 40 distinct melodies performed. The id for the melody makes up the first part of the file name. Examples:

'51_mel1_per101559_fail.f0s.txt': Melody with ID: '51_mel1' and this is a performance file graded as fail

'55_mel2_ref280758.f0s.txt': Melody with ID: '55_mel2' and this is a reference file

License

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Acknowledgement

This dataset has been curated within the TUBITAK (The Scientific and Technological Research Council of Turkey) funded research project 1001-215K017 targeting development of automatic assessment tools for music performances.

mastmelody_dataset's People

Contributors

barisbozkurt avatar denizyuret avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.