PopMNet: Generating structured pop music melodies using neural networks

作者:

摘要

Recently, many deep learning models have been proposed to generate symbolic melodies. However, generating pop music melodies with well organized structures remains to be challenging. In this paper, we present a melody structure-based model called PopMNet to generate structured pop music melodies. The melody structure is defined by pairwise relations, specifically, repetition and sequence, between all bars in a melody. PopMNet consists of a Convolutional Neural Network (CNN)-based Structure Generation Net (SGN) and a Recurrent Neural Network (RNN)-based Melody Generation Net (MGN). The former generates melody structures and the latter generates melodies conditioned on the structures and chord progressions. The proposed model is compared with four existing models AttentionRNN, LookbackRNN, MidiNet and Music Transformer. The results indicate that the melodies generated by our model contain much clearer structures compared to those generated by other models, as confirmed by human behavior experiments.

论文关键词:Melody generation,Melody structure,Artificial neural network,Generative adversarial network,LSTM

论文评审过程:Received 19 August 2019, Revised 12 May 2020, Accepted 15 May 2020, Available online 26 May 2020, Version of Record 2 June 2020.

论文官网地址:https://doi.org/10.1016/j.artint.2020.103303