Hey everybody, just wanted to update you on how the project is going. I have a basic recommendation system working now, which rates songs' similarity by how many tags they have in common. I've put the code on github, mostly just so you can look through the recommendation files if you're interested. Here is some sample output (first the similar song, then the tags they have in common).
For the song "A Different Kind of Peace":
Then the Healing Came orchestral,strings,cinematic
Shield of Legend orchestral,epic,cinematic
Justice for All orchestral,epic,cinematic
Captain of the Skies orchestral,epic,strings,cinematic
Enemy Underworld orchestral,epic,strings,cinematic
For the song "Adlehyde Castle Flow":
Insecta Robotica mellow,jazz,electric-piano
This Heart percussion,mellow,tempo-slow
Gemini Salsa percussion,jazz,tempo-slow
Holiday Frappe mellow,jazz,electric-piano
Fighting (7/8 Jazz Spiritual) mellow,jazz,electric-piano
There is still a real sparsity of tags--only 1,306 songs even have one tag, and only about half of those have more than one. My next move is to incorporate Remixer and source-game as tags, by extracting the meta-data from the MP3s. That should help, but it seems like to be successful the tag-based system needs more data. Once again, does anybody know how to see a list of all the tags in the system, not just that page with the top 70?
EDIT: forgot the github link... https://github.com/zygoth/OCR-Music-Recommendation-System/tree/master/