💻 🧠 Contact school.brainhack@gmail.com for questions! 🧠 💻

Brainbeats: Classifying Music Genre with fMRI Connectivity

By Chelsea Xuan Chen
Published on June 12, 2025

"Can we predict music genres based on fMRI connectivity patterns alone? This project explores a single-subject decoding approach using ROI-to-ROI correlation matrices and machine learning classifiers on OpenNeuro dataset ds003720."

Project definition

Background

Music evokes rich and complex responses in the brain—and I wanted to explore whether functional connectivity patterns alone could reflect the genre of music being heard. Inspired by Nakai et al. (2021), who used voxelwise patterns across five participants to decode music genre, I chose to narrow the focus to a single-subject analysis.

Due to event file inconsistencies in sub-001 to sub-004, only sub-005 contained intact and correctly labeled annotations across all six runs. My project thus became a pilot proof-of-concept: could trial-wise ROI-to-ROI correlation matrices, extracted via the Schaefer 100-ROI atlas, support genre decoding even within just one brain?

Tools

This project used:

  • Python & Jupyter Notebooks
  • nilearn for ROI time series extraction and brain visualization
  • scikit-learn for PCA and classification (SVM, KNN)
  • matplotlib & seaborn for plotting and visual diagnostics

Data

  • Dataset: OpenNeuro ds003720
    Nakai, T., Koide‑Majima, N., & Nishimoto, S. (2021). Brain and Behavior, 11(1), e01936.

  • Used only sub-005 due to complete event labels (manual inspection revealed misalignments in other subjects)

  • 6 runs, each with ~40 musical trials; each trial is a 10-second clip; 10 total genres in the dataset

  • Brain parcellation: Schaefer 100-ROI cortical atlas (98 usable ROIs after resampling)

  • Final dataset: 98 usable trials, filtered to exclude clips <2 TRs or with correlation errors

Why only sub-005? The other subjects had corrupted or mismatched event files that broke alignment between music clips and fMRI data.

Deliverables

  • Trial-wise ROI-to-ROI correlation matrices
  • Atlas and correlation matrix visualizations
  • PCA dimensionality reduction pipeline
  • Genre classification with SVM and KNN
  • Final presentation slide deck (≤15 minutes)

Results

Progress overview

  • Extracted time-series from 98 ROIs using NiftiLabelsMasker in nilearn
  • Computed ROI-to-ROI Pearson correlations → 100×100 matrix → flattened to 4950D feature vector
  • Applied PCA (n=20) for dimensionality reduction
  • Trained classifiers (SVM, KNN) for multiclass and binary decoding

Tools I learned during this project

  • nilearn atlas-based ROI extraction
  • Correlation matrix construction + vectorization
  • PCA feature compression
  • Classification using SVC, KNeighborsClassifier, and cross-validation
  • Confusion matrix and performance visualization

Deliverable 1: Multiclass genre classification

  • SVM + PCA yielded ~13% accuracy (above 10% random baseline for 10 genres)
  • Confusion matrix showed some genre-specific signal, particularly for Reggae (not shown in final binary model)

Deliverable 2: Binary genre decoding (Pop vs. Metal)

  • SVM + PCA achieved ~68% accuracy (above chance), though KNN was too unstable to evaluate
  • Performance varied across runs, likely due to short duration and low trial count

Deliverable 3: Visualizations

  • Schaefer 100-ROI brain parcellation plotted to validate resampling and mask quality
  • Average correlation matrix showed strong within-region signals (clean diagonals)

Schaefer Atlas

Schaefer Atlas

ROI Correlation Matrix

ROI Correlation

PCA Confmax

PCA Confmax


Reflection

This pilot study suggests that even within a single brain, genre-relevant information can be gleaned from ROI connectivity patterns. But limitations remain:

  • Generalizability limited by sample size (n=1)
  • Low trial count vs. high feature dimension = overfitting risk
  • Classifier performance unstable across reruns

Future directions

  • Use more participants (once metadata is cleaned!)
  • Try higher-resolution atlases (e.g., Schaefer 200-ROI)
  • Experiment with sliding window connectivity or graph-based features
  • Combine with music theory-based audio features (e.g., tempo, tonality) for hybrid modeling
  • Explore feature importance: which ROI-pairs drive classification?

See also these similar projects

Functional Brain Activation During a Memory Encoding and Retrieval Task: Discovering Tools and Techniques for Analysis

This project aimed to explore tools and techniques used to analyze fMRI brain activation at the first level using data from an open-access dataset. We …

Discover this project

Predicting Neuroticism and Personality Traits from fMRI Data

Are neuropsychiatric disorders extreme cases of connectivity patterns that are found in the overall population? Using personality traits as a measure …

Discover this project

Workflow of resting state connectivity from a raw dataset to longitudinal visualization

To aim of this project was to provide a full neuroimaging workflow from preprocessing of raw data to visualisation of results, to explore longitudinal …

Discover this project