Music Genre Classification

Overview

A machine learning project comparing the effectiveness of RandomForest, K-Nearest Neighbours (KNN), and Support Vector Machine (SVM) models for music genre classification. Through systematic feature selection and model optimisation, the KNN classifier achieved 91% accuracy on the GTZAN dataset.

Model Development

Feature Selection and Optimisation

Initial model training with the full feature set
Feature importance analysis using RandomForestClassifier
Iterative feature selection based on importance rankings:
- Top 3 features: chroma_stft_mean, spec_bandwidth_mean, rolloff_mean
- Additional significant features: mfcc1_mean, mfcc2_mean, mfcc3_mean
Model retraining with optimised feature subset

Models Evaluated

K-Nearest Neighbours (KNN)
- Best performing model: 91% accuracy
- Optimised parameters through RandomSearchCV
- Robust performance across genres
Random Forest
- Used for initial feature importance analysis
- Secondary classification model
Support Vector Machine (SVM)
- Comparative baseline model
- Performance evaluation with different kernels
- Computational efficiency considerations

Results Summary

KNN achieved highest accuracy (91%) with optimised feature set
Feature reduction from original set to top performers maintained accuracy
Cross-validation scores demonstrate model stability
Detailed confusion matrix highlighting per-genre performance

Project Structure

.
├── features_3_sec.csv     # Feature set with 3-second windows
├── features_30_sec.csv    # Feature set with 30-second windows
├── music_genre_classification.ipynb    # Implementation and analysis
├── LICENCE
└── README.md

Technical Requirements

Python 3.x
scikit-learn
pandas
numpy
seaborn (visualisation)
matplotlib (visualisation)

Usage

Clone the repository:

git clone https://github.com/lukasz-iskierka/ml-music-classification.git

Install dependencies:

pip install scikit-learn pandas numpy seaborn matplotlib

Run the Jupyter notebook for detailed analysis and results:
```
jupyter notebook music_genre_classification.ipynb
```

Future Development

Ensemble method exploration
Additional feature engineering
Model optimisation for specific genre pairs
Performance optimisation for larger datasets

Licence

See LICENCE file for details.

Contact

For questions or suggestions, please open an issue in the GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
features_30_sec.csv		features_30_sec.csv
features_3_sec.csv		features_3_sec.csv
music_genre_classification.ipynb		music_genre_classification.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Music Genre Classification

Overview

Model Development

Feature Selection and Optimisation

Models Evaluated

Results Summary

Project Structure

Technical Requirements

Usage

Future Development

Licence

Contact

About

Releases

Languages

License

lukasz-iskierka/ml-music-classification

Folders and files

Latest commit

History

Repository files navigation

Music Genre Classification

Overview

Model Development

Feature Selection and Optimisation

Models Evaluated

Results Summary

Project Structure

Technical Requirements

Usage

Future Development

Licence

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Languages