Skip to content

Python sound classification using a mel spectogram and a convolutional neural network (CNN)

Notifications You must be signed in to change notification settings

FrozenAssassine/SoundClassification-CNN

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

🔊 Sound Classification

Simple sound classification in Python using a Convolutional Neural Network (CNN).

🚀 How it works

Audio files are converted into NumPy arrays using Mel-spectrogram conversion.
This process turns the raw audio data into a spectrogram image, capturing frequency vs time.
The spectrogram is then used as input for a CNN model, which learns to classify different types of sounds.

🧰 Features

  • Convert audio files into spectrograms using the MEL scale
  • Feed image-like data into a CNN for training and prediction
  • Easily customizable for different sound classification tasks

🛠 Requirements

  • Python 3.8+
  • TensorFlow / Keras
  • Librosa
  • NumPy
  • Matplotlib (optional for spectrogram visualization)

Running

To train the model, use the train.py script:

python train.py

To use the model use the test.py script:

python test.py

Training

image image

About

Python sound classification using a mel spectogram and a convolutional neural network (CNN)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages