Skip to content

This repository contains a Convolutional Neural Network (CNN) model for classifying urban sound events using the UrbanSound8K dataset. The model leverages deep learning techniques to accurately categorize different environmental sounds such as sirens, dog barks, and car horns.

License

Notifications You must be signed in to change notification settings

AmirHosseinSoleymani/Speech-Classification-CNN-8k-urbansound

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸŽ™οΈ Speech Classification with CNN - UrbanSound8K

πŸ“Œ Overview

This repository contains a Convolutional Neural Network (CNN) model for classifying urban sound events using the UrbanSound8K dataset. The model leverages deep learning techniques to accurately categorize different environmental sounds such as sirens, dog barks, and car horns.

πŸ“‚ Dataset - UrbanSound8K

UrbanSound8K is a dataset containing 8,732 labeled audio files across 10 sound classes:

  • Air Conditioner
  • Car Horn
  • Children Playing
  • Dog Bark
  • Drilling
  • Engine Idling
  • Gun Shot
  • Jackhammer
  • Siren
  • Street Music

πŸ”— Download: UrbanSound8K Dataset

πŸ“– Model Architecture - CNN for Audio Classification

The model utilizes a Convolutional Neural Network (CNN) with Mel-Spectrograms as input features. The key layers include:

  • Convolutional Layers (Feature Extraction)
  • Batch Normalization & Dropout (Regularization)
  • Fully Connected Layers (Classification)
  • Softmax Activation (Multi-class prediction)

πŸ”₯ Future Improvements

  • Experiment with different CNN architectures (ResNet, EfficientNet)
  • Implement attention mechanisms for better feature learning
  • Deploy as a real-time classification app

🀝 Contributing

Contributions are welcome! Feel free to open an issue or submit a pull request.

πŸ“œ License

This project is licensed under the MIT License.


🎡 Developed by [AmirHosseinSoleymani] | πŸš€ Follow for more AI projects!

About

This repository contains a Convolutional Neural Network (CNN) model for classifying urban sound events using the UrbanSound8K dataset. The model leverages deep learning techniques to accurately categorize different environmental sounds such as sirens, dog barks, and car horns.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published