🎙️ Speech Classification with CNN - UrbanSound8K

📌 Overview

This repository contains a Convolutional Neural Network (CNN) model for classifying urban sound events using the UrbanSound8K dataset. The model leverages deep learning techniques to accurately categorize different environmental sounds such as sirens, dog barks, and car horns.

📂 Dataset - UrbanSound8K

UrbanSound8K is a dataset containing 8,732 labeled audio files across 10 sound classes:

Air Conditioner
Car Horn
Children Playing
Dog Bark
Drilling
Engine Idling
Gun Shot
Jackhammer
Siren
Street Music

🔗 Download: UrbanSound8K Dataset

📖 Model Architecture - CNN for Audio Classification

The model utilizes a Convolutional Neural Network (CNN) with Mel-Spectrograms as input features. The key layers include:

Convolutional Layers (Feature Extraction)
Batch Normalization & Dropout (Regularization)
Fully Connected Layers (Classification)
Softmax Activation (Multi-class prediction)

🔥 Future Improvements

Experiment with different CNN architectures (ResNet, EfficientNet)
Implement attention mechanisms for better feature learning
Deploy as a real-time classification app

🤝 Contributing

Contributions are welcome! Feel free to open an issue or submit a pull request.

📜 License

This project is licensed under the MIT License.

🎵 Developed by [AmirHosseinSoleymani] | 🚀 Follow for more AI projects!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
speech-classification-cnn-urbansound.ipynb		speech-classification-cnn-urbansound.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎙️ Speech Classification with CNN - UrbanSound8K

📌 Overview

📂 Dataset - UrbanSound8K

📖 Model Architecture - CNN for Audio Classification

🔥 Future Improvements

🤝 Contributing

📜 License

About

Uh oh!

Releases

Packages

Languages

License

AmirHosseinSoleymani/Speech-Classification-CNN-8k-urbansound

Folders and files

Latest commit

History

Repository files navigation

🎙️ Speech Classification with CNN - UrbanSound8K

📌 Overview

📂 Dataset - UrbanSound8K

📖 Model Architecture - CNN for Audio Classification

🔥 Future Improvements

🤝 Contributing

📜 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages