GitHub - PriyankaSett/movie_recommendation: This project builds a 'Movie Recommendation System', using the tmdb5000 dataset. This project uses nltk library for the text analysis and Streamlit for deployment.

Movie Recommendation System using NLTK library

In this project the TMDB data set (https://www.kaggle.com/datasets/tmdb/tmdb-movie-metadata) has been used to build a movie recommendation system.

'CountVectorizer' has been used to perform the Text to Vector transformation.

This project is deployed using 'Streamlit'.

unzip movie_recommendation.zip

cd movie_recommendation

Create a local enviornment. I have used python version 3.8. One can use higher version too.

conda create -p venv python==3.8 -y

conda activate venv/

mkdir artifacts

pip install -r requirements.txt

Now you can run the notebook file - 'notebooks/movie_reco.ipynb'.
This will generate the 'movie_data.pkl' which is the input list of names of movies and 'cosine_similarity.pkl' which stores the cosine similarity matrix.
These above two files will be needed when we run 'app.py' using 'Streamlit'.
To run 'app.py' -

streamlit run app.py

This will open a browser at localhost where you will be able to get the recommendations.

Here is a glimpse of the output.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
notebooks		notebooks
src		src
README.md		README.md
app.py		app.py
reco1.png		reco1.png
reco2.png		reco2.png
reco3.png		reco3.png
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback