Predict attributes from product image

Predicting attributes from supplier uploaded images as part of cataloging is extremely crucial for any E-commerce platform. Typically, when suppliers upload their products for listing on the marketplace, they are required to fill in various details corresponding to product attributes (e.g., color, pattern, sleeve). This process often results in incorrect or incomplete attribute information.

Dataset Description

Training Dataset (train.csv)
- Contains product information, including Category and 10 attribute columns (attr_1 to attr_10).
- Some attribute columns have missing values.
- Example columns: id, Category, len, attr_1, ..., attr_10.
Category Attributes (category_attributes.parquet)
- Metadata on the number and type of attributes for each category.
Testing Dataset (test.csv)
- Similar to the training dataset but without attribute labels.
Image Features (train_features_26k.npy, test_features_30500.npy)
- Pre-extracted image features using a Vision Transformer (ViT).

Setup and Data Preparation

Install Dependencies
Ensure you have the following libraries installed:
- pandas, numpy, tensorflow, scikit-learn, pyarrow.
Load and Inspect Data
- Import training, testing, and category attributes datasets.
- Handle missing values by filling them with a placeholder ('dummy_value').
Filter for Specific Categories
- Product categories: "Men Tshirts", "Sarees", "Kurtis", "Women Tshirts" and "Women Tops & Tunics"
Reshape Image Features
- Flatten the image features from (32, 768) to (768) per sample.
Encode Attribute Labels
- Use LabelEncoder to encode attribute columns.
- Convert the encoded labels into one-hot vectors for training.

Model Architecture

The model combines image features and tabular data for predictions:

Input Layers:
- Image Features: (768,)
- Tabular Data: (2,)
Hidden Layers:
- Dense layers for image and tabular data separately.
- Concatenation of image and tabular layers.
Output Layers:
- Separate outputs for each attribute with a softmax activation.

Training the Model

Compile the Model
- Loss: Categorical Crossentropy for each attribute.
- Metrics: Accuracy for each attribute.
Fit the Model
- Input: Combined image and tabular data.
- Output: One-hot encoded labels for all attributes.
- Validation split: 20%
- Epochs: 50
- Batch Size: 32

Testing and Predictions

Prepare Test Features
- Filter test samples for "Men Tshirts"/other product categories
- Reshape the image features as required.
Get Predictions
- Use the trained model to predict attributes.
- Extract the class with the highest probability for each attribute.
Reverse Encoding
- Use LabelEncoder.inverse_transform to convert predicted labels back to original values.
Submission File
- Create a DataFrame with predictions for all attributes.
- Ensure proper alignment with test data indices.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Men-tshirts-only.ipynb		Men-tshirts-only.ipynb
README.md		README.md
Sarees Only.ipynb		Sarees Only.ipynb
Submission1-200-6k.ipynb		Submission1-200-6k.ipynb
Submission2-1k-8092.ipynb		Submission2-1k-8092.ipynb
Submission3-23k-21k.ipynb		Submission3-23k-21k.ipynb
finetuning-the-clip-model-using-ai2d.ipynb		finetuning-the-clip-model-using-ai2d.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Predict attributes from product image

Dataset Description

Setup and Data Preparation

Model Architecture

Training the Model

Testing and Predictions

About

Uh oh!

Releases

Packages

Languages

shruthimohan03/Predict-attributes-from-product-image

Folders and files

Latest commit

History

Repository files navigation

Predict attributes from product image

Dataset Description

Setup and Data Preparation

Model Architecture

Training the Model

Testing and Predictions

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages