This project addresses the challenges in fashion image analysis by developing models for jeans detection and segmentation, multi-label classification of fashion attributes, and attribute embedding using BERT. The methodologies employed include Detectron2 for object detection, Fastai for multi-label classification, and BERT for attribute embedding. The experiments demonstrate the effectiveness of these models in accurately detecting jeans, classifying multiple fashion attributes, and generating meaningful attribute embeddings.
Fashion image analysis involves complex tasks such as detecting specific clothing items, classifying multiple attributes, and understanding textual descriptions. This project focuses on:
Utilizing Detectron2's Generalized R-CNN models to accurately detect and segment jeans in images.
Applying Fastai to classify multiple attributes of fashion items, acknowledging that garments often possess several attributes simultaneously.
Leveraging BERT embeddings to extract features from product descriptions, facilitating tasks like semantic search and information retrieval.
Dataset: Custom dataset with jeans annotations for training.
Model: Detectron2 with configurations for model training, including learning rate and batch size.
Evaluation: Visual and potentially quantitative analysis of detection and segmentation accuracy.
Dataset: Levi's jeans data with multi-label annotations.
Training: FastAI's data block API for structured data handling, with focus on balancing datasets.
Metrics: Accuracy, F1 scores, etc., for each attribute classification.
Text Data: Uses fashion descriptions to show how BERT can produce meaningful embeddings.
Process: Tokenization, embedding generation, and visualization of semantic relationships.
FastAI v2: Enhanced capabilities with new features like data augmentation and progressive resizing.
Experimentation: Focused on optimizing multi-label classification for jeans attributes.
Additional Information from Repository:
Code Structure: Includes datasets, model checkpoints, and visualization scripts.
Pretrained Models: Utilization of COCO pre-trained weights for initial model setup.
For detailed code and further information, please refer to the following notebooks:
https://github.com/Pyligent/Fashion-Jeans-Detection-attributes-BERT-embedding-multilabels/tree/master
This project illustrates the powerful integration of AI technologies in fashion analysis, specifically for jeans. By combining Detectron2, FastAI, and BERT, we provide tools that can significantly improve how fashion items are searched for, recommended, and understood. Future work could aim at expanding datasets, refining model architectures, and applying these methods in real-world fashion tech scenarios.
There are no datasets linked
There are no datasets linked