MGT: Extending Virtual Try-Off to Multi-Garment Scenarios
With a simple yet effective extension to TryOffDiff, we present Multi-Garment TryOffDiff (MGT), a diffusion-based VTOFF model that supports upper-body, lower-body, and dress garments.
I am working as MLE at fal.ai and a Ph.D. researcher in the Machine Learning Group at Bielefeld University. My research focuses on vision-language models and diffusion models, with an emphasis on their application in real-world scenarios.
With a simple yet effective extension to TryOffDiff, we present Multi-Garment TryOffDiff (MGT), a diffusion-based VTOFF model that supports upper-body, lower-body, and dress garments.
Introducing Virtual Try-Off (VTOFF), a task for generating standardized garment images, and TryOffDiff, a model that adapts Stable Diffusion with SigLIP-based conditioning for high fidelity and detail retention.
We present a novel e-commerce fashion dataset to enhance object detection and segmentation models, introducing a basic data augmentation approach to increase model robustness for industrial use.
Evaluating Transformer architectures and classical ML models for process modeling, demonstrating their predictive capabilities and the insights provided by attention mechanisms and XAI techniques.
Won 3rd prize in the competition by finetunung VisualBERT and using ensemble learning.