Quality and Quantity: Unveiling a Million High-Quality Images for Text-to-Image Synthesis in Fashion Design

要約

AIとファッションデザインの融合は、有望な研究分野として浮上しています。
しかし、衣服と試着段階に関する相互に関連する広範なデータが不足しているため、この分野での AI の可能性を最大限に発揮することができません。
これに対処するために、私たちは、数年にわたる厳しい努力の成果であるファッション拡散データセットを紹介します。
この種のものとしては初めてのこのデータセットは、100 万枚を超える高品質のファッション画像と詳細なテキスト説明で構成されています。
このデータセットは、さまざまな地理的位置や文化的背景から得られたもので、世界的なファッショントレンドを要約しています。
画像には、衣服や人間に関連するきめ細かい属性が注意深く注釈付けされており、ファッションデザインプロセスを Text-to-Image (T2I) タスクに簡素化します。
Fashion-Diffusion データセットは、高品質のテキストと画像のペアや多様な人間と衣服のペアを提供するだけでなく、人間に関する大規模なリソースとしても機能するため、T2I 世代の研究が促進されます。
さらに、T2Iベースのファッションデザイン分野の標準化を促進するために、ファッションデザインモデルのパフォーマンスを評価するための複数のデータセットで構成される新しいベンチマークを提案します。
この成果は、AI 主導のファッションデザインの分野における大きな進歩を表しており、この分野の将来の研究に新たな標準を設定します。

要約(オリジナル)

The fusion of AI and fashion design has emerged as a promising research area. However, the lack of extensive, interrelated data on clothing and try-on stages has hindered the full potential of AI in this domain. Addressing this, we present the Fashion-Diffusion dataset, a product of multiple years’ rigorous effort. This dataset, the first of its kind, comprises over a million high-quality fashion images, paired with detailed text descriptions. Sourced from a diverse range of geographical locations and cultural backgrounds, the dataset encapsulates global fashion trends. The images have been meticulously annotated with fine-grained attributes related to clothing and humans, simplifying the fashion design process into a Text-to-Image (T2I) task. The Fashion-Diffusion dataset not only provides high-quality text-image pairs and diverse human-garment pairs but also serves as a large-scale resource about humans, thereby facilitating research in T2I generation. Moreover, to foster standardization in the T2I-based fashion design field, we propose a new benchmark comprising multiple datasets for evaluating the performance of fashion design models. This work represents a significant leap forward in the realm of AI-driven fashion design, setting a new standard for future research in this field.

arxiv情報

著者	Jia Yu,Lichao Zhang,Zijie Chen,Fayu Pan,MiaoMiao Wen,Yuming Yan,Fangsheng Weng,Shuai Zhang,Lili Pan,Zhenzhong Lan
発行日	2024-03-18 12:23:25+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Quality and Quantity: Unveiling a Million High-Quality Images for Text-to-Image Synthesis in Fashion Design

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー