投稿者「jarxiv」のアーカイブ

Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design

投稿日: 2025年1月31日作成者: jarxiv

要約顔の感情認識は、ユーザーエクスペリエンスのドメイン、特に最新のユーザビリテ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Learning Priors of Human Motion With Vision Transformers

投稿日: 2025年1月31日作成者: jarxiv

要約人間がシナリオのどこに移動するか、通常のパスと速度、そして停止する場所を明 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

UDC-VIT: A Real-World Video Dataset for Under-Display Cameras

投稿日: 2025年1月31日作成者: jarxiv

要約ディスプレイカメラ（UDC）は、デジタルカメラレンズをディスプレイパネルの … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest

投稿日: 2025年1月31日作成者: jarxiv

要約人工知能は医学的視覚的質問応答（MED-VQA）に大きな進歩を遂げましたが … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Perspectives: Comparison of Deep Learning Segmentation Models on Biophysical and Biomedical Data

投稿日: 2025年1月31日作成者: jarxiv

要約現在、深い学習ベースのアプローチは、画像セグメンテーション、機能選択、デコ … 続きを読む →

カテゴリー: cs.CV, eess.IV, physics.bio-ph | コメントを受け付けていません

Vision-based autonomous structural damage detection using data-driven methods

投稿日: 2025年1月31日作成者: jarxiv

要約この研究では、再生可能エネルギーインフラストラクチャの重要なコンポーネント … 続きを読む →

カテゴリー: (Primary), cs.AI, cs.CV, eess.IV, secondary | コメントを受け付けていません

Inkspire: Supporting Design Exploration with Generative AI through Analogical Sketching

投稿日: 2025年1月31日作成者: jarxiv

要約テキストツーイメージ（T2I）AIモデルの能力に最近の進歩により、製品設計 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.MM | コメントを受け付けていません

DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models

投稿日: 2025年1月31日作成者: jarxiv

要約照明効果の理解とモデリングは、コンピュータービジョンとグラフィックスの基本 … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

投稿日: 2025年1月31日作成者: jarxiv

要約実際のシナリオでは、モデルが未知のターゲット分布に適応または一般化する必要 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Diffusion Autoencoders are Scalable Image Tokenizers

投稿日: 2025年1月31日作成者: jarxiv

要約画像をコンパクトな視覚表現にトークン化することは、効率的で高品質の画像生成 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design

Learning Priors of Human Motion With Vision Transformers

UDC-VIT: A Real-World Video Dataset for Under-Display Cameras

R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest

Perspectives: Comparison of Deep Learning Segmentation Models on Biophysical and Biomedical Data

Vision-based autonomous structural damage detection using data-driven methods

Inkspire: Supporting Design Exploration with Generative AI through Analogical Sketching

DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

Diffusion Autoencoders are Scalable Image Tokenizers

最近の投稿

最近のコメント

アーカイブ

カテゴリー