"cs.CV" Category Archive

Reliable Breast Cancer Molecular Subtype Prediction based on uncertainty-aware Bayesian Deep Learning by Mammography

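Posted: December 17, 2024 | Author: jarxiv

Abstract: Breast cancer is a heterogeneous disease that differs in molecular subtype, clinical behavior, treatment response, and survival outcome …

Categories: cs.CV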

Gramian Multimodal Representation Learning and Alignment

Posted: December 17, 2024 | Author: jarxiv

Abstract: Human perception integrates multiple modalities, such as vision, hearing, and language, so that the surrounding reality …

Categories: cs.AI, cs.CV, cs.LG

GSDiff: Synthesizing Vector Floorplans via Geometry-enhanced Structural Graph Generation

Posted: December 17, 2024 | Author: jarxiv

Abstract: Automating architectural floorplan design is essential for residential and interior design, and …

Categories: cs.CV, cs.GR

Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data

Posted: December 17, 2024 | Author: jarxiv

Abstract: Realistic shadow generation is a key component of high-quality image compositing and visual effects …

Categories: cs.CV

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Posted: December 17, 2024 | Author: jarxiv

Abstract: Conventional reinforcement learning-based robot control methods are often task-specific, and diverse …

Categories: cs.AI, cs.CL, cs.CV, cs.RO

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Posted: December 17, 2024 | Author: jarxiv

Abstract: Diffusion models have revolutionized talking head generation, but expressiveness, …

Categories: cs.AI, cs.CV

SAMIC: Segment Anything with In-Context Spatial Prompt Engineering

Posted: December 17, 2024 | Author: jarxiv

Abstract: In few-shot segmentation, a small set of labeled reference images …

Categories: cs.CV

LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts

Posted: December 17, 2024 | Author: jarxiv

Abstract: Writing radiology reports is a complex task that requires flexibility, and radiologists …

Categories: cs.CL, cs.CV

COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation

Posted: December 17, 2024 | Author: jarxiv

Abstract: In multi-target domain adaptation (MTDA), a single …

Categories: cs.CV

RepFace: Refining Closed-Set Noise with Progressive Label Correction for Face Recognition

Posted: December 17, 2024 | Author: jarxiv

Abstract: Face recognition, through the growing scale of datasets, advances in various backbones, and …

Categories: cs.CV
