月別アーカイブ: 2025年3月

StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts

投稿日: 2025年3月5日作成者: jarxiv

要約この作業では、レイアウト制御拡散モデルと組み合わせた大規模な言語モデルを使 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs

投稿日: 2025年3月5日作成者: jarxiv

要約最近のマルチモーダル大手言語モデル（MLLMS）は、マルチモーダルの問い合 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts

投稿日: 2025年3月5日作成者: jarxiv

要約アフォーダンスとは、エージェントが環境から認識し、利用する機能特性を指し、 … 続きを読む →

カテゴリー: cs.CV, cs.RO, eess.IV | コメントを受け付けていません

ARC-Flow : Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields

投稿日: 2025年3月5日作成者: jarxiv

要約この作業は、2つの3Dの明確な形状とそれらの間の密な対応の自動推定の間の物 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Smoothing the Shift: Towards Stable Test-Time Adaptation under Complex Multimodal Noises

投稿日: 2025年3月5日作成者: jarxiv

要約テスト時間適応（TTA）は、ソースデータにアクセスせずに、無ー化されたテス … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish

投稿日: 2025年3月5日作成者: jarxiv

要約データ駆動型のベンチマークは、気象や構造生物学を含む主要な科学モデリングド … 続きを読む →

カテゴリー: cs.CV, cs.LG, q-bio.NC | コメントを受け付けていません

XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification

投稿日: 2025年3月5日作成者: jarxiv

要約シングルビューの医療画像分類と比較して、複数のビューを使用すると、ビュー間 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation

投稿日: 2025年3月5日作成者: jarxiv

要約 UAVテクノロジーは徐々に成熟しており、スマートな農業と正確な監視に対する … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

R2Det: Exploring Relaxed Rotation Equivariance in 2D object detection

投稿日: 2025年3月5日作成者: jarxiv

要約 Group Equivariant Convolution（GCONV）は … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

A dataset-free approach for self-supervised learning of 3D reflectional symmetries

投稿日: 2025年3月5日作成者: jarxiv

要約このホワイトペーパーでは、入力オブジェクト自体のみでデータセットに依存する … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年3月

StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts

Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts

ARC-Flow : Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields

Smoothing the Shift: Towards Stable Test-Time Adaptation under Complex Multimodal Noises

ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish

XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification

WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation

R2Det: Exploring Relaxed Rotation Equivariance in 2D object detection

A dataset-free approach for self-supervised learning of 3D reflectional symmetries

最近の投稿

最近のコメント

アーカイブ

カテゴリー