月別アーカイブ: 2025年5月

Convolutional Long Short-Term Memory Neural Networks Based Numerical Simulation of Flow Field

投稿日: 2025年5月22日作成者: jarxiv

要約計算流体力学（CFD）は、流れ場を分析するための主なアプローチです。ただ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation

投稿日: 2025年5月22日作成者: jarxiv

要約 3Dセマンティックセグメンテーションは、自律運転および道路インフラストラク … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

M3TR: A Generalist Model for Real-World HD Map Completion

投稿日: 2025年5月22日作成者: jarxiv

要約自動運転車は操作のためにHDマップに依存していますが、オフラインのHDマッ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

TinyDrive: Multiscale Visual Question Answering with Selective Token Routing for Autonomous Driving

投稿日: 2025年5月22日作成者: jarxiv

要約自律運転で視覚的な質問回答（VQA）に採用されたビジョン言語モデル（VLM … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach

投稿日: 2025年5月22日作成者: jarxiv

要約視覚的なキューを統合することにより、騒々しい環境での視聴覚音声認識（AVS … 続きを読む →

カテゴリー: cs.CV, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation

投稿日: 2025年5月22日作成者: jarxiv

要約 3Dインスタンスセグメンテーション（3DIS）は大幅に進歩していますが、既 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Visual Perturbation and Adaptive Hard Negative Contrastive Learning for Compositional Reasoning in Vision-Language Models

投稿日: 2025年5月22日作成者: jarxiv

要約ビジョン言語モデル（VLM）は、マルチモーダルタスク、特に構成推論（CR） … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset

投稿日: 2025年5月22日作成者: jarxiv

要約最近の大規模モデリングのブレークスルーにより、セグメントAnything … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

投稿日: 2025年5月22日作成者: jarxiv

要約チャートやドキュメントなどの豊富なテキストを持つ画像に関する推論は、ビジョ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Diversity-Driven View Subset Selection for Indoor Novel View Synthesis

投稿日: 2025年5月22日作成者: jarxiv

要約屋内シーンの新しいビュー統合は、環境の単眼ビデオシーケンスをキャプチャする … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年5月

Convolutional Long Short-Term Memory Neural Networks Based Numerical Simulation of Flow Field

seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation

M3TR: A Generalist Model for Real-World HD Map Completion

TinyDrive: Multiscale Visual Question Answering with Selective Token Routing for Autonomous Driving

Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach

CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation

Visual Perturbation and Adaptive Hard Negative Contrastive Learning for Compositional Reasoning in Vision-Language Models

UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Diversity-Driven View Subset Selection for Indoor Novel View Synthesis

最近の投稿

最近のコメント

アーカイブ

カテゴリー