"cs.AI" Category Archive

Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs

Posted: March 5, 2025 | Author: jarxiv

Abstract: Recent multimodal large language models (MLLMs), multimodal quer …

Categories: cs.AI, cs.CV

WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation

Posted: March 5, 2025 | Author: jarxiv

Abstract: UAV technology is gradually maturing, and for smart agriculture and precise monitoring …

Categories: cs.AI, cs.CV

R2Det: Exploring Relaxed Rotation Equivariance in 2D object detection

Posted: March 5, 2025 | Author: jarxiv

Abstract: Group Equivariant Convolution (GConv) …

Categories: cs.AI, cs.CV

A dataset-free approach for self-supervised learning of 3D reflectional symmetries

Posted: March 5, 2025 | Author: jarxiv

Abstract: In this paper, using only the input object itself, dataset-reliant …

Categories: cs.AI, cs.CV

State of play and future directions in industrial computer vision AI standards

Posted: March 5, 2025 | Author: jarxiv

Abstract: In the fields of artificial intelligence (AI) and deep learning (DL), the recent tremendous …

Categories: cs.AI, cs.CV

Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds?

Posted: March 5, 2025 | Author: jarxiv

Abstract: Because of the effort required for data collection and annotation in 3D perception tasks, mixing existing data …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Memory Efficient Continual Learning for Edge-Based Visual Anomaly Detection

Posted: March 5, 2025 | Author: jarxiv

Abstract: Visual anomaly detection (VAD), with numerous real-world applications, compu …

Categories: cs.AI, cs.CV

WalkVLM:Aid Visually Impaired People Walking by Vision Language Model

Posted: March 5, 2025 | Author: jarxiv

Abstract: Because approximately 200 million individuals worldwide have varying degrees of visual impairment, AI technolog …

Categories: cs.AI, cs.CV

A Comprehensive Survey on Composed Image Retrieval

Posted: March 5, 2025 | Author: jarxiv

Abstract: Composed Image Retrieval (CIR), in which users ref …

Categories: cs.AI, cs.CV, cs.IR, cs.MM

UAR-NVC: A Unified AutoRegressive Framework for Memory-Efficient Neural Video Compression

Posted: March 5, 2025 | Author: jarxiv

Abstract: Implicit neural representations (INRs), representing videos as neural networks …

Categories: cs.AI, cs.CV
