月別アーカイブ: 2025年1月

EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs

投稿日: 2025年1月24日作成者: jarxiv

要約 Apple Vision Pro などの最近の高度な仮想現実 (VR) ヘ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

投稿日: 2025年1月24日作成者: jarxiv

要約複雑な現実世界のシナリオで人々の社会的相互作用を理解することは、しばしば複 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

By-Example Synthesis of Vector Textures

投稿日: 2025年1月24日作成者: jarxiv

要約単一のラスターの模範を考慮して、任意のサイズの新しいベクトルテクスチャを合 … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Ensuring Medical AI Safety: Explainable AI-Driven Detection and Mitigation of Spurious Model Behavior and Associated Data

投稿日: 2025年1月24日作成者: jarxiv

要約ディープニューラルネットワークは、実際には致命的な結果をもたらす可能性 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

投稿日: 2025年1月24日作成者: jarxiv

要約人間は、情報の知覚、知識の理解、新しい問題を解決するために知識を適応させる … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

MV-GMN: State Space Model for Multi-View Action Recognition

投稿日: 2025年1月24日作成者: jarxiv

要約マルチビューアクション認識の最近の進歩は、トランスベースのモデルに大きく依 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

投稿日: 2025年1月24日作成者: jarxiv

要約拡散モデルは、視覚生成の支配的なアプローチとなっています。彼らは、入力に … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Where Do You Go? Pedestrian Trajectory Prediction using Scene Features

投稿日: 2025年1月24日作成者: jarxiv

要約歩行者の軌跡を正確に予測することは、自動運転車の安全性を高め、歩行者が巻き … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Enhanced Encoder-Decoder Architecture for Accurate Monocular Depth Estimation

投稿日: 2025年1月24日作成者: jarxiv

要約単一の 2D 画像から奥行きを推定することは、通常、奥行き知覚に必要なステ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods

投稿日: 2025年1月24日作成者: jarxiv

要約粗粒の廃棄物の現在の廃棄施設は、重機を備えた材料の手動ソートを実行します。 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

月別アーカイブ: 2025年1月

EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

By-Example Synthesis of Vector Textures

Ensuring Medical AI Safety: Explainable AI-Driven Detection and Mitigation of Spurious Model Behavior and Associated Data

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

MV-GMN: State Space Model for Multi-View Action Recognition

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Where Do You Go? Pedestrian Trajectory Prediction using Scene Features

Enhanced Encoder-Decoder Architecture for Accurate Monocular Depth Estimation

First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods

最近の投稿

最近のコメント

アーカイブ

カテゴリー