Archive of posts by "jarxiv"

PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments

Posted: January 24, 2025 | Author: jarxiv

Abstract: Considerable effort has been devoted to improving monocular depth estimation under ideal conditions …

Categories: cs.CV

3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting

Posted: January 24, 2025 | Author: jarxiv

Abstract: In this paper, using 3D Gaussian Splatting (3DGS), an implicit …

Categories: cs.CV, cs.GR

EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs

Posted: January 24, 2025 | Author: jarxiv

Abstract: Apple Vision Pro and other recent advanced virtual reality (VR) …

Categories: cs.CV

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

Posted: January 24, 2025 | Author: jarxiv

Abstract: Understanding people's social interactions in complex real-world scenarios often …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

By-Example Synthesis of Vector Textures

Posted: January 24, 2025 | Author: jarxiv

Abstract: Given a single raster exemplar, new vector textures of arbitrary size …

Categories: cs.CV, cs.GR

Ensuring Medical AI Safety: Explainable AI-Driven Detection and Mitigation of Spurious Model Behavior and Associated Data

Posted: January 24, 2025 | Author: jarxiv

Abstract: Deep neural networks, with the potential for fatal consequences in practice, …

Categories: cs.AI, cs.CV, cs.LG

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Posted: January 24, 2025 | Author: jarxiv

Abstract: Humans, perceiving information, comprehending knowledge, and adapting knowledge to solve new problems …

Categories: cs.CL, cs.CV

MV-GMN: State Space Model for Multi-View Action Recognition

Posted: January 24, 2025 | Author: jarxiv

Abstract: Recent advances in multi-view action recognition have relied heavily on Transformer-based models …

Categories: cs.CV

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Posted: January 24, 2025 | Author: jarxiv

Abstract: Diffusion models have become the dominant approach for visual generation. They … the input …

Categories: cs.CV, cs.LG

Where Do You Go? Pedestrian Trajectory Prediction using Scene Features

Posted: January 24, 2025 | Author: jarxiv

Abstract: Accurately predicting pedestrian trajectories enhances the safety of autonomous vehicles, and pedestrians …

Categories: cs.AI, cs.CV, cs.LG
