Monthly Archives: March 2025

BHViT: Binarized Hybrid Vision Transformer

Posted on: March 6, 2025 by jarxiv

Abstract: Model binarization, for convolutional neural networks (CNNs), …

Categories: cs.CV

A self-supervised cyclic neural-analytic approach for novel view synthesis and 3D reconstruction

Posted on: March 6, 2025 by jarxiv

Abstract: Generating novel views from recorded video, for autonomous UAV navigation …

Categories: cs.CV, I.2.10

Deblur-Avatar: Animatable Avatars from Motion-Blurred Monocular Videos

Posted on: March 6, 2025 by jarxiv

Abstract: Modeling high-fidelity 3D human avatars from motion-blurred monocular video input …

Categories: cs.CV

Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use Case

Posted on: March 6, 2025 by jarxiv

Abstract: For Safety of the Intended Functionality (SOTIF), sensor performance limitations and …

Categories: cs.CV, cs.LG, cs.SY, eess.SY

DFREC: DeepFake Identity Recovery Based on Identity-aware Masked Autoencoder

Posted on: March 6, 2025 by jarxiv

Abstract: Recent advances in deepfake forensics, mainly classification accuracy and generalization …

Categories: cs.CV

Perceptual Multi-Exposure Fusion

Posted on: March 6, 2025 by jarxiv

Abstract: As the ever-growing demand for high dynamic range (HDR) scene capture …

Categories: cs.CV, eess.IV

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Posted on: March 6, 2025 by jarxiv

Abstract: In this work, deep generative models, large language models (LLMs) and other text …

Categories: cs.CV

Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation

Posted on: March 6, 2025 by jarxiv

Abstract: Object affordance, the ability to infer object functions based on physical properties …

Categories: cs.CV, cs.RO

High-Quality Virtual Single-Viewpoint Surgical Video: Geometric Autocalibration of Multiple Cameras in Surgical Lights

Posted on: March 6, 2025 by jarxiv

Abstract: Occlusion-free video generation is difficult because surgeons obstruct the camera's field of view. …

Categories: cs.CV

Safety Without Semantic Disruptions: Editing-free Safe Image Generation via Context-preserving Dual Latent Reconstruction

Posted on: March 6, 2025 by jarxiv

Abstract: Training multimodal generative models on large, uncurated datasets …

Categories: cs.AI, cs.CV

