「cs.CV」カテゴリーアーカイブ

CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting

投稿日: 2025年3月11日作成者: jarxiv

要約安全な自律運転（AD）には、正確なモーション予測が重要です。この研究では … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

MonoSOWA: Scalable monocular 3D Object detector Without human Annotations

投稿日: 2025年3月11日作成者: jarxiv

要約単一のRGBカメラからのオブジェクト3Dの位置と方向を推測することは、多く … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

AI-Driven Automated Tool for Abdominal CT Body Composition Analysis in Gastrointestinal Cancer Management

投稿日: 2025年3月11日作成者: jarxiv

要約胃腸がんの発生率は、特に中国では、正確な予後評価と効果的な治療戦略の重要性 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity Recognition

投稿日: 2025年3月11日作成者: jarxiv

要約エゴセントリックビデオベースのモデルは、豊富なセマンティック情報をキャプチ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

投稿日: 2025年3月11日作成者: jarxiv

要約テキストツーイメージ（T2I）モデルは、高品質の芸術作品と視覚的なコンテン … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification

投稿日: 2025年3月11日作成者: jarxiv

要約量子視力変圧器（QVITS）は、自己触媒メカニズム内の線形層をパラメーター … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

ALLVB: All-in-One Long Video Understanding Benchmark

投稿日: 2025年3月11日作成者: jarxiv

要約画像からビデオの理解まで、マルチモーダルLLMS（MLLM）の機能はますま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Goal Conditioned Reinforcement Learning for Photo Finishing Tuning

投稿日: 2025年3月11日作成者: jarxiv

要約写真仕上げのチューニングは、Adobe LightroomやDarktab … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models

投稿日: 2025年3月11日作成者: jarxiv

要約拡散モデルはスタイル転送タスクで顕著な進歩を遂げましたが、既存の方法は通常 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis

投稿日: 2025年3月11日作成者: jarxiv

要約ビルボードスプラッティング（BBSPLAT） – テクスチャの … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting

MonoSOWA: Scalable monocular 3D Object detector Without human Annotations

AI-Driven Automated Tool for Abdominal CT Body Composition Analysis in Gastrointestinal Cancer Management

COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity Recognition

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification

ALLVB: All-in-One Long Video Understanding Benchmark

Goal Conditioned Reinforcement Learning for Photo Finishing Tuning

AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models

BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis

最近の投稿

最近のコメント

アーカイブ

カテゴリー