Monthly Archives: February 2025

SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest

Posted on: February 14, 2025 | Author: jarxiv

Summary: With machine learning algorithms, high-quality stereo depth estimation, augmented and virtual reality …

Categories: cs.AR, cs.CV

Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model

Posted on: February 14, 2025 | Author: jarxiv

Summary: With recent advances in conditional diffusion models, realistic talking-face videos …

Categories: cs.CV

Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community

Posted on: February 14, 2025 | Author: jarxiv

Summary: Object detection, particularly open-vocabulary object detection, environmental …

Categories: cs.CV

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Posted on: February 14, 2025 | Author: jarxiv

Summary: To create embodied agents, multimodal large language models (M …

Categories: cs.AI, cs.CL, cs.CV

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Posted on: February 14, 2025 | Author: jarxiv

Summary: This work is the first unified model for dense understanding of both images and videos …

Categories: cs.CV

Self-Calibrating Gaussian Splatting for Large Field of View Reconstruction

Posted on: February 14, 2025 | Author: jarxiv

Summary: In this paper, camera parameters, lens distortion, and 3D Gaussian representations are jointly …

Categories: cs.CV, cs.GR

Diffusing DeBias: a Recipe for Turning a Bug into a Feature

Posted on: February 14, 2025 | Author: jarxiv

Summary: The effectiveness of deep learning models in classification tasks, with specific attributes and target labels …

Categories: cs.CV, cs.LG, I.4

Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery

Posted on: February 14, 2025 | Author: jarxiv

Summary: Vision Transformers (ViT) have recently, in computer vision …

Categories: cs.AI, cs.CV

Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering

Posted on: February 14, 2025 | Author: jarxiv

Summary: In this study, zero-shot classification across seven key categories of video quality …

Categories: cs.CL, cs.CV, cs.LG

GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis

Posted on: February 14, 2025 | Author: jarxiv

Summary: The continuous operation of Earth-orbiting satellites, a vast and ever-growing body of remote sensing (RS) imagery …

Categories: cs.CV
