Monthly Archive: February 2025

Conformal Predictions for Human Action Recognition with Vision-Language Models

Posted: February 11, 2025 | Author: jarxiv

Abstract: Human-in-the-Loop (HITL) frameworks, in many real …

Categories: cs.AI, cs.CV

Few-Shot Classification and Anatomical Localization of Tissues in SPECT Imaging

Posted: February 11, 2025 | Author: jarxiv

Abstract: Accurate classification and anatomical localization are essential for effective medical diagnosis and research, and deep learning …

Categories: cs.AI, cs.CV, cs.LG

Do generative video models learn physical principles from watching videos?

Posted: February 11, 2025 | Author: jarxiv

Abstract: AI video generation is undergoing a revolution, with quality and realism advancing rapidly …

Categories: cs.AI, cs.CV, cs.GR, cs.LG

Prototype Contrastive Consistency Learning for Semi-Supervised Medical Image Segmentation

Posted: February 11, 2025 | Author: jarxiv

Abstract: Medical image segmentation is a crucial task in medical image analysis, but …

Categories: cs.CV, I.4.6

Generalizable Implicit Motion Modeling for Video Frame Interpolation

Posted: February 11, 2025 | Author: jarxiv

Abstract: Motion modeling is critical in flow-based video frame interpolation (VFI) …

Categories: cs.CV

CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale Analysis

Posted: February 11, 2025 | Author: jarxiv

Abstract: Person re-identification (ReID) is a key challenge in computer vision, …

Categories: cs.AI, cs.CV, cs.LG

Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene

Posted: February 11, 2025 | Author: jarxiv

Abstract: Self-driving cars rely on egocentric perception and therefore face sensing limitations, often occluded …

Categories: cs.CV

Optimal Visual Search with Highly Heuristic Decision Rules

Posted: February 11, 2025 | Author: jarxiv

Abstract: Visual search is a fundamental natural task for humans and other animals. Well-separated …

Categories: cs.CV, q-bio.NC, stat.AP

MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval

Posted: February 11, 2025 | Author: jarxiv

Abstract: Efficiently retrieving and synthesizing information from large-scale multimodal collections …

Categories: cs.CL, cs.CV

GHOST: Gaussian Hypothesis Open-Set Technique

Posted: February 11, 2025 | Author: jarxiv

Abstract: Evaluations of large-scale recognition methods typically focus on overall performance …

Categories: cs.AI, cs.CV, cs.LG
