月別アーカイブ: 2025年4月

Explaining Low Perception Model Competency with High-Competency Counterfactuals

投稿日: 2025年4月8日作成者: jarxiv

要約画像分類モデルがその決定を生成する方法を説明する多くの方法が存在しますが、 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction

投稿日: 2025年4月8日作成者: jarxiv

要約データ表現の選択は、幾何学的なタスクにおける深い学習の成功における重要な要 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TflosYOLO+TFSC: An Accurate and Robust Model for Estimating Flower Count and Flowering Period

投稿日: 2025年4月8日作成者: jarxiv

要約茶の花は、茶植物の分類学的研究とハイブリッド繁殖において重要な役割を果たし … 続きを読む →

カテゴリー: cs.CV, q-bio.QM | コメントを受け付けていません

From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models

投稿日: 2025年4月8日作成者: jarxiv

要約拡張現実（XR）では、ユーザーの全身動きを生成することは、自分の行動を理解 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis

投稿日: 2025年4月8日作成者: jarxiv

要約基礎モデルは、医療ドメインでますます効果的になりつつあり、下流のタスクに容 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

AnomalousNet: A Hybrid Approach with Attention U-Nets and Change Point Detection for Accurate Characterization of Anomalous Diffusion in Video Data

投稿日: 2025年4月8日作成者: jarxiv

要約異常な拡散は、細胞内のタンパク質輸送、複雑な生息地の動物の動き、地下水の汚 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

LiveVQA: Live Visual Knowledge Seeking

投稿日: 2025年4月8日作成者: jarxiv

要約合成されたVQA問題を備えたインターネットからの最新の視覚知識の自動的に収 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects

投稿日: 2025年4月8日作成者: jarxiv

要約 3D Gaussian Splattingは最近、静的3Dシーンの高速かつ … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

One-Minute Video Generation with Test-Time Training

投稿日: 2025年4月8日作成者: jarxiv

要約今日のトランスフォーマーは、自己触媒層が長いコンテキストでは非効率的である … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SmolVLM: Redefining small and efficient multimodal models

投稿日: 2025年4月8日作成者: jarxiv

要約大規模なビジョン言語モデル（VLM）は、例外的なパフォーマンスを提供します … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年4月

Explaining Low Perception Model Competency with High-Competency Counterfactuals

DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction

TflosYOLO+TFSC: An Accurate and Robust Model for Estimating Flower Count and Flowering Period

From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models

FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis

AnomalousNet: A Hybrid Approach with Attention U-Nets and Change Point Detection for Accurate Characterization of Anomalous Diffusion in Video Data

LiveVQA: Live Visual Knowledge Seeking

Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects

One-Minute Video Generation with Test-Time Training

SmolVLM: Redefining small and efficient multimodal models

最近の投稿

最近のコメント

アーカイブ

カテゴリー