Category Archive: cs.CV

Knowledge Circuits in Pretrained Transformers

Posted on: January 6, 2025 | Author: jarxiv

Summary: The remarkable capabilities of modern large language models … encoded in their parameters … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | Comments are closed

Agent Planning with World Knowledge Model

Posted on: January 6, 2025 | Author: jarxiv

Summary: Directly using large language models (LLMs) as agent models for interactive … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.MA | Comments are closed

EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation

Posted on: January 6, 2025 | Author: jarxiv

Summary: For robotic manipulation tasks, we … a specially designed embodied future space … Continue reading →

Categories: cs.CV, cs.LG, cs.RO | Comments are closed

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Posted on: January 6, 2025 | Author: jarxiv

Summary: Recently, slow-thinking reasoning systems built on large language models (LLMs) … Continue reading →

Categories: cs.AI, cs.CV | Comments are closed

Detecting and Mitigating Adversarial Attacks on Deep Learning-Based MRI Reconstruction Without Any Retraining

Posted on: January 6, 2025 | Author: jarxiv

Summary: Deep learning (DL) methods, especially those based on physics-driven DL, … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed

Conditional Consistency Guided Image Translation and Enhancement

Posted on: January 6, 2025 | Author: jarxiv

Summary: Consistency models have emerged as a promising alternative to diffusion models, … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed

Exoplanet Detection via Differentiable Rendering

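Posted on: January 6, 2025 | Author: jarxiv

Summary: Direct imaging of exoplanets is crucial for advancing our understanding of planetary systems beyond the solar system … Continue reading →

Categories: astro-ph.EP, astro-ph.IM, cs.CV, eess.IV | Comments are closed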

DINO-LG: A Task-Specific DINO Model for Coronary Calcium Scoring

Posted on: January 6, 2025 | Author: jarxiv

Summary: Coronary artery disease (CAD) is one of the leading causes of mortality worldwide, … Continue reading →

Categories: cs.AI, cs.CV, eess.IV | Comments are closed

Transformer-Driven Inverse Problem Transform for Fast Blind Hyperspectral Image Dehazing

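Posted on: January 6, 2025 | Author: jarxiv

Summary: Hyperspectral dehazing (HyDHZ), in order to facilitate subsequent identification and classification tasks, … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed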

Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding

Posted on: January 6, 2025 | Author: jarxiv

Summary: Large vision-language models (LVLMs), in downstream multimodal tasks, … Continue reading →

Categories: cs.AI, cs.CV | Comments are closed
