「cs.CV」カテゴリーアーカイブ

CollEX — A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections

投稿日: 2025年4月11日作成者: jarxiv

要約このペーパーでは、広範な科学コレクションのインタラクティブな探索を強化する … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.IR | コメントを受け付けていません

CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections

投稿日: 2025年4月11日作成者: jarxiv

要約基礎モデルの時代では、Clipは、テキストと視覚モダリティを共通の埋め込み … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map

投稿日: 2025年4月11日作成者: jarxiv

要約交通署名規制の順守を確保することは、人間と自律の車両ナビゲーションの両方に … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion

投稿日: 2025年4月11日作成者: jarxiv

要約医療視覚的質問応答（MED-VQA）は、医療画像を使用して臨床的質問に答え … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

GaussianAnything: Interactive Point Cloud Flow Matching For 3D Object Generation

投稿日: 2025年4月11日作成者: jarxiv

要約 3Dコンテンツの生成は大幅に進歩していますが、既存の方法は、入力形式、潜在 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR | コメントを受け付けていません

Dreamweaver: Learning Compositional World Models from Pixels

投稿日: 2025年4月11日作成者: jarxiv

要約人間は、世界の認識をオブジェクトと、色、形状、運動パターンなどの属性に分解 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Benchmarking Multi-Organ Segmentation Tools for Multi-Parametric T1-weighted Abdominal MRI

投稿日: 2025年4月11日作成者: jarxiv

要約マルチパラメトリックMRI研究における複数の臓器のセグメンテーションは、イ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding

投稿日: 2025年4月11日作成者: jarxiv

要約ビデオベースの大規模な言語モデル（ビデオ-LLM）は、マルチモーダルLLM … 続きを読む →

カテゴリー: 68T45, cs.AI, cs.CV, I.4.8 | コメントを受け付けていません

PIDSR:ComplementaryPolarizedImageDemosaicingandSuper-Resolution

投稿日: 2025年4月11日作成者: jarxiv

要約偏光カメラは、単一ショットで異なる偏光子角を持つ複数の偏光画像をキャプチャ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

PRAD: Periapical Radiograph Analysis Dataset and Benchmark Model Development

投稿日: 2025年4月11日作成者: jarxiv

要約人工知能の極めて重要な技術であるDeep Learning（DL）は、最近 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

CollEX — A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections

CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections

Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map

Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion

GaussianAnything: Interactive Point Cloud Flow Matching For 3D Object Generation

Dreamweaver: Learning Compositional World Models from Pixels

Benchmarking Multi-Organ Segmentation Tools for Multi-Parametric T1-weighted Abdominal MRI

SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding

PIDSR:ComplementaryPolarizedImageDemosaicingandSuper-Resolution

PRAD: Periapical Radiograph Analysis Dataset and Benchmark Model Development

最近の投稿

最近のコメント

アーカイブ

カテゴリー