"cs.CV" Category Archive

Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes

Posted: February 21, 2025 | Author: jarxiv

Summary: The significant effort required to annotate data for new training datasets …

Categories: cs.CV

YOLOv12: A Breakdown of the Key Architectural Features

Posted: February 21, 2025 | Author: jarxiv

Summary: This paper presents an architectural analysis of YOLOv12, a significant improvement …

Categories: cs.AI, cs.CV

Data Attribution for Text-to-Image Models by Unlearning Synthesized Images

Posted: February 21, 2025 | Author: jarxiv

Summary: The goal of data attribution for text-to-image models relates to what most influences the generation of a new image …

Categories: cs.CV, cs.LG

MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders

Posted: February 21, 2025 | Author: jarxiv

Summary: To capture the fine-grained features needed for clinical decision-making, medical images use large …

Categories: cs.AI, cs.CV, eess.IV

Sculpting [CLS] Features for Pre-Trained Model-Based Class-Incremental Learning

Posted: February 21, 2025 | Author: jarxiv

Summary: In class-incremental learning, a model must not forget old classes while new classes …

Categories: cs.CV, cs.LG

Harnessing PDF Data for Improving Japanese Large Multimodal Models

Posted: February 21, 2025 | Author: jarxiv

Summary: Large multimodal models (LMMs) have demonstrated strong performance in English …

Categories: cs.AI, cs.CL, cs.CV

DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models

Posted: February 21, 2025 | Author: jarxiv

Summary: In this paper, DC (Decouple)-ControlNet …

Categories: cs.CV

ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting

Posted: February 21, 2025 | Author: jarxiv

Summary: With AR, VR, and modern smartphones equipped with powerful cameras, human-computer …

Categories: cs.AI, cs.CL, cs.CV

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning

Posted: February 21, 2025 | Author: jarxiv

Summary: Recent advances in 3D large language models (3DLLMs) relate to general-purpose agents in the 3D real world …

Categories: cs.AI, cs.CL, cs.CV

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Posted: February 21, 2025 | Author: jarxiv

Summary: New multilingual vision-language encoders built on the success of the original SigLIP …

Categories: cs.AI, cs.CV
