「cs.CV」カテゴリーアーカイブ

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

投稿日: 2025年1月16日作成者: jarxiv

要約質問に答えるためにスポーツビデオを推論することは、選手のトレーニングや情 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Few-Shot Learner Generalizes Across AI-Generated Image Detection

投稿日: 2025年1月16日作成者: jarxiv

要約大規模な合成画像データセットでトレーニングされた現在の偽画像検出器は、限ら … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

When No-Reference Image Quality Models Meet MAP Estimation in Diffusion Latents

投稿日: 2025年1月16日作成者: jarxiv

要約最新の非参照画質評価 (NR-IQA) モデルは、知覚される画質を効果的に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Admitting Ignorance Helps the Video Question Answering Models to Answer

投稿日: 2025年1月16日作成者: jarxiv

要約ディープラーニングと大規模な事前トレーニングのおかげで、ビデオ質問応答 ( … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3VL: Using Trees to Improve Vision-Language Models’ Interpretability

投稿日: 2025年1月16日作成者: jarxiv

要約ビジョン言語モデル (VLM) は、画像とテキスト表現を調整するのに効果的 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Solving Energy-Independent Density for CT Metal Artifact Reduction via Neural Representation

投稿日: 2025年1月16日作成者: jarxiv

要約 X 線 CT では、金属材料の存在下で影や縞模様のアーチファクトが発生し、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

投稿日: 2025年1月16日作成者: jarxiv

要約さまざまな画像生成および編集タスクに取り組む命令ベースの拡散フレームワーク … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis

投稿日: 2025年1月16日作成者: jarxiv

要約良好な共同音声動作生成は、一般的なリズミカルな動作と、まれではあるが不可欠 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Exploring ChatGPT for Face Presentation Attack Detection in Zero and Few-Shot in-Context Learning

投稿日: 2025年1月16日作成者: jarxiv

要約この研究は、顔提示攻撃検出 (PAD) の競合代替手段としての ChatG … 続きを読む →

カテゴリー: cs.CR, cs.CV | コメントを受け付けていません

Structural damage detection via hierarchical damage information with volumetric assessment

投稿日: 2025年1月16日作成者: jarxiv

要約構造健全性モニタリング (SHM) は、インフラストラクチャの安全性と寿命 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

Few-Shot Learner Generalizes Across AI-Generated Image Detection

When No-Reference Image Quality Models Meet MAP Estimation in Diffusion Latents

Admitting Ignorance Helps the Video Question Answering Models to Answer

3VL: Using Trees to Improve Vision-Language Models’ Interpretability

Solving Energy-Independent Density for CT Metal Artifact Reduction via Neural Representation

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis

Exploring ChatGPT for Face Presentation Attack Detection in Zero and Few-Shot in-Context Learning

Structural damage detection via hierarchical damage information with volumetric assessment

最近の投稿

最近のコメント

アーカイブ

カテゴリー