「cs.CV」カテゴリーアーカイブ

Human-AI Collaborative Multi-modal Multi-rater Learning for Endometriosis Diagnosis

投稿日: 2024年10月28日作成者: jarxiv

要約子宮内膜症は、出生時に女性として割り当てられた人の約 10% に罹患してお … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization

投稿日: 2024年10月28日作成者: jarxiv

要約ドメイン一般化 (DG) 手法は、複数のソースドメインからのトレーニング … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

投稿日: 2024年10月28日作成者: jarxiv

要約アジャイルクワッドローター飛行のための視覚運動ポリシーの学習には、主に高次 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Peter Parker or Spiderman? Disambiguating Multiple Class Labels

投稿日: 2024年10月28日作成者: jarxiv

要約教師あり分類設定では、推論中にディープネットワークが通常、複数の予測を行 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models

投稿日: 2024年10月28日作成者: jarxiv

要約視覚言語モデル (VLM) は、画像に映る個人の評価を必要とするタスクなど … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting

投稿日: 2024年10月28日作成者: jarxiv

要約ストロークベースレンダリング (SBR) は、入力イメージをパラメータ化 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization

投稿日: 2024年10月28日作成者: jarxiv

要約 Neural Radiance Fields (NeRF)、Instant … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

x-RAGE: eXtended Reality — Action & Gesture Events Dataset

投稿日: 2024年10月28日作成者: jarxiv

要約メタバースの出現と近年のウェアラブルデバイスへの注目により、ジェスチャ … 続きを読む →

カテゴリー: cs.CV, cs.ET | コメントを受け付けていません

MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset

投稿日: 2024年10月28日作成者: jarxiv

要約分離手話認識 (ISLR) は、個々の手話の光沢を識別することに重点を置い … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Conditional Hallucinations for Image Compression

投稿日: 2024年10月28日作成者: jarxiv

要約非可逆画像圧縮では、モデルは、情報のボトルネックにより、細部が幻覚になった … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Human-AI Collaborative Multi-modal Multi-rater Learning for Endometriosis Diagnosis

LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization

Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

Peter Parker or Spiderman? Disambiguating Multiple Class Labels

GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models

AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting

Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization

x-RAGE: eXtended Reality — Action & Gesture Events Dataset

MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset

Conditional Hallucinations for Image Compression

最近の投稿

最近のコメント

アーカイブ

カテゴリー