"cs.AI" Category Archive

Feature Fusion for Human Activity Recognition using Parameter-Optimized Multi-Stage Graph Convolutional Network and Transformer Models

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Human activity recognition (HAR), computer and machine vision technology …

Categories: cs.AI, cs.CV

Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Recently, textual prompt tuning, Contrastive Lan …

Categories: cs.AI, cs.CV

A Systematic Review of Few-Shot Learning in Medical Imaging

Posted on: June 25, 2024 | Author: jarxiv

Abstract: When annotated medical images are scarce, the large labeled datasets usually required …

Categories: cs.AI, cs.CV, I.2.6

The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers

Posted on: June 25, 2024 | Author: jarxiv

Abstract: In the Transformer neural network architecture, atten …

Categories: cs.AI, cs.CV

Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Evaluating long-context extractive reasoning in vision language models (VLMs) …

Categories: cs.AI, cs.CL, cs.CV

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Posted on: June 25, 2024 | Author: jarxiv

Abstract: This work, from monocular color inputs (i.e., images and videos), high-quality surface …

Categories: cs.AI, cs.CV, cs.GR

VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Recent years have seen great advances in video generation. However, automatic video metrics …

Categories: cs.AI, cs.CV

This actually looks like that: Proto-BagNets for local and global interpretability-by-design

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Interpretability, when using machine learning models in high-stakes applications including medical diagnosis …

Categories: cs.AI

Investigating the impact of 2D gesture representation on co-speech gesture generation

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Co-speech gestures, between humans and embodied conversational agents (ECAs) …

Categories: cs.AI, cs.CL, cs.CV

Adaptive Manipulation using Behavior Trees

Posted on: June 24, 2024 | Author: jarxiv

Abstract: In many manipulation tasks, such as a twisting motion to tighten or loosen a valve, …

Categories: cs.AI, cs.RO
