月別アーカイブ: 2023年9月

A Comprehensive Analysis of AI Biases in DeepFake Detection With Massively Annotated Databases

投稿日: 2023年9月21日作成者: jarxiv

要約近年、ディープフェイクによる画像や動画の改ざんは、セキュリティや社会にとっ … 続きを読む →

カテゴリー: cs.CV, cs.CY, cs.LG | コメントを受け付けていません

Budget-Aware Pruning: Handling Multiple Domains with Less Parameters

投稿日: 2023年9月21日作成者: jarxiv

要約深層学習は、いくつかのコンピュータービジョンタスクおよびドメインで最先 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation

投稿日: 2023年9月21日作成者: jarxiv

要約オーディオビジュアルビデオセグメンテーション (AVVS) は、画像フレー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DeepAqua: Self-Supervised Semantic Segmentation of Wetland Surface Water Extent with SAR Images using Knowledge Distillation

投稿日: 2023年9月21日作成者: jarxiv

要約ディープラーニングとリモートセンシング技術により、水監視能力が大幅に向上し … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

FreeU: Free Lunch in Diffusion U-Net

投稿日: 2023年9月21日作成者: jarxiv

要約この論文では、その場で生成品質を大幅に向上させる「フリーランチ」として機能 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DreamLLM: Synergistic Multimodal Comprehension and Creation

投稿日: 2023年9月21日作成者: jarxiv

要約この論文では、マルチモーダルな理解と作成の間で見落とされがちな相乗効果を強 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

A Large-scale Dataset for Audio-Language Representation Learning

投稿日: 2023年9月21日作成者: jarxiv

要約 AI コミュニティは、大規模なマルチモーダルデータセットを活用した強力な … 続きを読む →

カテゴリー: cs.CV, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

GloPro: Globally-Consistent Uncertainty-Aware 3D Human Pose Estimation & Tracking in the Wild

投稿日: 2023年9月21日作成者: jarxiv

要約正確で不確実性を考慮した 3D 人体の姿勢推定は、真に安全かつ効率的な人間 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation

投稿日: 2023年9月21日作成者: jarxiv

要約事前トレーニングされた言語モデルは、さまざまな音楽の理解と生成のタスクにお … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

KFC: Kinship Verification with Fair Contrastive Loss and Multi-Task Learning

投稿日: 2023年9月21日作成者: jarxiv

要約親族関係の検証は、複数の潜在的なアプリケーションを持つコンピュータービジ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2023年9月

A Comprehensive Analysis of AI Biases in DeepFake Detection With Massively Annotated Databases

Budget-Aware Pruning: Handling Multiple Domains with Less Parameters

CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation

DeepAqua: Self-Supervised Semantic Segmentation of Wetland Surface Water Extent with SAR Images using Knowledge Distillation

FreeU: Free Lunch in Diffusion U-Net

DreamLLM: Synergistic Multimodal Comprehension and Creation

A Large-scale Dataset for Audio-Language Representation Learning

GloPro: Globally-Consistent Uncertainty-Aware 3D Human Pose Estimation & Tracking in the Wild

MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation

KFC: Kinship Verification with Fair Contrastive Loss and Multi-Task Learning

最近の投稿

最近のコメント

アーカイブ

カテゴリー