"cs.MM" Category Archive

Learning from Label Relationships in Human Affect

Posted: July 13, 2022 | Author: jarxiv

Abstract: Estimating human affect and mental state in an automated manner, where temporal resolution is poor or entirely …

Categories: cs.CV, cs.HC, cs.MM

LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval

Posted: July 12, 2022 | Author: jarxiv

Abstract: Video-text retrieval is a class of cross-modal representation learning problems, where the goal is to …

Categories: cs.CV, cs.MM

SHREC’22 Track: Sketch-Based 3D Shape Retrieval in the Wild

Posted: July 12, 2022 | Author: jarxiv

Abstract: Sketch-based 3D shape retrieval (SBSR) is an important yet challenging task …

Categories: cs.CV, cs.GR, cs.MM

Intra-Modal Constraint Loss For Image-Text Retrieval

Posted: July 12, 2022 | Author: jarxiv

Abstract: Cross-modal retrieval, in the fields of both computer vision and natural language processing, …

Categories: cs.CV, cs.MM

Audio-Visual Segmentation

Posted: July 12, 2022 | Author: jarxiv

Abstract: We propose to explore a new problem called audio-visual segmentation (AVS) …

Categories: cs.CV, cs.MM, cs.SD, eess.AS, eess.IV

Exploring the Effectiveness of Video Perceptual Representation in Blind Video Quality Assessment

Posted: July 11, 2022 | Author: jarxiv

Abstract: With the rapid growth of in-the-wild videos taken by non-specialists, blind video …

Categories: cs.CV, cs.MM, eess.IV

FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis

Posted: July 11, 2022 | Author: jarxiv

Abstract: Unconstrained lip-to-speech synthesis, with no restrictions on head pose or vocabulary, … talking faces …

Categories: cs.CL, cs.CV, cs.MM, cs.SD, eess.AS, I.2.10

Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization

Posted: July 8, 2022 | Author: jarxiv

Abstract: Audio-visual representations have been proven applicable to many downstream tasks, but …

Categories: cs.CV, cs.MM, cs.SD, eess.AS

FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling

Posted: July 7, 2022 | Author: jarxiv

Abstract: Current deep video quality assessment (VQA) methods, when evaluating high-resolution videos, …

Categories: cs.CV, cs.MM

Adversarial Robustness of Visual Dialog

Posted: July 7, 2022 | Author: jarxiv

Abstract: To ensure the safety and reliability of machine learning models, adversarial robustness … the worst …

Categories: cs.CV, cs.MM

