「cs.MM」カテゴリーアーカイブ

Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera

投稿日: 2023年12月21日作成者: jarxiv

要約このペーパーでは、視覚的なオドメトリを活用して、GPS が拒否された環境で … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.MM, cs.RO | コメントを受け付けていません

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

投稿日: 2023年12月21日作成者: jarxiv

要約マルチメディア生成アプローチは、人工知能研究において重要な位置を占めていま … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.MM | コメントを受け付けていません

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

投稿日: 2023年12月21日作成者: jarxiv

要約 OpenAI の GPT-4V(ision) など、マルチモーダル大規模言 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation

投稿日: 2023年12月20日作成者: jarxiv

要約自律型ロボット技術の最近の進歩により、正確な環境分析の必要性が高まっていま … 続きを読む →

カテゴリー: cs.CV, cs.MM, stat.ML | コメントを受け付けていません

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

投稿日: 2023年12月20日作成者: jarxiv

要約 OpenAI の GPT-4V(ision) など、マルチモーダル大規模言 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

Debiasing Multimodal Sarcasm Detection with Contrastive Learning

投稿日: 2023年12月20日作成者: jarxiv

要約既存の研究によって達成された賞賛に値する成果にもかかわらず、一般的なマルチ … 続きを読む →

カテゴリー: cs.CL, cs.MM | コメントを受け付けていません

Debiasing Multimodal Sarcasm Detection with Contrastive Learning

投稿日: 2023年12月19日作成者: jarxiv

要約既存の研究によって達成された賞賛に値する成果にもかかわらず、一般的なマルチ … 続きを読む →

カテゴリー: cs.CL, cs.MM | コメントを受け付けていません

Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identification

投稿日: 2023年12月18日作成者: jarxiv

要約遮蔽された人物の再識別 (ReID) は、遮蔽障害と不完全なターゲット情報 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.MM | コメントを受け付けていません

Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis

投稿日: 2023年12月15日作成者: jarxiv

要約マルチモーダル感情分析 (MSA) は、複数のソース (言語、ビデオ、音声 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor

投稿日: 2023年12月14日作成者: jarxiv

要約既存のオープンボキャブラリーの画像セグメンテーション方法では、マスクの注釈 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

「cs.MM」カテゴリーアーカイブ

Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Debiasing Multimodal Sarcasm Detection with Contrastive Learning

Debiasing Multimodal Sarcasm Detection with Contrastive Learning

Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identification

Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis

CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor

最近の投稿

最近のコメント

アーカイブ

カテゴリー