Monthly Archives: August 2024

Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection

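Posted on: August 6, 2024 | Author: jarxiv

Abstract: Sarcasm is a type of irony in which, between the literal interpretation and the intended meaning, there is an inherent … Continue reading →

Categories: cs.AI, cs.CV | Comments are closed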

Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need

Posted on: August 6, 2024 | Author: jarxiv

Abstract: In class-incremental learning (CIL), without forgetting old classes, the emerging new … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed

APARATE: Adaptive Adversarial Patch for CNN-based Monocular Depth Estimation for Autonomous Navigation

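Posted on: August 6, 2024 | Author: jarxiv

Abstract: Recently, the performance of monocular depth estimation (MDE) has improved significantly. This … Continue reading →

Categories: cs.CV, cs.RO | Comments are closed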

SSAP: A Shape-Sensitive Adversarial Patch for Comprehensive Disruption of Monocular Depth Estimation in Autonomous Navigation Applications

Posted on: August 6, 2024 | Author: jarxiv

Abstract: Monocular depth estimation (MDE), mainly with convolutional neural networks (CN … Continue reading →

Categories: cs.CV, cs.RO | Comments are closed

LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba

Posted on: August 6, 2024 | Author: jarxiv

Abstract: Recent Transformer-based diffusion models show remarkable performance … Continue reading →

Categories: cs.CV | Comments are closed

YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition

Posted on: August 6, 2024 | Author: jarxiv

Abstract: This paper proposes a new framework called YOWOv3. … Continue reading →

Categories: cs.CV | Comments are closed

Unsupervised Change Detection for Space Habitats Using 3D Point Clouds

Posted on: August 6, 2024 | Author: jarxiv

Abstract: In this work, to enable autonomous robotic caretaking in future space habitats, … Continue reading →

Categories: cs.CV, cs.LG, cs.RO | Comments are closed

VidGen-1M: A Large-Scale Dataset for Text-to-video Generation

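Posted on: August 6, 2024 | Author: jarxiv

Abstract: The quality of video-text pairs fundamentally determines the upper bound of text-to-video models … Continue reading →

Categories: cs.CV | Comments are closed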

Interactive 3D Medical Image Segmentation with SAM 2

Posted on: August 6, 2024 | Author: jarxiv

Abstract: In interactive medical image segmentation (IMIS), medical professionals … Continue reading →

Categories: cs.CV | Comments are closed

On Using Quasirandom Sequences in Machine Learning for Model Weight Initialization

Posted on: August 6, 2024 | Author: jarxiv

Abstract: The effectiveness of training neural networks, in machine learning applications … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed
