Monthly Archives: March 2024

Zero-Shot Aerial Object Detection with Visual Description Regularization

Posted: March 4, 2024 | Author: jarxiv

Abstract: Existing object detection models are trained mainly on large-scale labeled datasets …

Categories: cs.CV

SegReg: Segmenting OARs by Registering MR Images and CT Annotations

Posted: March 4, 2024 | Author: jarxiv

Abstract: Organ at Risk (OAR) segmentation, in cases such as head and neck tumors, …

Categories: cs.CV

Out-of-Distribution Detection using Neural Activation Prior

Posted: March 4, 2024 | Author: jarxiv

Abstract: Out-of-distribution detection, for deploying machine learning models in the real world and handling unseen scenarios, …

Categories: cs.CV

EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE

Posted: March 4, 2024 | Author: jarxiv

Abstract: Building scalable vision-language models that learn from diverse multimodal data …

Categories: cs.CL, cs.CV, cs.LG, cs.MM

DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder

Posted: March 4, 2024 | Author: jarxiv

Abstract: Recent research has greatly advanced speech-driven talking face generation, but the generated video's …

Categories: cs.CV, cs.MM

Unfolding Local Growth Rate Estimates for (Almost) Perfect Adversarial Detection

Posted: March 4, 2024 | Author: jarxiv

Abstract: Convolutional neural networks (CNNs) are, on many perceptual tasks, state-of-the-art …

Categories: cs.CR, cs.CV

High-Speed Detector For Low-Powered Devices In Aerial Grasping

Posted: March 4, 2024 | Author: jarxiv

Abstract: Autonomous aerial harvesting is a highly complex problem. This is because small, low-power comp …

Categories: cs.CV, cs.RO

Adversarial Examples are Misaligned in Diffusion Model Manifolds

Posted: March 4, 2024 | Author: jarxiv

Abstract: In recent years, diffusion models (DMs), for the data distribution's …

Categories: cs.CR, cs.CV

GAMMA: Generalizable Articulation Modeling and Manipulation for Articulated Objects

Posted: March 4, 2024 | Author: jarxiv

Abstract: Articulated objects such as cabinets and doors are widespread in daily life …

Categories: cs.CV, cs.RO

BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery

Posted: March 4, 2024 | Author: jarxiv

Abstract: Satellites equipped with optical sensors capture high-resolution imagery, and for various environmental phenomena …

Categories: cs.CV, cs.LG, eess.IV
