Monthly Archives: August 2022

Metadata-enhanced contrastive learning from retinal optical coherence tomography images

Posted on August 5, 2022 by jarxiv

Abstract: Supervised deep learning algorithms, for medical image screening, monitoring, … Continue reading →

Categories: cs.CV

MVSFormer: Learning Robust Image Representations via Transformers and Temperature-based Depth for Multi-View Stereo

Posted on August 5, 2022 by jarxiv

Abstract: Feature representation learning is the key recipe for learning-based multi-view stereo (MVS). … Continue reading →

Categories: cs.CV

On the Connection between Local Attention and Dynamic Depth-wise Convolution

Posted on August 5, 2022 by jarxiv

Abstract: In visual recognition, Vision Transformer (ViT) … state-of-the-art … Continue reading →

Categories: cs.CV

Multi-modal volumetric concept activation to explain detection and classification of metastatic prostate cancer on PSMA-PET/CT

Posted on August 5, 2022 by jarxiv

Abstract: To analyze the behavior of neural networks, explainable artificial intelligence (XAI) … Continue reading →

Categories: cs.CV, eess.IV

Privacy-Preserving Image Classification Using ConvMixer with Adaptive Permutation Matrix

Posted on August 5, 2022 by jarxiv

Abstract: In this paper, using encrypted images with the ConvMixer structure, privacy … Continue reading →

Categories: cs.CR, cs.CV

Constructing Balance from Imbalance for Long-tailed Image Recognition

Posted on August 5, 2022 by jarxiv

Abstract: In long-tailed image recognition, between the majority (head) classes and the minority (tail) classes … Continue reading →

Categories: cs.CV, cs.LG

Glance and Focus Networks for Dynamic Visual Recognition

Posted on August 5, 2022 by jarxiv

Abstract: Spatial redundancy widely exists in visual recognition tasks; that is, images and video … Continue reading →

Categories: cs.AI, cs.CV, cs.LG

SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset

Posted on August 5, 2022 by jarxiv

Abstract: Over the past decade, multi-object tracking (MOT) … detection-related … Continue reading →

Categories: cs.CV

Surgical Skill Assessment via Video Semantic Aggregation

Posted on August 5, 2022 by jarxiv

Abstract: Automated video-based assessment of surgical skills, particularly for young … in resource-poor regions … Continue reading →

Categories: cs.CV

Semantic Interleaving Global Channel Attention for Multilabel Remote Sensing Image Classification

Posted on August 5, 2022 by jarxiv

Abstract: Research on multilabel remote sensing image classification (MLRSIC) has been actively … Continue reading →

Categories: cs.CV

