Monthly Archives: June 2023

Devil is in Channels: Contrastive Single Domain Generalization for Medical Image Segmentation

Posted on: June 9, 2023 Author: jarxiv

Abstract: Deep learning-based medical image segmentation models, at a new medical center, … Continue reading →

Categories: cs.CV

Unscented Autoencoder

Posted on: June 9, 2023 Author: jarxiv

Abstract: The variational autoencoder (VAE), deep generative modeling with latent variables … Continue reading →

Categories: cs.AI, cs.CV, cs.LG

EXOT: Exit-aware Object Tracker for Safe Robotic Manipulation of Moving Object

Posted on: June 9, 2023 Author: jarxiv

Abstract: Current robotic hand manipulation handles objects at predictable positions within limited environments … Continue reading →

Categories: cs.CV

On the Hidden Mystery of OCR in Large Multimodal Models

Posted on: June 9, 2023 Author: jarxiv

Abstract: Recently, in natural language processing and multimodal vision-language learning, large models have a major … Continue reading →

Categories: cs.CL, cs.CV

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy

Posted on: June 9, 2023 Author: jarxiv

Abstract: Across a wide range of multimodal tasks, contrastive learning, paired information (images and captions … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM

Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

Posted on: June 9, 2023 Author: jarxiv

Abstract: The advent of large pretrained models, in visual representation learning and natural language processing … Continue reading →

Categories: cs.CV, cs.LG

Out-of-domain GAN inversion via Invertibility Decomposition for Photo-Realistic Human Face Manipulation

Posted on: June 9, 2023 Author: jarxiv

Abstract: Generative Adversarial Networks (GAN) … Continue reading →

Categories: cs.CV

Connectional-Style-Guided Contextual Representation Learning for Brain Disease Diagnosis

Posted on: June 9, 2023 Author: jarxiv

Abstract: Structural magnetic resonance imaging (sMRI) has shown great clinical value, and deep learning … Continue reading →

Categories: cs.CV, eess.IV

One does not fit all! On the Complementarity of Vision Encoders for Vision and Language Tasks

Posted on: June 9, 2023 Author: jarxiv

Abstract: Current multimodal models, for solving vision and language (V+L) tasks, … Continue reading →

Categories: cs.CL, cs.CV

Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields

Posted on: June 9, 2023 Author: jarxiv

Abstract: The quality of 3D reconstruction, virtual reality (VR) and augmented reality (AR) technology … Continue reading →

Categories: cs.CV
