月別アーカイブ: 2023年8月

Evaluating the Quality and Diversity of DCGAN-based Generatively Synthesized Diabetic Retinopathy Imagery

投稿日: 2023年8月31日作成者: jarxiv

要約公的に利用可能な糖尿病性網膜症 (DR) データセットは不均衡であり、DR … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

BinaryViT: Towards Efficient and Accurate Binary Vision Transformers

投稿日: 2023年8月31日作成者: jarxiv

要約ビジョントランスフォーマー (ViT) は、ほとんどのコンピュータービ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers

投稿日: 2023年8月31日作成者: jarxiv

要約この論文では、エンコーダ/デコーダフレームワークを使用したセマンティック … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Topology-aware MLP for Skeleton-based Action Recognition

投稿日: 2023年8月31日作成者: jarxiv

要約グラフ畳み込みネットワーク (GCN) は、スケルトンベースのアクション認 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing

投稿日: 2023年8月31日作成者: jarxiv

要約近年、Transformer ネットワークは、そのグローバルな受容野と入力 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

How Good is Google Bard’s Visual Understanding? An Empirical Study on Open Challenges

投稿日: 2023年8月31日作成者: jarxiv

要約 Google の Bard は、会話型 AI の分野で OpenAI の … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Nonrigid Object Contact Estimation With Regional Unwrapping Transformer

投稿日: 2023年8月31日作成者: jarxiv

要約手と非剛体物体との間の接触パターンを取得することは、視覚およびロボット工学 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Fault Localization for Buggy Deep Learning Framework Conversions in Image Recognition

投稿日: 2023年8月31日作成者: jarxiv

要約ディープニューラルネットワーク (DNN) をデプロイする場合、開発者 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.SE, cs.SY, eess.SY | コメントを受け付けていません

Discriminator-free Unsupervised Domain Adaptation for Multi-label Image Classification

投稿日: 2023年8月31日作成者: jarxiv

要約この論文では、DDA-MLIC と呼ばれるマルチラベル画像分類 (MLIC … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications

投稿日: 2023年8月31日作成者: jarxiv

要約ディープラーニングとコンピュータービジョンの最近の進歩により、現実的なトー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2023年8月

Evaluating the Quality and Diversity of DCGAN-based Generatively Synthesized Diabetic Retinopathy Imagery

BinaryViT: Towards Efficient and Accurate Binary Vision Transformers

SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers

Topology-aware MLP for Skeleton-based Action Recognition

MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing

How Good is Google Bard’s Visual Understanding? An Empirical Study on Open Challenges

Nonrigid Object Contact Estimation With Regional Unwrapping Transformer

Fault Localization for Buggy Deep Learning Framework Conversions in Image Recognition

Discriminator-free Unsupervised Domain Adaptation for Multi-label Image Classification

From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications

最近の投稿

最近のコメント

アーカイブ

カテゴリー