Monthly Archives: February 2023

DDM$^2$: Self-Supervised Diffusion MRI Denoising with Generative Diffusion Models

Posted: February 7, 2023 | Author: jarxiv

Abstract: Magnetic resonance imaging (MRI) is a common and life-saving medical imaging technique. …

Categories: cs.CV, eess.IV

RLSbench: Domain Adaptation Under Relaxed Label Shift

Posted: February 7, 2023 | Author: jarxiv

Abstract: Despite the emergence of principled methods for domain adaptation under label shift, …

Categories: cs.CV, cs.LG, stat.ML

SurgT: Soft-Tissue Tracking for Robotic Surgery, Benchmark and Challenge

Posted: February 7, 2023 | Author: jarxiv

Abstract: This paper covers the SurgT MICCAI 2022 challenge and its first results …

Categories: cs.CV, cs.RO, eess.IV

V1T: large-scale mouse V1 response prediction using a Vision Transformer

Posted: February 7, 2023 | Author: jarxiv

Abstract: In computational neuroscience, models that accurately predict the visual cortex's neural responses to visual stimuli …

Categories: cs.CV, cs.LG, cs.NE, q-bio.NC

AIM: Adapting Image Models for Efficient Video Action Recognition

Posted: February 7, 2023 | Author: jarxiv

Abstract: For recent video models based on vision transformers, the "image pre-training → fine-tuning" paradigm …

Categories: cs.CV

Zero-shot Image-to-Image Translation

Posted: February 7, 2023 | Author: jarxiv

Abstract: To synthesize diverse, high-quality images, large-scale text-to-image generative models have remarkable …

Categories: cs.CV, cs.GR, cs.LG

The Learnable Typewriter: A Generative Approach to Text Line Analysis

Posted: February 7, 2023 | Author: jarxiv

Abstract: We propose a generative, document-specific approach to character analysis and recognition in text lines. …

Categories: cs.CV

Show me your NFT and I tell you how it will perform: Multimodal representation learning for NFT selling price prediction

Posted: February 7, 2023 | Author: jarxiv

Abstract: Non-Fungible Tokens (NFTs), blockchain technology and …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.NE

Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment

Posted: February 6, 2023 | Author: jarxiv

Abstract: In recent years, large language models have been scaled up, and across a variety of text-based tasks …

Categories: cs.CL, cs.CV, cs.LG

Bridging the Emotional Semantic Gap via Multimodal Relevance Estimation

Posted: February 6, 2023 | Author: jarxiv

Abstract: Humans have rich means of expressing emotion, such as facial expressions, speech, and natural language. However, …

Categories: cs.AI, cs.CV
