Monthly Archive: September 2022

Contrastive Unsupervised Learning of World Model with Invariant Causal Features

Posted on September 30, 2022 by jarxiv

Summary: In this paper, we present a world model that learns causal features using the invariance principle …

Categories: cs.AI, cs.CV, cs.LG, cs.RO, stat.ML

EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual and Language Learning

Posted on September 30, 2022 by jarxiv

Summary: 3D visual grounding, with rich semantic components …

Categories: cs.CV, cs.RO

EiHi Net: Out-of-Distribution Generalization Paradigm

Posted on September 30, 2022 by jarxiv

Summary: In this paper, to solve the out-of-distribution (OoD) generalization problem in deep learning …

Categories: cs.AI, cs.CV

DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment

Posted on September 30, 2022 by jarxiv

Summary: Direct methods have shown excellent performance in visual odometry and SLAM applications …

Categories: cs.CV

DreamFusion: Text-to-3D using 2D Diffusion

Posted on September 30, 2022 by jarxiv

Summary: Recent breakthroughs in text-to-image synthesis, billions of images and text …

Categories: cs.CV, cs.LG, stat.ML

REST: REtrieve & Self-Train for generative action recognition

Posted on September 30, 2022 by jarxiv

Summary: This work is on training a generative action/video recognition model …

Categories: cs.AI, cs.CV, cs.LG

Dilated Neighborhood Attention Transformer

Posted on September 30, 2022 by jarxiv

Summary: Transformers are most frequently applied across modalities, domains, and tasks …

Categories: cs.AI, cs.CV, cs.LG

Effective Vision Transformer Training: A Data-Centric Perspective

Posted on September 30, 2022 by jarxiv

Summary: Vision Transformers (ViT), convolutional neural networks …

Categories: cs.CV

Understanding Collapse in Non-Contrastive Learning

Posted on September 30, 2022 by jarxiv

Summary: With contrastive methods, the performance of self-supervised representation learning (SSL) has recently …

Categories: cs.AI, cs.CV, cs.LG, cs.NE, cs.RO

Training Strategies for Improved Lip-reading

Posted on September 30, 2022 by jarxiv

Summary: In a series of independent studies, several training strategies and temporal models for isolated word …

Categories: cs.CV, cs.LG
