月別アーカイブ: 2024年4月

Self-supervised Dataset Distillation: A Good Compression Is All You Need

投稿日: 2024年4月12日作成者: jarxiv

要約データセットの蒸留は、元のデータの情報の本質を最大限に保持しながら、大規模 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Gaga: Group Any Gaussians via 3D-aware Memory Bank

投稿日: 2024年4月12日作成者: jarxiv

要約ゼロショットセグメンテーションモデルによって予測された一貫性のない 2 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Representation Learning

投稿日: 2024年4月12日作成者: jarxiv

要約 CLIP のような対照的な視覚言語モデルは、さまざまな下流タスクで多用途に … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

View Selection for 3D Captioning via Diffusion Ranking

投稿日: 2024年4月12日作成者: jarxiv

要約スケーラブルなアノテーションアプローチは、広範な 3D テキストデータ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

WaveMo: Learning Wavefront Modulations to See Through Scattering

投稿日: 2024年4月12日作成者: jarxiv

要約散乱媒体を介したイメージングは、医療診断から天文学に至るまでの分野にお … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

投稿日: 2024年4月12日作成者: jarxiv

要約テキストから画像への拡散モデルの制御性を高めるために、ControlNet … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

QuasiSim: Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer

投稿日: 2024年4月12日作成者: jarxiv

要約私たちはシミュレータを設計することによって、器用な操作の伝達の問題を調査し … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.RO | コメントを受け付けていません

Supervised Fine-tuning in turn Improves Visual Foundation Models

投稿日: 2024年4月12日作成者: jarxiv

要約近年、CLIP のような画像テキストトレーニングが視覚基礎モデルの事前ト … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding

投稿日: 2024年4月12日作成者: jarxiv

要約最近、大規模な基礎モデルが注目を集めており、広範なシナリオで優れたパフォー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

投稿日: 2024年4月12日作成者: jarxiv

要約テキストから画像への生成モデルはますます人気が高まっており、一般の人々が利 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年4月

Self-supervised Dataset Distillation: A Good Compression Is All You Need

Gaga: Group Any Gaussians via 3D-aware Memory Bank

Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Representation Learning

View Selection for 3D Captioning via Diffusion Ranking

WaveMo: Learning Wavefront Modulations to See Through Scattering

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

QuasiSim: Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer

Supervised Fine-tuning in turn Improves Visual Foundation Models

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー