「cs.LG」カテゴリーアーカイブ

Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models

投稿日: 2024年10月11日作成者: jarxiv

要約 CLIP のような対照的視覚言語モデル (VLM) は、さまざまな下流タス … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Sparse Repellency for Shielded Generation in Text-to-image Diffusion Models

投稿日: 2024年10月11日作成者: jarxiv

要約テキストから画像への生成における拡散モデルの採用の増加により、その信頼性に … 続きを読む →

カテゴリー: cs.CV, cs.LG, stat.ML | コメントを受け付けていません

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

投稿日: 2024年10月11日作成者: jarxiv

要約離散拡散モデルは、画像生成やマスクされた言語モデリングなどのタスクでは成功 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

投稿日: 2024年10月11日作成者: jarxiv

要約この論文では、身体化された AI における 3D 空間認識の重要性を強調す … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

投稿日: 2024年10月11日作成者: jarxiv

要約現在の大規模マルチモーダルモデル (LMM) は、モデルが言語コンポーネ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and Beyond

投稿日: 2024年10月11日作成者: jarxiv

要約近年、トレーニングデータアトリビューション (TDA) 手法が、ニュー … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions

投稿日: 2024年10月10日作成者: jarxiv

要約悪天候は、LiDAR やカメラなどのセンサーに影響を与え、自動運転車 (A … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

M${}^{3}$Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes

投稿日: 2024年10月10日作成者: jarxiv

要約我々は、モバイル操作タスクのための全身動作生成のための新しいベンチマークで … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering

投稿日: 2024年10月10日作成者: jarxiv

要約この論文では、モデルフリーの安全強化学習 (RL) における安全制約と過大 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Gaitor: Learning a Unified Representation Across Gaits for Real-World Quadruped Locomotion

投稿日: 2024年10月10日作成者: jarxiv

要約現在の最先端の四足歩行は、さまざまな複雑な動きを生み出すことができます。 … 続きを読む →

カテゴリー: cs.LG, cs.RO | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models

Sparse Repellency for Shielded Generation in Text-to-image Diffusion Models

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and Beyond

Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions

M${}^{3}$Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes

A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering

Gaitor: Learning a Unified Representation Across Gaits for Real-World Quadruped Locomotion

最近の投稿

最近のコメント

アーカイブ

カテゴリー