「cs.LG」カテゴリーアーカイブ

Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation

投稿日: 2025年4月1日作成者: jarxiv

要約大規模な現実世界のロボットデータセットは、ジェネラリストのロボットモデルを … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Which LIME should I trust? Concepts, Challenges, and Solutions

投稿日: 2025年4月1日作成者: jarxiv

要約ニューラルネットワークが必須システムで支配的になるにつれて、説明可能な人工 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Effectively Controlling Reasoning Models through Thinking Intervention

投稿日: 2025年4月1日作成者: jarxiv

要約推論強化された大手言語モデル（LLMS）は、最終回答を生成する前に中間推論 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

投稿日: 2025年4月1日作成者: jarxiv

要約アクションの前に推論し、潜在的な結果（つまり、世界モデル）を想像することは … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

From Colors to Classes: Emergence of Concepts in Vision Transformers

投稿日: 2025年4月1日作成者: jarxiv

要約ビジョントランス（VITS）は、強力な表現能力により、さまざまなコンピュー … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration

投稿日: 2025年4月1日作成者: jarxiv

要約画像登録は医療イメージングの基本であり、診断、治療計画、画像誘導治療、また … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data

投稿日: 2025年4月1日作成者: jarxiv

要約プラトニック表現仮説は、モデルとデータセットのサイズが増加するにつれて、ビ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

投稿日: 2025年4月1日作成者: jarxiv

要約離散拡散モデルは、画像生成やマスクされた言語モデリングなどのタスクで成功を … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing

投稿日: 2025年4月1日作成者: jarxiv

要約リモートセンシング（RS）画像のオブジェクト検出（OD）と視覚的接地（VG … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

The impact of internal variability on benchmarking deep learning climate emulators

投稿日: 2025年4月1日作成者: jarxiv

要約完全複雑さのアースシステムモデル（ESM）は計算的に非常に高価であり、複数 … 続きを読む →

カテゴリー: cs.AI, cs.CE, cs.CV, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation

Which LIME should I trust? Concepts, Challenges, and Solutions

Effectively Controlling Reasoning Models through Thinking Intervention

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

From Colors to Classes: Emergence of Concepts in Vision Transformers

IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration

It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing

The impact of internal variability on benchmarking deep learning climate emulators

最近の投稿

最近のコメント

アーカイブ

カテゴリー