"cs.LG" Category Archive

Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap

Posted: June 11, 2025 | Author: jarxiv

Summary: Domain generalization (DG), under the assumption of a shared label space, one or more source …

Categories: cs.CV, cs.LG

Segment Concealed Objects with Incomplete Supervision

Posted: June 11, 2025 | Author: jarxiv

Summary: Incompletely supervised concealed object segmentation (ISCOS) …

Categories: cs.AI, cs.CV, cs.LG

Data Augmentation For Small Object using Fast AutoAugment

Posted: June 11, 2025 | Author: jarxiv

Summary: In recent years, object detection performance has advanced significantly. However, …

Categories: cs.CV, cs.LG

Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models

Posted: June 11, 2025 | Author: jarxiv

Summary: With medical vision-language alignment through cross-modal contrastive learning, retrieval and …

Categories: cs.AI, cs.CV, cs.LG

DIsoN: Decentralized Isolation Networks for Out-of-Distribution Detection in Medical Imaging

Posted: June 11, 2025 | Author: jarxiv

Summary: Machine learning (ML) models in safety-critical domains such as medical imaging …

Categories: cs.CV, cs.LG, I.2.0

Diffuse and Disperse: Image Generation with Representation Regularization

Posted: June 11, 2025 | Author: jarxiv

Summary: The development of diffusion-based generative models over the past decade, independently of progress in representation learning, …

Categories: cs.AI, cs.CV, cs.LG

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

Posted: June 11, 2025 | Author: jarxiv

Summary: In the current paradigm of test-time scaling, before a response is generated, long reasoning …

Categories: cs.AI, cs.LG

From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models

Posted: June 11, 2025 | Author: jarxiv

Summary: Our aim is, with low-level skills and a small number of short-horizon demonstrations containing sequences of images, …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models

Posted: June 11, 2025 | Author: jarxiv

Summary: Recent advances in large language models (LLMs) and vision-language models (VLMs) …

Categories: cs.AI, cs.LG, cs.RO

Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy

Posted: June 10, 2025 | Author: jarxiv

Summary: Data-driven methods such as reinforcement learning and imitation learning have, in robot autonomy, achieved remarkable …

Categories: cs.AI, cs.CV, cs.LG, cs.RO
