Category archive: cs.LG

Augmentation-aware Self-supervised Learning with Conditioned Projector

Posted on October 16, 2024 by jarxiv

Summary: Self-supervised learning (SSL), for learning from unlabeled data, is a powerful …

Categories: cs.CV, cs.LG

MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Posted on October 16, 2024 by jarxiv

Summary: Multimodal large language models (MLLMs) frequently exhibit hallucinations, but …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM

OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation

Posted on October 16, 2024 by jarxiv

Summary: By imitating a single video demonstration, we … humanoid robot manipulation skills …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise

Posted on October 16, 2024 by jarxiv

Summary: The robustness of supervised deep-learning-based medical image classification is, owing to label noise, significantly …

Categories: cs.CV, cs.LG

Improving Long-Text Alignment for Text-to-Image Diffusion Models

Posted on October 16, 2024 by jarxiv

Summary: With the rapid progress of text-to-image (T2I) diffusion models, given …

Categories: cs.CV, cs.LG, cs.MM

MoH: Multi-Head Attention as Mixture-of-Head Attention

Posted on October 16, 2024 by jarxiv

Summary: In this work, multi-head attention, the core of the Transformer model, …

Categories: cs.AI, cs.CV, cs.LG

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Posted on October 16, 2024 by jarxiv

Summary: For multimodal video understanding and generation, understanding fine-grained temporal dynamics …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

MoVEInt: Mixture of Variational Experts for Learning Human-Robot Interactions from Demonstrations

Posted on October 15, 2024 by jarxiv

Summary: Shared dynamics models, human-robot interaction (HR …

Categories: cs.HC, cs.LG, cs.RO

Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions

Posted on October 15, 2024 by jarxiv

Summary: With recent advances in model-based reinforcement learning (MBRL), MBRL … visual …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Input-to-State Stable Coupled Oscillator Networks for Closed-form Model-based Control in Latent Space

Posted on October 15, 2024 by jarxiv

Summary: Although various methods have been proposed in the literature, efficient and effective … of physical systems …

Categories: cs.AI, cs.LG, cs.RO, cs.SY, eess.SY
