Archive of posts by "jarxiv"

IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning

Posted: May 16, 2025 | Author: jarxiv

Summary: For robot policy learning, imitation learning (IL) and reinforcement learning (RL) each …

Categories: cs.AI, cs.RO

Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations?

Posted: May 16, 2025 | Author: jarxiv

Summary: Understanding the reasoning and robustness of large language models (LLMs) is, for programming …

Categories: cs.AI, cs.SE

Superposition Yields Robust Neural Scaling

Posted: May 16, 2025 | Author: jarxiv

Summary: The success of today's large language models (LLMs) … the performance of larger models …

Categories: cs.AI, cs.CL, cs.LG

AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges

Posted: May 16, 2025 | Author: jarxiv

Summary: This study critically distinguishes AI Agents from Agentic AI, and structured …

Categories: cs.AI

Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction

Posted: May 16, 2025 | Author: jarxiv

Summary: In this study, via the Split Conformal Prediction (SCP) framework, …

Categories: cs.AI, cs.CL, cs.LG

Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps

Posted: May 16, 2025 | Author: jarxiv

Summary: Widely adopted in decision-making scenarios such as robotics, games, and autonomous driving, …

Categories: cs.AI, cs.LG

Benchmarking Generative AI for Scoring Medical Student Interviews in Objective Structured Clinical Examinations (OSCEs)

Posted: May 16, 2025 | Author: jarxiv

Summary: Objective Structured Clinical Examinations (OSCEs) … medical students' communication …

Categories: cs.AI, cs.CL

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

Posted: May 16, 2025 | Author: jarxiv

Summary: In this study, attention-based information flow within large language models (LLMs) …

Categories: cs.AI, cs.CL

PnPXAI: A Universal XAI Framework Providing Automatic Explanations Across Diverse Modalities and Models

Posted: May 16, 2025 | Author: jarxiv

Summary: Recently, … increasing model transparency by attributing a model's outputs to its input features …

Categories: cs.AI, cs.LG

Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation

Posted: May 16, 2025 | Author: jarxiv

Summary: Reinforcement learning (RL) has shown remarkable potential in robotic manipulation, but …

Categories: cs.AI, cs.LG, cs.RO

