投稿者「jarxiv」のアーカイブ

Prefix-Tuning+: Modernizing Prefix-Tuning through Attention Independent Prefix Data

投稿日: 2025年6月17日作成者: jarxiv

要約パラメーター効率の高い微調整（PEFT）メソッドは、大規模な言語モデル（L … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Meta-learning how to Share Credit among Macro-Actions

投稿日: 2025年6月17日作成者: jarxiv

要約強化学習の探査を改善するための提案されているメカニズムの1つは、マクロアク … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems

投稿日: 2025年6月17日作成者: jarxiv

要約大規模な言語モデルの進歩により、多くのダイアログシステムは現在、患者の病状 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records

投稿日: 2025年6月17日作成者: jarxiv

要約財団モデルは、ダウンストリームタスクとは無関係に意味のある表現を抽出する能 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Distinguishing Autonomous AI Agents from Collaborative Agentic Systems: A Comprehensive Framework for Understanding Modern Intelligent Architectures

投稿日: 2025年6月17日作成者: jarxiv

要約大規模な言語モデルの出現により、人工知能における2つの明確で相互接続された … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Value-Free Policy Optimization via Reward Partitioning

投稿日: 2025年6月17日作成者: jarxiv

要約単一の操作補強学習（RL）メソッドは、スカラーリワードが直接利用可能な（プ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

投稿日: 2025年6月17日作成者: jarxiv

要約時系列の推論は、動的な時間的パターン、曖昧なセマンティクス、および時間的前 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Contrastive Self-Supervised Learning As Neural Manifold Packing

投稿日: 2025年6月17日作成者: jarxiv

要約ポイントごとの比較に基づいた対照的な自己監視学習は、ビジョンタスクのために … 続きを読む →

カテゴリー: cs.AI, cs.LG, q-bio.NC, stat.ML | コメントを受け付けていません

Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models

投稿日: 2025年6月17日作成者: jarxiv

要約高度な推論機能の導入により、特に数学とコーディングベンチマークでの大規模な … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.LG | コメントを受け付けていません

Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs

投稿日: 2025年6月17日作成者: jarxiv

要約大規模な言語モデル（LLM）は、多くの現代的なAIアプリケーションの中心で … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Prefix-Tuning+: Modernizing Prefix-Tuning through Attention Independent Prefix Data

Meta-learning how to Share Credit among Macro-Actions

Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems

FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records

Distinguishing Autonomous AI Agents from Collaborative Agentic Systems: A Comprehensive Framework for Understanding Modern Intelligent Architectures

Value-Free Policy Optimization via Reward Partitioning

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

Contrastive Self-Supervised Learning As Neural Manifold Packing

Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models

Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs

最近の投稿

最近のコメント

アーカイブ

カテゴリー