Archive of posts by "jarxiv"

ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering

Posted: May 30, 2025 | Author: jarxiv

Abstract: With the emergence of large language model (LLM)-based agents, autonomous machine learning …

Categories: cs.AI, cs.CL, cs.LG

SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA

Posted: May 30, 2025 | Author: jarxiv

Abstract: Parameter-efficient fine-tuning (PEFT) methods, particularly low-rank adaptation (LOR …

Categories: cs.AI, cs.LG

Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time

Posted: May 30, 2025 | Author: jarxiv

Abstract: Aligning large language models with humans, given preference feedback's inherently multifaceted …

Categories: cs.AI, cs.CL

Keep Everyone Happy: Online Fair Division of Numerous Items with Few Copies

Posted: May 30, 2025 | Author: jarxiv

Abstract: In this paper, while satisfying fairness and efficiency constraints, the learner … the agents' …

Categories: cs.AI, cs.LG, stat.ML

Exposing the Impact of GenAI for Cybercrime: An Investigation into the Dark Side

Posted: May 30, 2025 | Author: jarxiv

Abstract: In recent years, the rapid advancement and democratization of generative AI models, particularly in cybersecurity …

Categories: cs.AI, cs.CY, cs.HC

ATLAS: Learning to Optimally Memorize the Context at Test Time

Posted: May 30, 2025 | Author: jarxiv

Abstract: Owing mainly to their effectiveness in in-context retrieval tasks and their ability to learn at scale, Transformers …

Categories: cs.AI, cs.CL

Comparative of Genetic Fuzzy regression techniques for aeroacoustic phenomenons

Posted: May 30, 2025 | Author: jarxiv

Abstract: This study addresses a key problem in aeroacoustics affecting aerospace, automotive, and drone app …

Categories: cs.AI, cs.NE

PhyX: Does Your Model Have the ‘Wits’ for Physical Reasoning?

Posted: May 30, 2025 | Author: jarxiv

Abstract: Existing benchmarks fail to capture an important aspect of intelligence. …

Categories: cs.AI

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Posted: May 30, 2025 | Author: jarxiv

Abstract: For evaluating the complex reasoning abilities of large language models (LLMs), theorem proving is a major …

Categories: cs.AI, cs.CL

Differential Information: An Information-Theoretic Perspective on Preference Optimization

Posted: May 30, 2025 | Author: jarxiv

Abstract: For aligning with human preferences in a supervised manner, Direct Preference Optimization (DPO) is a standard …

Categories: cs.AI, cs.CL, cs.LG
