Monthly Archives: May 2025

Language Models Optimized to Fool Detectors Still Have a Distinct Style (And How to Change It)

Posted on: May 21, 2025 | Author: jarxiv

Abstract: Despite considerable progress in the development of machine-text detectors, the problem is inherently … Continue reading →

Categories: cs.AI, cs.CL, cs.LG

SATBench: Benchmarking LLMs’ Logical Reasoning via Automated Puzzle Generation from SAT Formulas

Posted on: May 21, 2025 | Author: jarxiv

Abstract: We introduce SATBench, which is derived from Boolean satisfiability (SAT) problems … Continue reading →

Categories: cs.AI, cs.CL, cs.LG, cs.LO

TiEBe: Tracking Language Model Recall of Notable Worldwide Events Through Time

Posted on: May 21, 2025 | Author: jarxiv

Abstract: As the knowledge landscape evolves and large language models (LLMs) become increasingly widespread … Continue reading →

Categories: cs.AI, cs.CL

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Posted on: May 21, 2025 | Author: jarxiv

Abstract: Reinforcement learning (RL), by optimizing policies with reward signals, … Continue reading →

Categories: cs.AI, cs.CL, cs.LG

Debating for Better Reasoning: An Unsupervised Multimodal Approach

Posted on: May 21, 2025 | Author: jarxiv

Abstract: With large language models (LLMs) and their expertise across diverse domains and modalities … Continue reading →

Categories: cs.AI, cs.CL

Diffusion-Based Failure Sampling for Evaluating Safety-Critical Autonomous Systems

Posted on: May 21, 2025 | Author: jarxiv

Abstract: Validating safety-critical autonomous systems in high-dimensional domains such as robotics … Continue reading →

Categories: cs.AI, cs.RO, cs.SY, eess.SY

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

Posted on: May 21, 2025 | Author: jarxiv

Abstract: Detecting AI risks, as more powerful models emerge and evade these detection attempts, … Continue reading →

Categories: cs.AI, cs.CL, cs.CY, cs.HC, cs.LG

Improving Medium Range Severe Weather Prediction through Transformer Post-processing of AI Weather Forecasts

Posted on: May 21, 2025 | Author: jarxiv

Abstract: Improving the skill of medium-range (1 to 8 day) severe weather prediction, with respect to societal impact, … Continue reading →

Categories: cs.AI, cs.LG, physics.ao-ph

Cost-Augmented Monte Carlo Tree Search for LLM-Assisted Planning

Posted on: May 21, 2025 | Author: jarxiv

Abstract: LLMs excel at open-ended reasoning, but they often, in cost-sensitive … Continue reading →

Categories: cs.AI

Explainable AI for Securing Healthcare in IoT-Integrated 6G Wireless Networks

Posted on: May 21, 2025 | Author: jarxiv

Abstract: Healthcare systems, with advanced wireless networks and connected devices, … Continue reading →

Categories: cs.AI, cs.LG
