月別アーカイブ: 2025年5月

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

投稿日: 2025年5月7日作成者: jarxiv

要約 LLMベースのエージェントは、複雑なコードベース内でコードを生成および管理 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

HAIR: Hardness-Aware Inverse Reinforcement Learning with Introspective Reasoning for LLM Alignment

投稿日: 2025年5月7日作成者: jarxiv

要約大規模な言語モデル（LLMS）と人間の価値の調整は、重要なものであるが、4 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game

投稿日: 2025年5月7日作成者: jarxiv

要約 Story2Gameを紹介します。これは、大規模な言語モデルを使用して、ス … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning

投稿日: 2025年5月7日作成者: jarxiv

要約大規模な言語モデル（LLM）からの一貫性のない出力と幻覚は、信頼できるAI … 続きを読む →

カテゴリー: cs.AI, cs.DC | コメントを受け付けていません

Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation Using Vision Language Models

投稿日: 2025年5月7日作成者: jarxiv

要約視覚ターゲットナビゲーションは、未知の環境、特に人間とロボットの相互作用シ … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Rapid AI-based generation of coverage paths for dispensing applications

投稿日: 2025年5月7日作成者: jarxiv

要約カバレッジパスサーマルインターフェイス材料（TIM）の計画は、電子電子機器 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Ergodic Generative Flows

投稿日: 2025年5月7日作成者: jarxiv

要約生成フローネットワーク（GFN）は、正規化されていない分布密度からサンプリ … 続きを読む →

カテゴリー: 37A25, 68Q87, 68T07, 68T99, 68W20, cs.AI, cs.LG, math.DG, math.DS | コメントを受け付けていません

The Adaptive Arms Race: Redefining Robustness in AI Security

投稿日: 2025年5月7日作成者: jarxiv

要約それらを堅牢にするためのかなりの努力にもかかわらず、現実世界のAIベースの … 続きを読む →

カテゴリー: cs.AI, cs.CR | コメントを受け付けていません

OSUniverse: Benchmark for Multimodal GUI-navigation AI Agents

投稿日: 2025年5月7日作成者: jarxiv

要約このホワイトペーパーでは、Osuniverseを紹介します。これは、使いや … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

LlamaFirewall: An open source guardrail system for building secure AI agents

投稿日: 2025年5月7日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、シンプルなチャットボットから、本番コード … 続きを読む →

カテゴリー: cs.AI, cs.CR | コメントを受け付けていません

月別アーカイブ: 2025年5月

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

HAIR: Hardness-Aware Inverse Reinforcement Learning with Introspective Reasoning for LLM Alignment

STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game

A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning

Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation Using Vision Language Models

Rapid AI-based generation of coverage paths for dispensing applications

Ergodic Generative Flows

The Adaptive Arms Race: Redefining Robustness in AI Security

OSUniverse: Benchmark for Multimodal GUI-navigation AI Agents

LlamaFirewall: An open source guardrail system for building secure AI agents

最近の投稿

最近のコメント

アーカイブ

カテゴリー