月別アーカイブ: 2025年2月

BioMaze: Benchmarking and Enhancing Large Language Models for Biological Pathway Reasoning

投稿日: 2025年2月28日作成者: jarxiv

要約さまざまな生物学的領域における大規模な言語モデル（LLM）の応用が最近検討 … 続きを読む →

カテゴリー: cs.AI, cs.LG, q-bio.QM | コメントを受け付けていません

LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction

投稿日: 2025年2月28日作成者: jarxiv

要約数学的推論は、幻覚のために大規模な言語モデル（LLMS）にとって重要な課題 … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

An exploration of features to improve the generalisability of fake news detection models

投稿日: 2025年2月28日作成者: jarxiv

要約偽のニュースは、選挙に影響を与え、誤った情報を広め、検出を重要にすることに … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants

投稿日: 2025年2月28日作成者: jarxiv

要約最近の進歩により、AI、特に大規模な言語モデル（LLMS）が科学的研究のた … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Building reliable sim driving agents by scaling self-play

投稿日: 2025年2月28日作成者: jarxiv

要約シミュレーションエージェントは、自律車両（AVS）などの人間と相互作用する … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

LangProBe: a Language Programs Benchmark

投稿日: 2025年2月28日作成者: jarxiv

要約言語モデル（LMS）をマルチステップ言語プログラムに作成し、モジュラープロ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR, cs.LG | コメントを受け付けていません

Mixture of Structural-and-Textual Retrieval over Text-rich Graph Knowledge Bases

投稿日: 2025年2月28日作成者: jarxiv

要約テキストが豊富なグラフ知識ベース（TG-KBS）は、テキストおよび構造 … 続きを読む →

カテゴリー: cs.AI, cs.IR, cs.LG | コメントを受け付けていません

Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

投稿日: 2025年2月28日作成者: jarxiv

要約私たちは、次の迅速な指定ルールから大規模な言語モデル（LLMS）を破壊する … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG | コメントを受け付けていません

Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

投稿日: 2025年2月28日作成者: jarxiv

要約このペーパーでは、グローバルナビゲーション衛星システム（GNSS）デニード … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

投稿日: 2025年2月28日作成者: jarxiv

要約最近の多くの研究では、大規模な言語モデルにおける緊急の推論能力の証拠が発見 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

月別アーカイブ: 2025年2月

BioMaze: Benchmarking and Enhancing Large Language Models for Biological Pathway Reasoning

LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction

An exploration of features to improve the generalisability of fake news detection models

EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants

Building reliable sim driving agents by scaling self-play

LangProBe: a Language Programs Benchmark

Mixture of Structural-and-Textual Retrieval over Text-rich Graph Knowledge Bases

Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー