投稿者「jarxiv」のアーカイブ

None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks

投稿日: 2025年5月13日作成者: jarxiv

要約 LLMの評価では、数学指向の質問に数値のバリエーションを実行することにより … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Comparative sentiment analysis of public perception: Monkeypox vs. COVID-19 behavioral insights

投稿日: 2025年5月13日作成者: jarxiv

要約 Covid-19やMonkeypox（MPox）などの世界的な健康危機の出 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

投稿日: 2025年5月13日作成者: jarxiv

要約大規模な言語モデル（LLMS）の顕著な成果は、多くの人が知性の形を示すと結 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Matching Tasks with Industry Groups for Augmenting Commonsense Knowledge

投稿日: 2025年5月13日作成者: jarxiv

要約常識的な知識ベース（KB）は、機械学習アプリケーションを改善するために広く … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems

投稿日: 2025年5月13日作成者: jarxiv

要約人工知能（AI）システムは、自律車両から生体認証まで、ますます重要なアプリ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.CV | コメントを受け付けていません

A Survey on Collaborative Mechanisms Between Large and Small Language Models

投稿日: 2025年5月13日作成者: jarxiv

要約大規模な言語モデル（LLM）は強力なAI機能を提供しますが、リソースコスト … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Translating the Grievance Dictionary: a psychometric evaluation of Dutch, German, and Italian versions

投稿日: 2025年5月13日作成者: jarxiv

要約このペーパーでは、暴力的、脅迫的、または苦情処理されたテキストの分析のため … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution

投稿日: 2025年5月13日作成者: jarxiv

要約大規模な言語モデル（LLM）のツール使用機能により、最新の外部情報にアクセ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective

投稿日: 2025年5月13日作成者: jarxiv

要約大規模な言語モデル（LLM）は、数学的推論で顕著な進歩を遂げていますが、多 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion

投稿日: 2025年5月13日作成者: jarxiv

要約検索された生成（RAG）モデルは、外部情報を内部のパラメトリック知識と統合 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks

Comparative sentiment analysis of public perception: Monkeypox vs. COVID-19 behavioral insights

I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

Matching Tasks with Industry Groups for Augmenting Commonsense Knowledge

Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems

A Survey on Collaborative Mechanisms Between Large and Small Language Models

Translating the Grievance Dictionary: a psychometric evaluation of Dutch, German, and Italian versions

ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution

Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective

SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion

最近の投稿

最近のコメント

アーカイブ

カテゴリー