投稿者「jarxiv」のアーカイブ

Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning

投稿日: 2025年5月26日作成者: jarxiv

要約さまざまな推論タスクにおける大規模な言語モデル（LLMS）の顕著な能力にも … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Handling Symbolic Language in Student Texts: A Comparative Study of NLP Embedding Models

投稿日: 2025年5月26日作成者: jarxiv

要約自然言語加工（NLP）の最近の進歩により、特にNLP埋め込みモデルの使用に … 続きを読む →

カテゴリー: cs.AI, cs.CL, physics.ed-ph | コメントを受け付けていません

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

投稿日: 2025年5月26日作成者: jarxiv

要約複雑なタスクのパフォーマンスを改善し、特に臨床応用のために、大規模な言語モ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Counting Cycles with Deepseek

投稿日: 2025年5月26日作成者: jarxiv

要約最近の進歩にもかかわらず、AIはまだ高度な数学に苦労しています。困難なオ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Are Large Language Models Reliable AI Scientists? Assessing Reverse-Engineering of Black-Box Systems

投稿日: 2025年5月26日作成者: jarxiv

要約 AIを使用して自律的な研究者を作成することは、科学的発見を加速する可能性が … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection

投稿日: 2025年5月26日作成者: jarxiv

要約テキストの異常検出は、自然言語処理タスクにおけるスパム、誤った情報、および … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

AVerImaTeC: A Dataset for Automatic Verification of Image-Text Claims with Evidence from the Web

投稿日: 2025年5月26日作成者: jarxiv

要約テキストの主張には、多くの場合、その信頼性を高め、ソーシャルメディアでの広 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation

投稿日: 2025年5月26日作成者: jarxiv

要約ビジュアルプログラミング言語（VPL）により、ユーザーはグラフィカルインタ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

The AI Gap: How Socioeconomic Status Affects Language Technology Interactions

投稿日: 2025年5月26日作成者: jarxiv

要約社会経済的地位（SES）は、大規模な言語モデル（LLM）のようなデジタルテ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective

投稿日: 2025年5月26日作成者: jarxiv

要約 VAPOフレームワークは、大規模な言語モデル（LLM）を使用した長いチェー … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning

Handling Symbolic Language in Student Texts: A Comparative Study of NLP Embedding Models

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Counting Cycles with Deepseek

Are Large Language Models Reliable AI Scientists? Assessing Reverse-Engineering of Black-Box Systems

TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection

AVerImaTeC: A Dataset for Automatic Verification of Image-Text Claims with Evidence from the Web

Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation

The AI Gap: How Socioeconomic Status Affects Language Technology Interactions

Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective

最近の投稿

最近のコメント

アーカイブ

カテゴリー