Monthly Archive: April 2025

CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation

Posted: April 15, 2025 | Author: jarxiv

Summary: Large language models (LLMs), with their fluent interactive capabilities and …

Categories: cs.CL

Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

Posted: April 15, 2025 | Author: jarxiv

Summary: For checkboxes, the presence or absence of ticks directly informs data extraction and decision-making processes …

Categories: cs.CL

Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Posted: April 15, 2025 | Author: jarxiv

Summary: With large language models (LLMs), even for humans, whether a text was generated by another human …

Categories: cs.CL, cs.CY

MDIT: A Model-free Data Interpolation Method for Diverse Instruction Tuning

Posted: April 15, 2025 | Author: jarxiv

Summary: As large language models (LLMs) are increasingly applied to a wide range of tasks, …

Categories: cs.CL

SuperBPE: Space Travel for Language Models

Posted: April 15, 2025 | Author: jarxiv

Summary: An assumption shared by nearly all language model (LM) tokenization schemes is that tokens …

Categories: cs.CL, cs.LG

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Posted: April 15, 2025 | Author: jarxiv

Summary: Following OpenAI's release of the o1 model, adopting slow-thinking strategies …

Categories: cs.CL

The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance

Posted: April 15, 2025 | Author: jarxiv

Summary: Multimodal large language models (MLLMs): how machines handle text, images, code …

Categories: cs.AI, cs.CL, cs.ET

Command A: An Enterprise-Ready Large Language Model

Posted: April 15, 2025 | Author: jarxiv

Summary: This report describes the development of Command A. Command A …

Categories: cs.AI, cs.CL, cs.LG

LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks

Posted: April 15, 2025 | Author: jarxiv

Summary: Large language model unlearning, while preserving general utility, undesired …

Categories: cs.AI, cs.CL, cs.LG

Deep Reasoning Translation via Reinforcement Learning

Posted: April 15, 2025 | Author: jarxiv

Summary: Recently, deep reasoning LLMs (e.g., OpenAI o1/o3 and DeepSeek- …

Categories: cs.AI, cs.CL
