"cs.CL" Category Archive

Task Memory Engine (TME): A Structured Memory Framework with Graph-Aware Extensions for Multi-Step LLM Agent Tasks

Posted: April 15, 2025 | Author: jarxiv

Summary: Large language models (LLMs) as autonomous agents for multi-step tasks …

Categories: 68T05, cs.AI, cs.CL, H.3.3

Large language models could be rote learners

Posted: April 15, 2025 | Author: jarxiv

Summary: Multiple-choice question (MCQ) benchmarks for the evaluation of large language models (LLMs) …

Categories: cs.AI, cs.CL

Out of Style: RAG’s Fragility to Linguistic Variation

Posted: April 14, 2025 | Author: jarxiv

Summary: Across various NLP benchmarks, the impressive … of retrieval-augmented generation (RAG) systems …

Categories: cs.CL

Humanity’s Last Exam

Posted: April 14, 2025 | Author: jarxiv

Summary: Benchmarks for tracking the rapid progress of large language model (LLM) capabilities …

Categories: cs.AI, cs.CL, cs.LG

IFShip: Interpretable Fine-grained Ship Classification with Domain Knowledge-Enhanced Vision-Language Models

Posted: April 14, 2025 | Author: jarxiv

Summary: End-to-end interpretation currently, in remote sensing fine-grained ship classification (RS-FG …

Categories: cs.CL

Millions of States: Designing a Scalable MoE Architecture with RWKV-7 Meta-learner

Posted: April 14, 2025 | Author: jarxiv

Summary: State-based sequence models such as RWKV-7 are the Transformer architecture's …

Categories: cs.CL, cs.LG

MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula

Posted: April 14, 2025 | Author: jarxiv

Summary: Various academic and professional settings, such as mathematics lectures and research presentations …

Categories: cs.AI, cs.CL

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Posted: April 14, 2025 | Author: jarxiv

Summary: Multimodal large language models (MLLMs) have shown significant progress, and embodied …

Categories: cs.CL, cs.CV

Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare

Posted: April 14, 2025 | Author: jarxiv

Summary: Generative agents, driven by large language models (LLMs), in silico …

Categories: cs.CL

VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering

Posted: April 14, 2025 | Author: jarxiv

Summary: The increasing availability of multimodal data across text, tables, and images, complex …

Categories: cs.CL, cs.CV
