「cs.CL」カテゴリーアーカイブ

ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

投稿日: 2025年5月16日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、複雑な評価ベンチマークで印象的な機能を実 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.7 | コメントを受け付けていません

Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models

投稿日: 2025年5月16日作成者: jarxiv

要約大規模な言語モデルは静的ベンチマークで印象的なパフォーマンスを示しています … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation

投稿日: 2025年5月16日作成者: jarxiv

要約視覚的なストーリーテリングシステムは、フレーム全体でキャラクターのアイデン … 続きを読む →

カテゴリー: cs.CL, cs.CV, I.2.10 | コメントを受け付けていません

Pose Priors from Language Models

投稿日: 2025年5月16日作成者: jarxiv

要約言語は物理的な相互作用を説明するためによく使用されますが、ほとんどの3D人 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Multi-Token Prediction Needs Registers

投稿日: 2025年5月16日作成者: jarxiv

要約マルチトークンの予測は、言語モデルの事前トレーニングを改善するための有望な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models

投稿日: 2025年5月16日作成者: jarxiv

要約投機的デコードは、軽量のドラフトモデルが複数のターゲットモデルが同時に検証 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

投稿日: 2025年5月16日作成者: jarxiv

要約大規模なマルチモーダルモデルのトレーニングに広く使用されている自然言語画像 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Behind Maya: Building a Multilingual Vision Language Model

投稿日: 2025年5月16日作成者: jarxiv

要約最近では、大規模なビジョン言語モデル（VLM）の急速な発展が見られました。 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

FAMMA: A Benchmark for Financial Domain Multilingual Multimodal Question Answering

投稿日: 2025年5月16日作成者: jarxiv

要約この論文では、\ underline {a} ncial \ underl … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model

投稿日: 2025年5月16日作成者: jarxiv

要約材料科学の知識は、広範な科学文献全体に広く分散されており、新しい材料の効率 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models

StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation

Pose Priors from Language Models

Multi-Token Prediction Needs Registers

MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Behind Maya: Building a Multilingual Vision Language Model

FAMMA: A Benchmark for Financial Domain Multilingual Multimodal Question Answering

Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model

最近の投稿

最近のコメント

アーカイブ

カテゴリー