"cs.CL" Category Archive

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

Posted: December 24, 2024 | Author: jarxiv

Summary: Recently, architectures such as Linformer and Mamba …

Categories: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS

CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks

Posted: December 24, 2024 | Author: jarxiv

Summary: Recently, large language models (LLMs) with extensive general knowledge and strong reasoning abilities …

Categories: cs.AI, cs.CL, cs.LG

LiveIdeaBench: Evaluating LLMs’ Scientific Creativity and Idea Generation with Minimal Context

Posted: December 24, 2024 | Author: jarxiv

Summary: Large language models (LLMs) have demonstrated remarkable capabilities in scientific tasks …

Categories: cs.AI, cs.CL

FocusLLM: Precise Understanding of Long Context by Dynamic Condensing

Posted: December 24, 2024 | Author: jarxiv

Summary: Enabling LLMs to understand long contexts accurately is, for many downstream …

Categories: cs.AI, cs.CL

Chumor 2.0: Towards Benchmarking Chinese Humor Understanding

Posted: December 24, 2024 | Author: jarxiv

Summary: Existing humor datasets and evaluations focus mainly on English, and languages such as Chinese …

Categories: cs.AI, cs.CL

DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

Posted: December 24, 2024 | Author: jarxiv

Summary: Solving mathematical problems requires advanced reasoning abilities, and for large language models …

Categories: cs.AI, cs.CL

Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

Posted: December 24, 2024 | Author: jarxiv

Summary: Improving Rotary Position Embedding (RoPE) …

Categories: cs.AI, cs.CL

RepoTransBench: A Real-World Benchmark for Repository-Level Code Translation

Posted: December 24, 2024 | Author: jarxiv

Summary: Repository-level code translation refers to, while preserving the functionality of the source repository, …

Categories: cs.AI, cs.CL, cs.SE

Quantifying Positional Biases in Text Embedding Models

Posted: December 24, 2024 | Author: jarxiv

Summary: Embedding models are, for tasks such as information retrieval (IR) and semantic similarity measurement, …

Categories: cs.AI, cs.CL, cs.IR

Deliberation in Latent Space via Differentiable Cache Augmentation

Posted: December 24, 2024 | Author: jarxiv

Summary: By generating and processing intermediate reasoning steps, large language models (LLMs) …

Categories: cs.AI, cs.CL, cs.LG