「cs.CL」カテゴリーアーカイブ

Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models

投稿日: 2024年6月3日作成者: jarxiv

要約大規模言語モデル (LLM) の優れた機能に関する最近の声明は、通常、オー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG, cs.SE | コメントを受け付けていません

Code Pretraining Improves Entity Tracking Abilities of Language Models

投稿日: 2024年6月3日作成者: jarxiv

要約最近の研究では、コード上で言語モデルを事前トレーニングすると、自然言語で表 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Enhancing Vision Models for Text-Heavy Content Understanding and Interaction

投稿日: 2024年6月3日作成者: jarxiv

要約複数の画像を含むテキストの多いビジュアルコンテンツを操作して理解すること … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Calibrated Self-Rewarding Vision Language Models

投稿日: 2024年6月3日作成者: jarxiv

要約大規模ビジョン言語モデル (LVLM) は、事前トレーニングされた大規模言 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet

投稿日: 2024年6月3日作成者: jarxiv

要約線形注意メカニズムは、線形計算の複雑さと速度の向上により、因果言語モデルで … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering

投稿日: 2024年6月3日作成者: jarxiv

要約 Visual Question Answering (VQA) には、視覚 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights

投稿日: 2024年6月3日作成者: jarxiv

要約 Web スケールのビジョン言語データセット間には、当然ながら深刻なデータの … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

投稿日: 2024年6月3日作成者: jarxiv

要約汎用人工知能の探求において、マルチモーダル大規模言語モデル (MLLM) … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Iterative Feature Boosting for Explainable Speech Emotion Recognition

投稿日: 2024年6月3日作成者: jarxiv

要約音声感情認識 (SER) では、実際の重要性を考慮せずに事前定義された特徴 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS, I.2.1 | コメントを受け付けていません

Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation

投稿日: 2024年5月31日作成者: jarxiv

要約 Zero-Shot Object Navigation (ZSON) を使 … 続きを読む →

カテゴリー: cs.CL, cs.HC, cs.RO | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models

Code Pretraining Improves Entity Tracking Abilities of Language Models

Enhancing Vision Models for Text-Heavy Content Understanding and Interaction

Calibrated Self-Rewarding Vision Language Models

You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet

II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering

Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Iterative Feature Boosting for Explainable Speech Emotion Recognition

Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation

最近の投稿

最近のコメント

アーカイブ

カテゴリー