「cs.CL」カテゴリーアーカイブ

S2-Attention: Hardware-Aware Context Sharding Among Attention Heads

投稿日: 2025年1月13日作成者: jarxiv

要約コンテキスト内のトークンのサブセットに選択的に注意を向ける、まばらな注意が … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Long Story Short: Story-level Video Understanding from 20K Short Films

投稿日: 2025年1月13日作成者: jarxiv

要約視覚言語モデルの最近の開発により、ビデオの理解が大幅に進歩しました。ただ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

VideoRAG: Retrieval-Augmented Generation over Video Corpus

投稿日: 2025年1月13日作成者: jarxiv

要約検索拡張生成 (RAG) は、クエリに関連する外部知識を取得し、それを生成 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

投稿日: 2025年1月13日作成者: jarxiv

要約教育においては、大規模言語モデル (LLM) の人間に似たテキストを生成す … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Effective faking of verbal deception detection with target-aligned adversarial attacks

投稿日: 2025年1月13日作成者: jarxiv

要約背景: 言語の分析による欺瞞の検出は、人間の判断と自動化された機械学習の判 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Addressing speaker gender bias in large scale speech translation systems

投稿日: 2025年1月13日作成者: jarxiv

要約この研究は、攻撃的で不正確な翻訳につながる可能性がある、音声翻訳 (ST) … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Are We Done with MMLU?

投稿日: 2025年1月13日作成者: jarxiv

要約たぶんそうではありません。人気のある Massive Multitask … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters

投稿日: 2025年1月13日作成者: jarxiv

要約この論文では、mDeBERTas の事前トレーニングデータにおけるさまざ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Benchmarking Rotary Position Embeddings for Automatic Speech Recognition

投稿日: 2025年1月13日作成者: jarxiv

要約 Rotary Position Embedding (RoPE) は、シー … 続きを読む →

カテゴリー: cs.AI, cs.CL, eess.AS | コメントを受け付けていません

Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding

投稿日: 2025年1月13日作成者: jarxiv

要約最近の多言語自動音声認識モデルは数千の言語をサポートすると主張していますが … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

S2-Attention: Hardware-Aware Context Sharding Among Attention Heads

Long Story Short: Story-level Video Understanding from 20K Short Films

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

Effective faking of verbal deception detection with target-aligned adversarial attacks

Addressing speaker gender bias in large scale speech translation systems

Are We Done with MMLU?

How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters

Benchmarking Rotary Position Embeddings for Automatic Speech Recognition

Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding

最近の投稿

最近のコメント

アーカイブ

カテゴリー