「cs.AI」カテゴリーアーカイブ

Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality

投稿日: 2024年10月8日作成者: jarxiv

要約 Fr\’echet Video Distance (FVD) … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

SimO Loss: Anchor-Free Contrastive Loss for Fine-Grained Supervised Contrastive Learning

投稿日: 2024年10月8日作成者: jarxiv

要約私たちが提案する類似性直交性 (SimO) 損失を活用した、新しいアンカー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens

投稿日: 2024年10月8日作成者: jarxiv

要約密なテキストを読んだり、画像内のオブジェクトを見つけたりすることは、高度な … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

投稿日: 2024年10月8日作成者: jarxiv

要約好みの最適化 (PO) は、言語モデルを人間の望ましい行動に合わせるための … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Decoding Intelligence: A Framework for Certifying Knowledge Comprehension in LLMs

投稿日: 2024年10月8日作成者: jarxiv

要約知識理解能力は人間の知性の重要な側面です。大規模言語モデル (LLM) … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together

投稿日: 2024年10月8日作成者: jarxiv

要約自然言語処理 (NLP) システムは、検索拡張生成 (RAG) などの洗練 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation

投稿日: 2024年10月8日作成者: jarxiv

要約ストリーミング音声翻訳 (ST) のモデルは、ソース言語の音声とターゲット … 続きを読む →

カテゴリー: cs.AI, cs.CL, eess.AS | コメントを受け付けていません

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

投稿日: 2024年10月8日作成者: jarxiv

要約埋め込みモデルは、意味的類似性、情報検索、クラスタリングなどのさまざまな下 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Better Instruction-Following Through Minimum Bayes Risk

投稿日: 2024年10月8日作成者: jarxiv

要約人間レベルの評価が可能な汎用 LLM ジャッジは、指示に従う LLM を評 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

When ‘A Helpful Assistant’ Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models

投稿日: 2024年10月8日作成者: jarxiv

要約プロンプトは、人間が大規模言語モデル (LLM) と対話する主要な方法とし … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.HC, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality

SimO Loss: Anchor-Free Contrastive Loss for Fine-Grained Supervised Contrastive Learning

TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

Decoding Intelligence: A Framework for Certifying Knowledge Comprehension in LLMs

Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together

CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

Better Instruction-Following Through Minimum Bayes Risk

When ‘A Helpful Assistant’ Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー