「cs.CL」カテゴリーアーカイブ

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

投稿日: 2025年3月20日作成者: jarxiv

要約最近、強化学習（RL）は、大規模な言語モデル（LLM）の推論能力を大幅に強 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A

投稿日: 2025年3月20日作成者: jarxiv

要約チャット用に微調整された15の大手言語モデル（LLM）を研究し、最大のソフ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning

投稿日: 2025年3月20日作成者: jarxiv

要約自然言語加工（NLP）は、事前に訓練されたタンパク質言語モデル（PLMS） … 続きを読む →

カテゴリー: cs.AI, cs.CL, q-bio.QM | コメントを受け付けていません

From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment

投稿日: 2025年3月20日作成者: jarxiv

要約大規模な言語モデル（LLM）は、ユーザーの価値とニーズの多様性を根本的に見 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification

投稿日: 2025年3月20日作成者: jarxiv

要約自然言語処理（NLP）の基本的なタスクであるテキスト分類は、テキストデータ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

What Makes a Reward Model a Good Teacher? An Optimization Perspective

投稿日: 2025年3月20日作成者: jarxiv

要約人間のフィードバック（RLHF）からの強化学習の成功は、報酬モデルの品質に … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, stat.ML | コメントを受け付けていません

Value Profiles for Encoding Human Variation

投稿日: 2025年3月20日作成者: jarxiv

要約評価タスクにおける人間の変動のモデリングは、パーソナライズ、多元的モデルア … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC, cs.LG | コメントを受け付けていません

SemEval-2025 Task 1: AdMIRe — Advancing Multimodal Idiomaticity Representation

投稿日: 2025年3月20日作成者: jarxiv

要約慣用的な表現は、NLPにユニークな課題を提示します。その意味は、構成要素の … 続きを読む →

カテゴリー: cs.CL, cs.CV, I.2.7 | コメントを受け付けていません

Safety at Scale: A Comprehensive Survey of Large Model Safety

投稿日: 2025年3月20日作成者: jarxiv

要約大規模な事前トレーニングによる学習と一般化における並外れた能力によって推進 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.CV | コメントを受け付けていません

TULIP: Towards Unified Language-Image Pretraining

投稿日: 2025年3月20日作成者: jarxiv

要約 ClipやSiglipなどの画像テキストコントラストモデルの最近の成功にも … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A

VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning

From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment

Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification

What Makes a Reward Model a Good Teacher? An Optimization Perspective

Value Profiles for Encoding Human Variation

SemEval-2025 Task 1: AdMIRe — Advancing Multimodal Idiomaticity Representation

Safety at Scale: A Comprehensive Survey of Large Model Safety

TULIP: Towards Unified Language-Image Pretraining

最近の投稿

最近のコメント

アーカイブ

カテゴリー