月別アーカイブ: 2025年2月

STAIR: Improving Safety Alignment with Introspective Reasoning

投稿日: 2025年2月5日作成者: jarxiv

要約大規模言語モデル(LLM)の安全性と無害性を保証することは、アプリケーショ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Avoiding spurious sharpness minimization broadens applicability of SAM

投稿日: 2025年2月5日作成者: jarxiv

要約 Sharpness Aware Minimization (SAM)のよう … 続きを読む →

カテゴリー: cs.CL, cs.LG, stat.ML | コメントを受け付けていません

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation

投稿日: 2025年2月5日作成者: jarxiv

要約本論文では、Plan*RAGを紹介する。Plan*RAGは、テスト時間の推 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

MILU: A Multi-task Indic Language Understanding Benchmark

投稿日: 2025年2月5日作成者: jarxiv

要約低リソースで言語的に多様な言語における大規模言語モデル（LLM）の評価は、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Is poisoning a real threat to LLM alignment? Maybe more so than you think

投稿日: 2025年2月5日作成者: jarxiv

要約人間のフィードバックを伴う強化学習(RLHF)の最近の進歩は、大規模言語モ … 続きを読む →

カテゴリー: cs.CL, cs.CR, cs.LG | コメントを受け付けていません

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

投稿日: 2025年2月5日作成者: jarxiv

要約言語モデルのアライメントのための既存のプリファレンス最適化目標では、最適な … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Beemo: Benchmark of Expert-edited Machine-generated Outputs

投稿日: 2025年2月5日作成者: jarxiv

要約大規模言語モデル（LLM）の急速な普及により、機械生成テキスト（MGT）の … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study

投稿日: 2025年2月5日作成者: jarxiv

要約本研究では、非英語コーパスにおける道徳的基盤（MF）を測定するための計算論 … 続きを読む →

カテゴリー: cs.CL, cs.SI | コメントを受け付けていません

Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation

投稿日: 2025年2月5日作成者: jarxiv

要約検索、再順位付け、および検索拡張生成（RAG）は、情報検索、質問応答、およ … 続きを読む →

カテゴリー: cs.CL, cs.IR | コメントを受け付けていません

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

投稿日: 2025年2月5日作成者: jarxiv

要約大規模言語モデル(LLM)は継続的に多言語能力を向上させており、小規模なオ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

月別アーカイブ: 2025年2月

STAIR: Improving Safety Alignment with Introspective Reasoning

Avoiding spurious sharpness minimization broadens applicability of SAM

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation

MILU: A Multi-task Indic Language Understanding Benchmark

Is poisoning a real threat to LLM alignment? Maybe more so than you think

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

Beemo: Benchmark of Expert-edited Machine-generated Outputs

Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study

Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

最近の投稿

最近のコメント

アーカイブ

カテゴリー