「cs.CL」カテゴリーアーカイブ

ReSpark: Leveraging Previous Data Reports as References to Generate New Reports with LLMs

投稿日: 2025年2月5日作成者: jarxiv

要約データレポートの作成は、データの探索と理解を繰り返し、その後に洞察を要約す … 続きを読む →

カテゴリー: cs.CL, cs.HC | コメントを受け付けていません

AlphaSharpe: LLM-Driven Discovery of Robust Risk-Adjusted Metrics

投稿日: 2025年2月5日作成者: jarxiv

要約シャープレシオのような財務指標は、リスクとリターンのバランスを取ることによ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.NE, q-fin.PM, q-fin.RM | コメントを受け付けていません

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

投稿日: 2025年2月5日作成者: jarxiv

要約マルチモーダル大規模言語モデル（MLLM）は印象的な能力を示すが、複雑な視 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs

投稿日: 2025年2月5日作成者: jarxiv

要約 Chain-of-Thought(CoT)プロンプトは、詳細なステップバイ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

STAIR: Improving Safety Alignment with Introspective Reasoning

投稿日: 2025年2月5日作成者: jarxiv

要約大規模言語モデル(LLM)の安全性と無害性を保証することは、アプリケーショ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Avoiding spurious sharpness minimization broadens applicability of SAM

投稿日: 2025年2月5日作成者: jarxiv

要約 Sharpness Aware Minimization (SAM)のよう … 続きを読む →

カテゴリー: cs.CL, cs.LG, stat.ML | コメントを受け付けていません

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation

投稿日: 2025年2月5日作成者: jarxiv

要約本論文では、Plan*RAGを紹介する。Plan*RAGは、テスト時間の推 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

MILU: A Multi-task Indic Language Understanding Benchmark

投稿日: 2025年2月5日作成者: jarxiv

要約低リソースで言語的に多様な言語における大規模言語モデル（LLM）の評価は、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Is poisoning a real threat to LLM alignment? Maybe more so than you think

投稿日: 2025年2月5日作成者: jarxiv

要約人間のフィードバックを伴う強化学習(RLHF)の最近の進歩は、大規模言語モ … 続きを読む →

カテゴリー: cs.CL, cs.CR, cs.LG | コメントを受け付けていません

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

投稿日: 2025年2月5日作成者: jarxiv

要約言語モデルのアライメントのための既存のプリファレンス最適化目標では、最適な … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

ReSpark: Leveraging Previous Data Reports as References to Generate New Reports with LLMs

AlphaSharpe: LLM-Driven Discovery of Robust Risk-Adjusted Metrics

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs

STAIR: Improving Safety Alignment with Introspective Reasoning

Avoiding spurious sharpness minimization broadens applicability of SAM

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation

MILU: A Multi-task Indic Language Understanding Benchmark

Is poisoning a real threat to LLM alignment? Maybe more so than you think

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

最近の投稿

最近のコメント

アーカイブ

カテゴリー