月別アーカイブ: 2025年2月

Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems

投稿日: 2025年2月3日作成者: jarxiv

要約談話と認知科学の理論は、ゆったりとしたペーシングの価値を長い間認識してきま … 続きを読む →

カテゴリー: cs.CL, cs.HC | コメントを受け付けていません

SafetyAnalyst: Interpretable, transparent, and steerable safety moderation for AI behavior

投稿日: 2025年2月3日作成者: jarxiv

要約理想的なAIの安全性節度システムは、構造的に解釈可能であり（そのため、その … 続きを読む →

カテゴリー: cs.CL, cs.CY | コメントを受け付けていません

GPT-4o as the Gold Standard: A Scalable and General Purpose Approach to Filter Language Model Pretraining Data

投稿日: 2025年2月3日作成者: jarxiv

要約大規模な言語モデルには膨大な量の高品質のトレーニングデータが必要ですが、W … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

投稿日: 2025年2月3日作成者: jarxiv

要約この作業では、オーディオとテキストを大規模な言語モデル（LLM）に統合する … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

TableMaster: A Recipe to Advance Table Understanding with Language Models

投稿日: 2025年2月3日作成者: jarxiv

要約テーブルは、構造化されたリレーショナルデータを表すための基本形式として機能 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Diverse Preference Optimization

投稿日: 2025年2月3日作成者: jarxiv

要約補強学習、好みの最適化、または監視された微調整のいずれかを通じて、言語モデ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Strassen Attention: Unlocking Compositional Abilities in Transformers Based on a New Lower Bound Method

投稿日: 2025年2月3日作成者: jarxiv

要約変圧器の理論的な制限を評価するための新しい方法を提案し、無限の精度で1層ソ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models

投稿日: 2025年2月3日作成者: jarxiv

要約私たちは、大規模な言語モデル（LLMS）の特徴普遍性を調査します。これは、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation

投稿日: 2025年2月3日作成者: jarxiv

要約ゼロショットクロスドメインの順次推奨（ZCDSR）は、追加のトレーニングや … 続きを読む →

カテゴリー: cs.AI, cs.IR | コメントを受け付けていません

Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence

投稿日: 2025年2月3日作成者: jarxiv

要約リスクに敏感な強化学習（RL）は、ハイステークスアプリケーションで信頼でき … 続きを読む →

カテゴリー: cs.AI, cs.LG, math.OC | コメントを受け付けていません

月別アーカイブ: 2025年2月

Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems

SafetyAnalyst: Interpretable, transparent, and steerable safety moderation for AI behavior

GPT-4o as the Gold Standard: A Scalable and General Purpose Approach to Filter Language Model Pretraining Data

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

TableMaster: A Recipe to Advance Table Understanding with Language Models

Diverse Preference Optimization

Strassen Attention: Unlocking Compositional Abilities in Transformers Based on a New Lower Bound Method

Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models

A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation

Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence

最近の投稿

最近のコメント

アーカイブ

カテゴリー