「cs.CY」カテゴリーアーカイブ

Plurals: A System for Guiding LLMs Via Simulated Social Ensembles

投稿日: 2024年11月4日作成者: jarxiv

要約最近の議論では、言語モデルが特定の視点を好むのではないかという懸念が提起さ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.HC, cs.MA | コメントを受け付けていません

ProgressGym: Alignment with a Millennium of Moral Progress

投稿日: 2024年11月1日作成者: jarxiv

要約大規模言語モデル (LLM) を含むフロンティア AI システムは、人間の … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.HC, cs.LG | コメントを受け付けていません

Representative Social Choice: From Learning Theory to AI Alignment

投稿日: 2024年11月1日作成者: jarxiv

要約社会選択理論は、集団全体にわたる選好の集約に関する研究であり、人間エージェ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.GT, cs.LG | コメントを受け付けていません

Revealing Fine-Grained Values and Opinions in Large Language Models

投稿日: 2024年11月1日作成者: jarxiv

要約大規模言語モデル (LLM) に埋め込まれた潜在的な価値観や意見を明らかに … 続きを読む →

カテゴリー: cs.CL, cs.CY, cs.LG | コメントを受け付けていません

Hidden Persuaders: LLMs’ Political Leaning and Their Influence on Voters

投稿日: 2024年11月1日作成者: jarxiv

要約 LLM は私たちの民主主義にどのような影響を与えるのでしょうか? 私たちは … 続きを読む →

カテゴリー: cs.CL, cs.CY | コメントを受け付けていません

Implicit Personalization in Language Models: A Systematic Study

投稿日: 2024年11月1日作成者: jarxiv

要約暗黙的なパーソナライゼーション (IP) は、言語モデルが入力プロンプト内 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.HC, cs.LG | コメントを受け付けていません

The Good, the Bad, and the Ugly: The Role of AI Quality Disclosure in Lie Detection

投稿日: 2024年10月31日作成者: jarxiv

要約私たちは、質の高い情報開示を欠いた低品質の AI アドバイザーが、人々が嘘 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.HC, cs.LG | コメントを受け付けていません

Aequitas Flow: Streamlining Fair ML Experimentation

投稿日: 2024年10月31日作成者: jarxiv

要約 Aequitas Flow は、エンドツーエンドの公平な機械学習 (ML) … 続きを読む →

カテゴリー: cs.AI, cs.CY, cs.LG | コメントを受け付けていません

Instigating Cooperation among LLM Agents Using Adaptive Information Modulation

投稿日: 2024年10月31日作成者: jarxiv

要約この論文では、人間の戦略的行動の代理として LLM エージェントを強化学習 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.GT | コメントを受け付けていません

Breach By A Thousand Leaks: Unsafe Information Leakage in `Safe’ AI Responses

投稿日: 2024年10月31日作成者: jarxiv

要約 Frontier 言語モデルには誤用やジェイルブレイクに対する脆弱性がある … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CY | コメントを受け付けていません

「cs.CY」カテゴリーアーカイブ

Plurals: A System for Guiding LLMs Via Simulated Social Ensembles

ProgressGym: Alignment with a Millennium of Moral Progress

Representative Social Choice: From Learning Theory to AI Alignment

Revealing Fine-Grained Values and Opinions in Large Language Models

Hidden Persuaders: LLMs’ Political Leaning and Their Influence on Voters

Implicit Personalization in Language Models: A Systematic Study

The Good, the Bad, and the Ugly: The Role of AI Quality Disclosure in Lie Detection

Aequitas Flow: Streamlining Fair ML Experimentation

Instigating Cooperation among LLM Agents Using Adaptive Information Modulation

Breach By A Thousand Leaks: Unsafe Information Leakage in `Safe’ AI Responses

最近の投稿

最近のコメント

アーカイブ

カテゴリー