Monthly Archives: June 2024

Extracting thin film structures of energy materials using transformers

Posted on: June 25, 2024 | Author: jarxiv

Abstract: For analyzing neutron reflectometry data, a transformer-architecture neural … Continue reading →

Categories: cs.AI, physics.comp-ph | Comments are closed

Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

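Posted on: June 25, 2024 | Author: jarxiv

Abstract: Divergent thinking, that is, the cognitive process of generating diverse solutions, in human creativity and problem … Continue reading →

Categories: cs.AI, cs.CL | Comments are closed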

Bandits with Preference Feedback: A Stackelberg Game Perspective

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Bandits with preference feedback, instead of direct value queries, pairwise … Continue reading →

Categories: cs.AI, cs.GT, cs.LG, stat.ML | Comments are closed

Feature learning as alignment: a structural property of gradient descent in non-linear neural networks

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Neural networks extracting statistics from input-label pairs through feature learning … Continue reading →

Categories: cs.AI, cs.LG, stat.ML | Comments are closed

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

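Posted on: June 25, 2024 | Author: jarxiv

Abstract: Foundation model development draws together rapidly growing contributors, scientists, and applications … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments are closed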

Addressing Polarization and Unfairness in Performative Prediction

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Machine learning (ML) models in applications that involve humans (online … Continue reading →

Categories: cs.AI, cs.CY, cs.LG | Comments are closed

Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots

Posted on: June 25, 2024 | Author: jarxiv

Abstract: We […] the socially optimal order of play, that is, agents committing to their own decisions … Continue reading →

Categories: cs.AI, cs.RO, cs.SY, eess.SY, math.OC | Comments are closed

Pandora’s White-Box: Precise Training Data Detection and Extraction in Large Language Models

Posted on: June 25, 2024 | Author: jarxiv

Abstract: In this paper, state-of-the-art privacy attacks on large language models (LLMs) … Continue reading →

Categories: cs.AI, cs.CR, cs.LG | Comments are closed

WARP: On the Benefits of Weight Averaged Rewarded Policies

Posted on: June 25, 2024 | Author: jarxiv

Abstract: Reinforcement learning from human feedback (RLHF), based on human preferences … Continue reading →

Categories: cs.AI, cs.LG | Comments are closed

OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

Posted on: June 25, 2024 | Author: jarxiv

Abstract: In this report, we pose the following question. OlympicArena … Continue reading →

Categories: cs.AI, cs.CL | Comments are closed
