Category archive: cs.AI

IDInit: A Universal and Stable Initialization Method for Neural Network Training

Posted on March 7, 2025 by jarxiv

Abstract: Deep neural networks have achieved remarkable accomplishments in practice. These …

Categories: cs.AI, cs.LG

Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking

Posted on March 7, 2025 by jarxiv

Abstract: As open-source large language models (LLMs) such as Llama3 become more capable …

Categories: cs.AI, cs.CL, cs.CR, cs.LG

Tutorial on amortized optimization

Posted on March 7, 2025 by jarxiv

Abstract: Optimization is a ubiquitous modeling tool, and similar instances of the same problem are repeatedly …

Categories: cs.AI, cs.LG, math.OC

Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment

Posted on March 7, 2025 by jarxiv

Abstract: Direct Preference Optimization (DPO) aligns large language models (LLMs) with human preferences …

Categories: cs.AI, cs.CL

AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement

Posted on March 7, 2025 by jarxiv

Abstract: Embodied agents that assist humans are often asked to complete new tasks …

Categories: cs.AI, cs.CL, cs.LG, cs.RO

Multi-Agent Inverse Q-Learning from Demonstrations

Posted on March 7, 2025 by jarxiv

Abstract: When reward functions are hand-specified, deep reinforcement learning algorithms often …

Categories: cs.AI, cs.LG, cs.MA, cs.RO

Matrix Factorization for Inferring Associations and Missing Links

Posted on March 7, 2025 by jarxiv

Abstract: Missing link prediction, in recommender systems for knowledge graphs, biology, social sciences, cybersecurity …

Categories: cs.AI, cs.LG, cs.LO

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly

Posted on March 7, 2025 by jarxiv

Abstract: Many benchmarks for evaluating long-context language models (LCLMs) …

Categories: cs.AI, cs.CL

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Posted on March 7, 2025 by jarxiv

Abstract: Reasoning language models, "thinking long …

Categories: cs.AI, cs.CL, cs.LG

Do Not Trust Licenses You See — Dataset Compliance Requires Massive-Scale AI-Powered Lifecycle Tracing

Posted on March 7, 2025 by jarxiv

Abstract: This paper argues that, from license terms alone, a dataset's legal risk cannot be accurately …

Categories: cs.AI, cs.CY
