月別アーカイブ: 2024年5月

Understanding and Minimising Outlier Features in Neural Network Training

投稿日: 2024年5月30日作成者: jarxiv

要約外れ値特徴 (OF) は、その活性化の大きさがニューラルネットワーク ( … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Causal Inference from Slowly Varying Nonstationary Processes

投稿日: 2024年5月30日作成者: jarxiv

要約制限構造因果モデル (SCM) フレームワークに従った観察データからの因果 … 続きを読む →

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

投稿日: 2024年5月30日作成者: jarxiv

要約大規模言語モデル (LLM) の最近の進歩により、LLM は不可欠なものと … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG | コメントを受け付けていません

From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models

投稿日: 2024年5月30日作成者: jarxiv

要約これまで、言語モデルにおける有害性の軽減は、ほぼ完全に単一言語設定に焦点を … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Open-Source LLMs for Text Annotation: A Practical Guide for Model Setting and Fine-Tuning

投稿日: 2024年5月30日作成者: jarxiv

要約この論文では、政治科学研究に典型的なテキスト分類タスクにおけるオープンソー … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation

投稿日: 2024年5月30日作成者: jarxiv

要約最近のエンドツーエンドのアプローチは、大規模言語モデル (LLM) を音声 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Building Guardrails for Large Language Models

投稿日: 2024年5月30日作成者: jarxiv

要約大規模言語モデル (LLM) が私たちの日常生活にさらに統合されるにつれて … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models

投稿日: 2024年5月30日作成者: jarxiv

要約大規模言語モデル (LLM) の使用がさらに普及するにつれて、生成された応 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design

投稿日: 2024年5月30日作成者: jarxiv

要約 Cephalo は、材料科学アプリケーション向けに設計された一連のマルチモ … 続きを読む →

カテゴリー: cond-mat.mes-hall, cond-mat.mtrl-sci, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models

投稿日: 2024年5月30日作成者: jarxiv

要約最近、大規模言語モデル (LLM) は、コンテキストの理解、論理的推論の実 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年5月

Understanding and Minimising Outlier Features in Neural Network Training

Causal Inference from Slowly Varying Nonstationary Processes

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models

Open-Source LLMs for Text Annotation: A Practical Guide for Model Setting and Fine-Tuning

BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation

Building Guardrails for Large Language Models

Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models

Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design

Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー