月別アーカイブ: 2024年9月

Legilimens: Practical and Unified Content Moderation for Large Language Model Services

投稿日: 2024年9月6日作成者: jarxiv

要約大規模言語モデル (LLM) によって生成された安全でないコンテンツが社会 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Towards Evaluating and Building Versatile Large Language Models for Medicine

投稿日: 2024年9月6日作成者: jarxiv

要約この研究では、臨床状況における大規模言語モデル (LLM) のパフォーマン … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization

投稿日: 2024年9月6日作成者: jarxiv

要約人間のフィードバックからの強化学習 (RLHF) は、言語モデルを人間の好 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

LLM-based multi-agent poetry generation in non-cooperative environments

投稿日: 2024年9月6日作成者: jarxiv

要約自動詩生成のための大規模言語モデル (LLM) は大幅に進歩しているにもか … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

The representation landscape of few-shot learning and fine-tuning in large language models

投稿日: 2024年9月6日作成者: jarxiv

要約インコンテキスト学習 (ICL) と教師あり微調整 (SFT) は、特定の … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Positioning Political Texts with Large Language Models by Asking and Averaging

投稿日: 2024年9月6日作成者: jarxiv

要約 GPT-4、Llama 3、MiXtral、Aya などの命令調整されたラ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Exploring Group and Symmetry Principles in Large Language Models

投稿日: 2024年9月6日作成者: jarxiv

要約大規模言語モデル (LLM) は、幅広いアプリケーションにわたって優れたパ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Cost-Efficient Subjective Task Annotation and Modeling through Few-Shot Annotator Adaptation

投稿日: 2024年9月6日作成者: jarxiv

要約主観的な NLP タスクでは、単一のグラウンドトゥルースが存在しないため … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

A Fused Large Language Model for Predicting Startup Success

投稿日: 2024年9月6日作成者: jarxiv

要約投資家はスタートアップへの収益性の高い投資機会を継続的に求めているため、効 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

投稿日: 2024年9月6日作成者: jarxiv

要約現在の最強の言語モデルの事前トレーニングデータは不透明です。特に、さま … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年9月

Legilimens: Practical and Unified Content Moderation for Large Language Model Services

Towards Evaluating and Building Versatile Large Language Models for Medicine

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization

LLM-based multi-agent poetry generation in non-cooperative environments

The representation landscape of few-shot learning and fine-tuning in large language models

Positioning Political Texts with Large Language Models by Asking and Averaging

Exploring Group and Symmetry Principles in Large Language Models

Cost-Efficient Subjective Task Annotation and Modeling through Few-Shot Annotator Adaptation

A Fused Large Language Model for Predicting Startup Success

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

最近の投稿

最近のコメント

アーカイブ

カテゴリー