月別アーカイブ: 2024年2月

Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs

投稿日: 2024年2月28日作成者: jarxiv

要約ユビキタスシステムでは大規模言語モデル (LLM) が広く使用されている … 続きを読む →

カテゴリー: cs.CL, cs.CY | コメントを受け付けていません

Anatomy of Neural Language Models

投稿日: 2024年2月28日作成者: jarxiv

要約生成 AI と転移学習の分野は、近年、特に自然言語処理 (NLP) の分野 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling

投稿日: 2024年2月28日作成者: jarxiv

要約マスク言語モデリング (MLM) に基づく事前トレーニング済み言語モデルは … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents

投稿日: 2024年2月28日作成者: jarxiv

要約（大規模な）言語モデルはここ数年で大幅に改善されましたが、基礎となる注意メ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution

投稿日: 2024年2月28日作成者: jarxiv

要約機械翻訳では、忠実さ、表現力、優雅さが常に追求されています。ただし、 \ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations

投稿日: 2024年2月28日作成者: jarxiv

要約個々のニューロンは、複数の高レベルの概念の表現に参加します。さまざまな解 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

投稿日: 2024年2月28日作成者: jarxiv

要約この研究では、自然言語生成 (NLG) タスクの命令におけるタスクの曖昧さ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

投稿日: 2024年2月28日作成者: jarxiv

要約汎用大規模言語モデル (LLM) は、翻訳分野内の複数のタスクに習熟してい … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Towards Optimal Learning of Language Models

投稿日: 2024年2月28日作成者: jarxiv

要約この研究では、優れたパフォーマンスを達成するために必要なトレーニング手順を … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Massive Activations in Large Language Models

投稿日: 2024年2月28日作成者: jarxiv

要約大規模言語モデル (LLM) では経験的な現象が観察されています。ごく少数 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年2月

Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs

Anatomy of Neural Language Models

Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling

NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents

Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution

RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations

AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

Towards Optimal Learning of Language Models

Massive Activations in Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー