月別アーカイブ: 2024年1月

Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning

投稿日: 2024年1月15日作成者: jarxiv

要約この論文では、コンテキスト内学習 (ICL) をメタ最適化プロセスとして扱 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

An investigation of structures responsible for gender bias in BERT and DistilBERT

投稿日: 2024年1月15日作成者: jarxiv

要約近年、大規模な Transformer ベースの事前トレーニング済み言語モ … 続きを読む →

カテゴリー: cs.CL, cs.CY, cs.LG | コメントを受け付けていません

AntEval: Quantitatively Evaluating Informativeness and Expressiveness of Agent Social Interactions

投稿日: 2024年1月15日作成者: jarxiv

要約大規模言語モデル (LLM) ベースのエージェントは、さまざまなシナリオで … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection

投稿日: 2024年1月15日作成者: jarxiv

要約ヘイトスピーチは、蔓延する有害なオンライン言説の一種であり、多くの場合、憎 … 続きを読む →

カテゴリー: cs.CL, cs.SI | コメントを受け付けていません

Improving Language Plasticity via Pretraining with Active Forgetting

投稿日: 2024年1月15日作成者: jarxiv

要約事前トレーニング済み言語モデル (PLM) は、現在、自然言語処理の主要な … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.NE | コメントを受け付けていません

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

投稿日: 2024年1月15日作成者: jarxiv

要約大規模言語モデル (LLM) は、さまざまな自然言語処理タスクにおいて優れ … 続きを読む →

カテゴリー: cs.CL, cs.IR | コメントを受け付けていません

Assessing the Importance of Frequency versus Compositionality for Subword-based Tokenization in NMT

投稿日: 2024年1月15日作成者: jarxiv

要約サブワードのトークン化は、ニューラル言語モデルおよび機械翻訳システムにおけ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Intention Analysis Prompting Makes Large Language Models A Good Jailbreak Defender

投稿日: 2024年1月15日作成者: jarxiv

要約大規模言語モデル (LLM) を人間の価値観に合わせるのは、特にステルスで … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Multistage Collaborative Knowledge Distillation from Large Language Models for Semi-Supervised Sequence Generation

投稿日: 2024年1月15日作成者: jarxiv

要約私たちは、ラベル付きデータが不足しすぎてモデルを効果的に微調整できないと同 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Mergen: The First Manchu-Korean Machine Translation Model Trained on Augmented Data

投稿日: 2024年1月15日作成者: jarxiv

要約満州語は、中国東北部の歴史的な満州地域にルーツを持つ言語ですが、話者がほと … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

月別アーカイブ: 2024年1月

Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning

An investigation of structures responsible for gender bias in BERT and DistilBERT

AntEval: Quantitatively Evaluating Informativeness and Expressiveness of Agent Social Interactions

MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection

Improving Language Plasticity via Pretraining with Active Forgetting

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Assessing the Importance of Frequency versus Compositionality for Subword-based Tokenization in NMT

Intention Analysis Prompting Makes Large Language Models A Good Jailbreak Defender

Multistage Collaborative Knowledge Distillation from Large Language Models for Semi-Supervised Sequence Generation

Mergen: The First Manchu-Korean Machine Translation Model Trained on Augmented Data

最近の投稿

最近のコメント

アーカイブ

カテゴリー