Monthly Archives: March 2024

HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy

Posted on March 25, 2024 by jarxiv

Abstract: For large language models (LLMs), the important task of cognitive reframing …

Categories: cs.AI, cs.CL, cs.HC, J.4

Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning

Posted on March 25, 2024 by jarxiv

Abstract: Event causality identification (ECI) deals with causal relations between events in text …

Categories: cs.AI, cs.CL

ESG Classification by Implicit Rule Learning via GPT-4

Posted on March 25, 2024 by jarxiv

Abstract: Environmental, social, and governance (ESG) factors, as an indicator of higher investment returns, …

Categories: cs.CL

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Posted on March 25, 2024 by jarxiv

Abstract: Currently, pretrained large language models (LLMs) … natural language processing …

Categories: cs.CL

ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition

Posted on March 25, 2024 by jarxiv

Abstract: Self-attention is an essential component of large language models (LLMs), but long …

Categories: cs.CL, cs.LG

Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation

Posted on March 25, 2024 by jarxiv

Abstract: With the advent of scalable deep models and large datasets, neural …

Categories: 68T50, cs.AI, cs.CL, I.2.7

E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity

Posted on March 25, 2024 by jarxiv

Abstract: Traditional pruning methods, with their unaffordable training process and large computational demands, …

Categories: cs.AI, cs.CL, cs.LG

Construction of a Japanese Financial Benchmark for Large Language Models

Posted on March 25, 2024 by jarxiv

Abstract: With the recent development of large language models (LLMs), a focus on specific domains and languages …

Categories: cs.CL, q-fin.CP

Self-Guard: Empower the LLM to Safeguard Itself

Posted on March 25, 2024 by jarxiv

Abstract: Jailbreak attacks … Large Language Model (LLM …

Categories: cs.CL

CHisIEC: An Information Extraction Corpus for Ancient Chinese History

Posted on March 25, 2024 by jarxiv

Abstract: Natural language processing (NLP), in the field of digital humanities (DH), …

Categories: cs.CL
