Category archive: cs.CL

DualKanbaFormer: Kolmogorov-Arnold Networks and State Space Model Transformer for Multimodal Aspect-based Sentiment Analysis

Posted on September 2, 2024 by jarxiv

Abstract: Multimodal aspect-based sentiment analysis (MABSA) …

Categories: cs.CL

Exploring Group and Symmetry Principles in Large Language Models

Posted on September 2, 2024 by jarxiv

Abstract: Across a wide range of applications, large language models (LLMs) …

Categories: cs.CL

CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models

Posted on September 2, 2024 by jarxiv

Abstract: The digitization of historical print media archives increases access to contemporary records …

Categories: cs.CL, cs.DL

SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists

Posted on September 2, 2024 by jarxiv

Abstract: Traditional benchmarks in NLP typically rely on static, held-out test …

Categories: cs.CL

TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning

Posted on September 2, 2024 by jarxiv

Abstract: Language model continual learning (CL): large language models, without retraining, …

Categories: cs.AI, cs.CL

Jailbreak Attacks and Defenses Against Large Language Models: A Survey

Posted on September 2, 2024 by jarxiv

Abstract: Large language models (LLMs), across a variety of … including question answering, translation, and code completion …

Categories: cs.AI, cs.CL, cs.CR, cs.LG

Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts

Posted on September 2, 2024 by jarxiv

Abstract: Creating a low-cost Mixture-of-Domain-Experts (MOE) from trained models …

Categories: cs.AI, cs.CL

Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer

Posted on September 2, 2024 by jarxiv

Abstract: Mixture of Experts (MoE) … significant additional computational cost …

Categories: cs.AI, cs.CL

Bridging Domain Knowledge and Process Discovery Using Large Language Models

Posted on September 2, 2024 by jarxiv

Abstract: Discovering a good process model, for tasks such as conformance checking and process improvement, …

Categories: cs.AI, cs.CL

Modularity in Transformers: Investigating Neuron Separability & Specialization

Posted on September 2, 2024 by jarxiv

Abstract: Transformer models are increasingly prevalent in various applications, but their …

Categories: (Primary), 68T05, cs.AI, cs.CL, cs.LG, I.2.4
