「cs.CL」カテゴリーアーカイブ

MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification

投稿日: 2024年6月27日作成者: jarxiv

要約大規模マルチモーダルモデル (LMM) におけるマルチモーダル数学推論の … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

投稿日: 2024年6月27日作成者: jarxiv

要約人間の判断ではなく、LLM が生成した判断を使用して NLP モデルを評価 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

投稿日: 2024年6月27日作成者: jarxiv

要約大規模言語モデル (LLM) は、特にテキストの数学的問題解決において、優 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

投稿日: 2024年6月27日作成者: jarxiv

要約 WildGuard を紹介します。これは、(1) ユーザープロンプト内の … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Is In-Context Learning a Type of Gradient-Based Learning? Evidence from the Inverse Frequency Effect in Structural Priming

投稿日: 2024年6月27日作成者: jarxiv

要約大規模言語モデル (LLM) は、コンテキスト内学習 (ICL) の新たな … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

BASS: Batched Attention-optimized Speculative Sampling

投稿日: 2024年6月27日作成者: jarxiv

要約投機的デコードは、大規模な言語モデルをホストする際の待ち時間とスループット … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

投稿日: 2024年6月27日作成者: jarxiv

要約 WildTeaming は、自動 LLM 安全レッドチームフレームワーク … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

‘Is ChatGPT a Better Explainer than My Professor?’: Evaluating the Explanation Capabilities of LLMs in Conversation Compared to a Human Baseline

投稿日: 2024年6月27日作成者: jarxiv

要約説明は知識共有の基礎を形成し、コミュニケーション原則、社会力学、学習理論に … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation

投稿日: 2024年6月27日作成者: jarxiv

要約大規模言語モデル (LLM) は、NLP の分野に革命をもたらしました。 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

On the Impact of Voice Anonymization on Speech Diagnostic Applications: a Case Study on COVID-19 Detection

投稿日: 2024年6月27日作成者: jarxiv

要約深層学習の進歩に伴い、パーソナルアシスタント、感情コンピューティング、遠 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Is In-Context Learning a Type of Gradient-Based Learning? Evidence from the Inverse Frequency Effect in Structural Priming

BASS: Batched Attention-optimized Speculative Sampling

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

‘Is ChatGPT a Better Explainer than My Professor?’: Evaluating the Explanation Capabilities of LLMs in Conversation Compared to a Human Baseline

PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation

On the Impact of Voice Anonymization on Speech Diagnostic Applications: a Case Study on COVID-19 Detection

最近の投稿

最近のコメント

アーカイブ

カテゴリー