「cs.CL」カテゴリーアーカイブ

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

投稿日: 2025年2月7日作成者: jarxiv

要約最近のコーダーモデルのほとんどの進歩は、監視された微調整（SFT）によって … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SE | コメントを受け付けていません

Great Models Think Alike and this Undermines AI Oversight

投稿日: 2025年2月7日作成者: jarxiv

要約言語モデル（LM）機能が進歩するにつれて、それらを大規模に評価および監督す … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters

投稿日: 2025年2月7日作成者: jarxiv

要約大規模な言語モデル（LLMS）の最近の進歩により、多様なタスク全体で顕著な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions

投稿日: 2025年2月7日作成者: jarxiv

要約広範な安全整合の取り組みにもかかわらず、大規模な言語モデル（LLM）は、有 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.LG | コメントを受け付けていません

MuJo: Multimodal Joint Feature Space Learning for Human Activity Recognition

投稿日: 2025年2月7日作成者: jarxiv

要約人間の活動認識（HAR）は、ヘルスケア、スポーツ、フィットネス、セキュリテ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts

投稿日: 2025年2月7日作成者: jarxiv

要約偽情報の拡散は、信頼性が高くスケーラブルな事実確認ソリューションを必要とし … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Evaluating Numerical Reasoning in Text-to-Image Models

投稿日: 2025年2月7日作成者: jarxiv

要約テキストから画像への生成モデルは、自然言語を使用して記述されている概念を忠 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

投稿日: 2025年2月7日作成者: jarxiv

要約特にGPT-4Oに続く大規模な言語モデルの最近の進歩により、より多くのモダ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM, cs.SD, eess.AS, eess.IV | コメントを受け付けていません

UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models

投稿日: 2025年2月6日作成者: jarxiv

要約大規模な言語モデル（LLM）は、特に数学で複雑な推論タスクを解決する際に顕 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

投稿日: 2025年2月6日作成者: jarxiv

要約監視された微調整（SFT）は、一般的に言語モデルをトレーニングして、指定さ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Great Models Think Alike and this Undermines AI Oversight

ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters

Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions

MuJo: Multimodal Joint Feature Space Learning for Human Activity Recognition

DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts

Evaluating Numerical Reasoning in Text-to-Image Models

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

最近の投稿

最近のコメント

アーカイブ

カテゴリー