「cs.CL」カテゴリーアーカイブ

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

投稿日: 2024年12月25日作成者: jarxiv

要約汎用の身体エージェントは、ユーザーの自然な指示や意図を理解し、普遍的なタス … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.RO | コメントを受け付けていません

Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study

投稿日: 2024年12月25日作成者: jarxiv

要約コード脆弱性検出 (CVD) は、システムセキュリティの問題に対処して防 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

GenAI Content Detection Task 2: AI vs. Human — Academic Essay Authenticity Challenge

投稿日: 2024年12月25日作成者: jarxiv

要約このペーパーでは、COLING 2025 と併置された GenAI コンテ … 続きを読む →

カテゴリー: 68T50, cs.AI, cs.CL, F.2.2 | コメントを受け付けていません

DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation

投稿日: 2024年12月25日作成者: jarxiv

要約コードレビューはソフトウェア開発において不可欠ですが要求の厳しい側面であ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SE | コメントを受け付けていません

GPTEval: A Survey on Assessments of ChatGPT and GPT-4

投稿日: 2024年12月25日作成者: jarxiv

要約 ChatGPT の出現により、社会および経済システムを混乱させる可能性につ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Re-examining learning linear functions in context

投稿日: 2024年12月25日作成者: jarxiv

要約インコンテキスト学習 (ICL) は、大規模言語モデル (LLM) をさま … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models

投稿日: 2024年12月25日作成者: jarxiv

要約自然言語処理 (NLP) の分野で大規模言語モデル (LLM) が広く適用 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation

投稿日: 2024年12月25日作成者: jarxiv

要約 Medical Dialogue System は、患者とエージェントの会 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

XRAG: eXamining the Core — Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation

投稿日: 2024年12月25日作成者: jarxiv

要約検索拡張生成 (RAG) は、関連データの検索と大規模言語モデル (LLM … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Exploring Facets of Language Generation in the Limit

投稿日: 2024年12月25日作成者: jarxiv

要約 Kleinberg と Mullainathan の最近の研究 [KM24 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DS, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study

GenAI Content Detection Task 2: AI vs. Human — Academic Essay Authenticity Challenge

DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation

GPTEval: A Survey on Assessments of ChatGPT and GPT-4

Re-examining learning linear functions in context

M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models

Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation

XRAG: eXamining the Core — Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation

Exploring Facets of Language Generation in the Limit

最近の投稿

最近のコメント

アーカイブ

カテゴリー