"cs.CL" Category Archive

EvidenceMap: Learning Evidence Analysis to Unleash the Power of Small Language Models for Biomedical Question Answering

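Posted on: January 31, 2025 | Author: jarxiv

Summary: When addressing specialized questions in the biomedical domain, humans typically draw on multiple pieces of information as evidence … Continue reading →

Categories: 68T50, cs.AI, cs.CL | Comments are closed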

Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models

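Posted on: January 31, 2025 | Author: jarxiv

Summary: Recent advances in large language models (LLMs) have incorporated planning and reasoning capabilities … Continue reading →

Categories: cs.CL | Comments are closed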

Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models

Posted on: January 31, 2025 | Author: jarxiv

Summary: When performing complex multi-step reasoning tasks, structured intermediate proof steps … Continue reading →

Categories: cs.AI, cs.CL | Comments are closed

SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration

Posted on: January 31, 2025 | Author: jarxiv

Summary: The development of unbiased large language models is widely recognized as important … Continue reading →

Categories: 68T50, cs.CL, F.4.2 | Comments are closed

Contextually Structured Token Dependency Encoding for Large Language Models

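Posted on: January 31, 2025 | Author: jarxiv

Summary: Token representation strategies within large-scale neural architectures are often contextually … Continue reading →

Categories: cs.CL | Comments are closed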

Idiom Detection in Sorani Kurdish Texts

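Posted on: January 31, 2025 | Author: jarxiv

Summary: Idiom detection using natural language processing (NLP) goes beyond the literal interpretation of words … Continue reading →

Categories: cs.CL | Comments are closed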

Statistical multi-metric evaluation and visualization of LLM system predictive performance

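Posted on: January 31, 2025 | Author: jarxiv

Summary: Evaluation of generative or discriminative large language model (LLM)-based systems is, in many … Continue reading →

Categories: cs.CL, cs.LG, stat.AP | Comments are closed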

How to Select Datapoints for Efficient Human Evaluation of NLG Models?

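Posted on: January 31, 2025 | Author: jarxiv

Summary: Human evaluation is the gold standard for evaluating text generation models. It is also expensive, and … Continue reading →

Categories: cs.CL | Comments are closed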

Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence

Posted on: January 31, 2025 | Author: jarxiv

Summary: As guardrails against misinformation and disinformation online degrade, effectively … Continue reading →

Categories: cs.CL, cs.HC, cs.IR | Comments are closed

Jailbreaking LLMs’ Safeguard with Universal Magic Words for Text Embedding Models

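Posted on: January 31, 2025 | Author: jarxiv

Summary: The security of large language models (LLMs) has recently, with the aim of preventing harmful outputs, … Continue reading →

Categories: cs.AI, cs.CL, cs.LG, cs.NE | Comments are closed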
