Monthly Archives: March 2025

How do language models learn facts? Dynamics, curricula and hallucinations

Posted on: March 28, 2025 | Author: jarxiv

Abstract: Large language models accumulate vast knowledge during pre-training, but this acquisition … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed

JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community

Posted on: March 28, 2025 | Author: jarxiv

Abstract: In this paper, across Chinese and Japanese social media communities, self-destructive … Continue reading →

Categories: cs.CL, cs.CY | Comments are closed

Learning to Represent Individual Differences for Choice Decision Making

Posted on: March 28, 2025 | Author: jarxiv

Abstract: Because decision making is affected by many complex factors, predicting human decisions is … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed

As easy as PIE: understanding when pruning causes language models to disagree

Posted on: March 28, 2025 | Author: jarxiv

Abstract: In language model (LM) pruning, weights, nodes, or other parts of the architecture … Continue reading →

Categories: cs.CL | Comments are closed

CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?

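Posted on: March 28, 2025 | Author: jarxiv

Abstract: A core part of scientific peer review involves experts directly assessing the scientific claims a paper makes … Continue reading →

Categories: cs.CL | Comments are closed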

Understanding the Logic of Direct Preference Alignment through Logic

Posted on: March 28, 2025 | Author: jarxiv

Abstract: Recent direct preference alignment (DPA) algorithms, such as DPO, … large … Continue reading →

Categories: cs.CL | Comments are closed

Effective Skill Unlearning through Intervention and Abstention

Posted on: March 28, 2025 | Author: jarxiv

Abstract: Large language models (LLMs) have shown remarkable skills across a variety of domains … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed

MemInsight: Autonomous Memory Augmentation for LLM Agents

Posted on: March 28, 2025 | Author: jarxiv

Abstract: Large language model (LLM) agents process information intelligently, … Continue reading →

Categories: cs.CL | Comments are closed

MONO2REST: Identifying and Exposing Microservices: a Reusable RESTification Approach

Posted on: March 28, 2025 | Author: jarxiv

Abstract: For large-scale cloud applications, the microservices architectural style … Continue reading →

Categories: cs.AI, cs.SE | Comments are closed

Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models

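Posted on: March 28, 2025 | Author: jarxiv

Abstract: As the information retrieval (IR) field increasingly recognizes the importance of inclusivity, low-… Continue reading →

Categories: cs.AI, cs.CL | Comments are closed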
