Monthly Archives: April 2025

LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard

Posted on April 18, 2025 by jarxiv

Summary: This paper investigates the application of large language models (LLMs) to financial tasks …

Categories: cs.AI, cs.CL, cs.LG

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents

Posted on April 18, 2025 by jarxiv

Summary: Automatically building information retrieval (IR) evaluation benchmarks from community questions and answers …

Categories: cs.AI, cs.CL, cs.IR

Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

Posted on April 18, 2025 by jarxiv

Summary: In a wide range of LM applications, text that conforms to syntactic or semantic constraints …

Categories: cs.AI, cs.CL, cs.LG

A general language model for peptide identification

Posted on April 18, 2025 by jarxiv

Summary: Advances in peptide identification, and with them our ability to decipher protein function and accelerate therapeutic discovery …

Categories: 68T07, 92C40, cs.AI, cs.LG, I.2.6

Exploring Expert Failures Improves LLM Agent Tuning

Posted on April 18, 2025 by jarxiv

Summary: Large language models (LLMs) have shown great potential as agents …

Categories: cs.AI

Antidistillation Sampling

Posted on April 18, 2025 by jarxiv

Summary: Frontier models that generate extended reasoning traces can facilitate model distillation …

Categories: cs.AI, cs.CL

MIB: A Mechanistic Interpretability Benchmark

Posted on April 18, 2025 by jarxiv

Summary: How can we know whether new mechanistic interpretability methods achieve real improvements …

Categories: cs.AI, cs.CL, cs.LG

RUKA: Rethinking the Design of Humanoid Hands with Learning

Posted on April 18, 2025 by jarxiv

Summary: Dexterous manipulation is a fundamental capability of robotic systems, but precision, compactness, …

Categories: cs.AI, cs.RO

Sleep-time Compute: Beyond Inference Scaling at Test-time

Posted on April 18, 2025 by jarxiv

Summary: Scaling test-time compute, to enable large language models (LLMs) …

Categories: cs.AI, cs.CL

It’s All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

Posted on April 18, 2025 by jarxiv

Summary: Designing efficient and effective architectural backbones to enhance the capabilities of foundation models …

Categories: cs.AI, cs.LG
