「cs.LG」カテゴリーアーカイブ

Glivenko-Cantelli for $f$-divergence

投稿日: 2025年3月24日作成者: jarxiv

要約総変動距離の標準設定からすべての$ f $ divergencesまで、統 … 続きを読む →

カテゴリー: 60B10, 60F15, 60F25, cs.LG, math.ST, stat.TH | コメントを受け付けていません

Gumbel-Softmax Flow Matching with Straight-Through Guidance for Controllable Biological Sequence Generation

投稿日: 2025年3月24日作成者: jarxiv

要約連続シンプレックスのフローマッチングは、DNA配列設計の有望な戦略として浮 … 続きを読む →

カテゴリー: cs.LG, q-bio.BM | コメントを受け付けていません

Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs

投稿日: 2025年3月24日作成者: jarxiv

要約知識の蒸留は、教師の出力ロジットを事前に計算してキャッシュすることができる … 続きを読む →

カテゴリー: 68T50, cs.AI, cs.CL, cs.LG, I.2.7 | コメントを受け付けていません

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

投稿日: 2025年3月24日作成者: jarxiv

要約多くの場合、APIの形でツールを活用してツールを活用して、複雑なタスクでの … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks

投稿日: 2025年3月24日作成者: jarxiv

要約この研究は、ファイナンスおよび会計研究における大規模な言語モデル（LLM） … 続きを読む →

カテゴリー: cs.AI, cs.CE, cs.CL, cs.LG, q-fin.GN | コメントを受け付けていません

Token Dynamics: Towards Efficient and Dynamic Video Token Representation for Video Large Language Models

投稿日: 2025年3月24日作成者: jarxiv

要約トークンベースのビデオ表現は、大きな言語モデルがビデオコンテンツを解釈でき … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Building Multilingual Datasets for Predicting Mental Health Severity through LLMs: Prospects and Challenges

投稿日: 2025年3月24日作成者: jarxiv

要約大規模な言語モデル（LLM）は、メンタルヘルスサポートシステムを含むさまざ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

投稿日: 2025年3月24日作成者: jarxiv

要約 Sphinx-Xは、Sphinxで開発された広範なマルチモダリティ大手言語 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Optimizing Attention with Mirror Descent: Generalized Max-Margin Token Selection

投稿日: 2025年3月24日作成者: jarxiv

要約注意メカニズムは、自然言語処理やコンピュータービジョンなど、人工知能のいく … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Modifying Large Language Model Post-Training for Diverse Creative Writing

投稿日: 2025年3月24日作成者: jarxiv

要約創造的なライティングタスクには特異な正解がないため、これらのタスクを実行す … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Glivenko-Cantelli for $f$-divergence

Gumbel-Softmax Flow Matching with Straight-Through Guidance for Controllable Biological Sequence Generation

Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks

Token Dynamics: Towards Efficient and Dynamic Video Token Representation for Video Large Language Models

Building Multilingual Datasets for Predicting Mental Health Severity through LLMs: Prospects and Challenges

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Optimizing Attention with Mirror Descent: Generalized Max-Margin Token Selection

Modifying Large Language Model Post-Training for Diverse Creative Writing

最近の投稿

最近のコメント

アーカイブ

カテゴリー