投稿者「jarxiv」のアーカイブ

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

投稿日: 2025年4月3日作成者: jarxiv

要約高品質のデータフィルタリング、マルチモーダルデータ混合戦略、シーケンスパッ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Horizon Scans can be accelerated using novel information retrieval and artificial intelligence tools

投稿日: 2025年4月3日作成者: jarxiv

要約はじめに：Herizon Scanning in Healthcareは、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR | コメントを受け付けていません

Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish

投稿日: 2025年4月3日作成者: jarxiv

要約大規模な言語モデル（LLM）は、研究と社会全体でますます重要なツールになっ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Study of scaling laws in language families

投稿日: 2025年4月3日作成者: jarxiv

要約この記事では、言語家族内のスケーリング法則を調査し、6,000を超える言語 … 続きを読む →

カテゴリー: cs.CL, physics.soc-ph | コメントを受け付けていません

ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs

投稿日: 2025年4月3日作成者: jarxiv

要約大規模な言語モデル（LLM）のトレーニング後の段階で適用されるルールベース … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation

投稿日: 2025年4月3日作成者: jarxiv

要約コンテキスト内学習（ICL）は、大規模な言語モデル（LLM）にとって重要で … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

投稿日: 2025年4月3日作成者: jarxiv

要約 Financial LLMSは、金融タスクとドメイン固有のアプリケーション … 続きを読む →

カテゴリー: cs.CE, cs.CL, q-fin.CP | コメントを受け付けていません

OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models

投稿日: 2025年4月3日作成者: jarxiv

要約 Openthaigpt 1.6およびR1（OTG-1.6およびOTG-R1 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

投稿日: 2025年4月3日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、トレーニング前のデータに極端な言語の不均 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Efficient Constant-Space Multi-Vector Retrieval

投稿日: 2025年4月3日作成者: jarxiv

要約コルバートアーキテクチャによって例示された多面検索方法は、検索の潜在性と有 … 続きを読む →

カテゴリー: cs.CL, cs.IR | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Horizon Scans can be accelerated using novel information retrieval and artificial intelligence tools

Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish

Study of scaling laws in language families

ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs

InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models

Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

Efficient Constant-Space Multi-Vector Retrieval

最近の投稿

最近のコメント

アーカイブ

カテゴリー