「I.2.7」カテゴリーアーカイブ

$FastDoc$: Domain-Specific Fast Continual Pre-training Technique using Document-Level Metadata and Taxonomy

投稿日: 2024年11月4日作成者: jarxiv

要約本論文では、$FastDoc$(Fast Continual Pre-tr … 続きを読む →

カテゴリー: 68T50, cs.CL, cs.LG, I.2.7 | コメントを受け付けていません

CoTran: An LLM-based Code Translator using Reinforcement Learning with Feedback from Compiler and Symbolic Execution

投稿日: 2024年10月31日作成者: jarxiv

要約この論文では、LLM ベースのコード変換手法と、プログラム全体を 1 つの … 続きを読む →

カテゴリー: cs.AI, cs.PL, cs.SE, I.2.7 | コメントを受け付けていません

Distinguishing Ignorance from Error in LLM Hallucinations

投稿日: 2024年10月30日作成者: jarxiv

要約大規模言語モデル (LLM) は、根拠のない、事実に誤りがある、または前世 … 続きを読む →

カテゴリー: cs.CL, I.2.7 | コメントを受け付けていません

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

投稿日: 2024年10月29日作成者: jarxiv

要約大規模言語モデル (LLM) は、事実の不正確さ、偏見、推論の失敗などのエ … 続きを読む →

カテゴリー: 68T50, cs.AI, cs.CL, I.2.7 | コメントを受け付けていません

Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

投稿日: 2024年10月29日作成者: jarxiv

要約大規模言語モデル (LLM) は、堅牢な一般化可能なアルゴリズムを学習する … 続きを読む →

カテゴリー: 68T5, cs.CL, I.2.7 | コメントを受け付けていません

Large-scale cloze evaluation reveals that token prediction tasks are neither lexically nor semantically aligned

投稿日: 2024年10月29日作成者: jarxiv

要約この研究では、いくつかの言語モデルにおける次のトークン予測レベルでの生成動 … 続きを読む →

カテゴリー: cs.AI, cs.CL, I.2.7 | コメントを受け付けていません

Parsing Akkadian Verbs with Prolog

投稿日: 2024年10月17日作成者: jarxiv

要約この論文では、Prolog で実装された、接尾辞の追加が可能な、アッカド語 … 続きを読む →

カテゴリー: cs.CL, I.2.7 | コメントを受け付けていません

Tokenization and Morphology in Multilingual Language Models: A~Comparative Analysis of mT5 and ByT5

投稿日: 2024年10月16日作成者: jarxiv

要約形態論はトークン化に直接的な課題をもたらすため、多言語言語モデリングにとっ … 続きを読む →

カテゴリー: cs.CL, I.2.7 | コメントを受け付けていません

Everyday Speech in the Indian Subcontinent

投稿日: 2024年10月15日作成者: jarxiv

要約インドには 1,369 の言語があり、そのうち 22 が公用語です。これ … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS, I.2.7 | コメントを受け付けていません

Uplifting Lower-Income Data: Strategies for Socioeconomic Perspective Shifts in Large Multi-modal Models

投稿日: 2024年10月15日作成者: jarxiv

要約最近の研究では、トレーニングデータにおける文化と社会経済的グループの不平 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.CY, I.2.7 | コメントを受け付けていません

「I.2.7」カテゴリーアーカイブ

$FastDoc$: Domain-Specific Fast Continual Pre-training Technique using Document-Level Metadata and Taxonomy

CoTran: An LLM-based Code Translator using Reinforcement Learning with Feedback from Compiler and Symbolic Execution

Distinguishing Ignorance from Error in LLM Hallucinations

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

Large-scale cloze evaluation reveals that token prediction tasks are neither lexically nor semantically aligned

Parsing Akkadian Verbs with Prolog

Tokenization and Morphology in Multilingual Language Models: A~Comparative Analysis of mT5 and ByT5

Everyday Speech in the Indian Subcontinent

Uplifting Lower-Income Data: Strategies for Socioeconomic Perspective Shifts in Large Multi-modal Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー