Category archive: I.2.7

Heimdall: test-time scaling on the generative verification

Posted: April 17, 2025 | Author: jarxiv

Abstract: An AI system can create and maintain knowledge only to the extent that it can verify that knowledge itself …

Categories: cs.AI, I.2.7

Bypassing Prompt Injection and Jailbreak Detection in LLM Guardrails

Posted: April 17, 2025 | Author: jarxiv

Abstract: Large language model (LLM) guardrail systems, prompt injection and jailbreak …

Categories: cs.AI, cs.CR, cs.LG, I.2.7

Heimdall: test-time scaling on the generative verification

Posted: April 15, 2025 | Author: jarxiv

Abstract: An AI system can create and maintain knowledge only to the extent that it can verify that knowledge itself …

Categories: cs.AI, I.2.7

Performance of Large Language Models in Supporting Medical Diagnosis and Treatment

Posted: April 15, 2025 | Author: jarxiv

Abstract: Integrating large language models (LLMs) into healthcare enhances diagnostic accuracy, …

Categories: cs.AI, cs.CL, cs.ET, cs.HC, I.2.7

MedHal: An Evaluation Dataset for Medical Hallucination Detection

Posted: April 14, 2025 | Author: jarxiv

Abstract: MedHal, to evaluate whether models can detect hallucinations in medical text …

Categories: cs.AI, cs.CL, I.2.7

Role of Databases in GenAI Applications

Posted: April 14, 2025 | Author: jarxiv

Abstract: Generative AI (GenAI), intelligent content generation, automation, decision …

Categories: 97P30, cs.AI, cs.DB, I.2.7

Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining

Posted: April 11, 2025 | Author: jarxiv

Abstract: Reinforcement learning (RL)-based fine-tuning, for advanced mathematical reasoning and coding …

Categories: cs.LG, I.2.7

RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts

Posted: April 10, 2025 | Author: jarxiv

Abstract: This paper concerns the extraction of structured opinions from Russian news texts …

Categories: cs.CL, I.2.7

Outlier dimensions favor frequent tokens in language models

Posted: April 10, 2025 | Author: jarxiv

Abstract: Last-layer outlier dimensions, i.e., dimensions that show extreme activations for the majority of inputs …

Categories: cs.AI, cs.CL, I.2.7

Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task

Posted: April 9, 2025 | Author: jarxiv

Abstract: Human intention-based systems enable robots to recognize user actions and …

Categories: cs.AI, cs.HC, cs.RO, I.2.7
