"cs.CR" Category Archive

Distract Large Language Models for Automatic Jailbreak Attack

Posted: October 1, 2024 | Author: jarxiv

Summary: Prior to the public release of large language models (LLMs), aligning their behavior with human values …

Categories: cs.AI, cs.CL, cs.CR

Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey

Posted: October 1, 2024 | Author: jarxiv

Summary: Recent research shows that the emerging fine-tuning-as-a-service business model …

Categories: cs.AI, cs.CR, cs.LG

Read Over the Lines: Attacking LLMs and Toxicity Detection Systems with ASCII Art to Mask Profanity

Posted: October 1, 2024 | Author: jarxiv

Summary: Exploiting the inability of language models to interpret ASCII art, a new adversarial …

Categories: cs.AI, cs.CL, cs.CR

Bi-Directional Transformers vs. word2vec: Discovering Vulnerabilities in Lifted Compiled Code

Posted: September 30, 2024 | Author: jarxiv

Summary: When detecting vulnerabilities in compiled binaries, the high-level code structure …

Categories: cs.CL, cs.CR, cs.LG, cs.SE, I.2.6

Read Over the Lines: Attacking LLMs and Toxicity Detection Systems with ASCII Art to Mask Profanity

Posted: September 30, 2024 | Author: jarxiv

Summary: Exploiting the inability of language models to interpret ASCII art, a new adversarial …

Categories: cs.AI, cs.CL, cs.CR

LLM Detectors Still Fall Short of Real World: Case of LLM-Generated Short News-Like Posts

Posted: September 30, 2024 | Author: jarxiv

Summary: With the emergence of widely available, powerful LLMs, large language models (LLMs) …

Categories: cs.AI, cs.CL, cs.CR, cs.LG, I.2.7

Designing Short-Stage CDC-XPUFs: Balancing Reliability, Cost, and Security in IoT Devices

Posted: September 27, 2024 | Author: jarxiv

Summary: The rapid expansion of Internet of Things (IoT) devices requires robust, resource …

Categories: cs.CR, cs.LG

AC4: Algebraic Computation Checker for Circuit Constraints in ZKPs

Posted: September 27, 2024 | Author: jarxiv

Summary: Zero-knowledge proof (ZKP) systems are attracting attention, and in modern cryptography …

Categories: cs.CL, cs.CR, cs.SE

Weak-To-Strong Backdoor Attacks for LLMs with Contrastive Knowledge Distillation

Posted: September 27, 2024 | Author: jarxiv

Summary: Large language models (LLMs) are widely applied because of their excellent capabilities …

Categories: cs.AI, cs.CL, cs.CR

An Adversarial Perspective on Machine Unlearning for AI Safety

Posted: September 27, 2024 | Author: jarxiv

Summary: Large language models are fine-tuned to refuse questions about hazardous knowledge …

Categories: cs.AI, cs.CL, cs.CR, cs.LG
