"cs.CR" Category Archive

Input Reconstruction Attack against Vertical Federated Large Language Models

Posted on: November 27, 2023 | Author: jarxiv

Summary: Recently, with the emergence of ChatGPT, large language models (LLMs) have, in academia and …

Categories: cs.AI, cs.CL, cs.CR

Universal Jailbreak Backdoors from Poisoned Human Feedback

Posted on: November 27, 2023 | Author: jarxiv

Summary: Reinforcement learning from human feedback (RLHF) is, for large language mo …

Categories: cs.AI, cs.CL, cs.CR, cs.LG

FRAD: Front-Running Attacks Detection on Ethereum using Ternary Classification Model

Posted on: November 27, 2023 | Author: jarxiv

Summary: With the evolution of blockchain technology, particularly on platforms such as Ethereum …

Categories: cs.AI, cs.CR

Backdoor Activation Attack: Attack Large Language Models using Activation Steering for Safety-Alignment

Posted on: November 27, 2023 | Author: jarxiv

Summary: To ensure AI safety, instruction-tuned large language models (LLMs) …

Categories: cs.AI, cs.CL, cs.CR

Segment (Almost) Nothing: Prompt-Agnostic Adversarial Attacks on Segmentation Models

Posted on: November 27, 2023 | Author: jarxiv

Summary: General-purpose segmentation models, with visual prompts (points, boxes, etc.) …

Categories: cs.CR, cs.CV, cs.LG

Transfer Attacks and Defenses for Large Language Models on Coding Tasks

Posted on: November 23, 2023 | Author: jarxiv

Summary: Modern large language models (LLMs) such as ChatGPT, in writing code …

Categories: cs.CR, cs.LG

Differentially Private Non-Convex Optimization under the KL Condition with Optimal Rates

Posted on: November 23, 2023 | Author: jarxiv

Summary: $(\gamma,\kappa)$-Kurdyka-{\L}ojasiew …

Categories: cs.CR, cs.LG, math.OC, stat.ML

Explaining high-dimensional text classifiers

Posted on: November 23, 2023 | Author: jarxiv

Summary: Explainability has become a valuable tool in recent years, for humans' AI-based decision-making …

Categories: cs.CR, cs.LG, cs.NE, stat.ML

A Survey of Adversarial CAPTCHAs on its History, Classification and Generation

Posted on: November 23, 2023 | Author: jarxiv

Summary: For telling computers and humans apart, the completely automated public Turing tes …

Categories: cs.AI, cs.CR

From Principle to Practice: Vertical Data Minimization for Machine Learning

Posted on: November 23, 2023 | Author: jarxiv

Summary: To train and deploy predictive models, organizations, with large amounts of detailed customer data …

Categories: cs.AI, cs.CR, cs.LG
