"cs.CR" Category Archive

Certifiably Robust RAG against Retrieval Corruption

Posted: May 27, 2024 | Author: jarxiv

Abstract: Retrieval-augmented generation (RAG) has been found to be vulnerable to retrieval corruption attacks …

Categories: cs.CL, cs.CR, cs.LG

Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study

Posted: May 27, 2024 | Author: jarxiv

Abstract: Despite the various approaches employed to detect vulnerabilities, …

Categories: cs.AI, cs.CR, cs.SE

SynGhost: Imperceptible and Universal Task-agnostic Backdoor Attack in Pre-trained Language Models

Posted: May 27, 2024 | Author: jarxiv

Abstract: Pre-training, in deploying pre-trained language models (PLMs), …

Categories: cs.AI, cs.CL, cs.CR

Coordinated Disclosure for AI: Beyond Security Vulnerabilities

Posted: May 27, 2024 | Author: jarxiv

Abstract: Harm reporting in the field of artificial intelligence (AI) is currently carried out on an ad hoc basis, …

Categories: cs.AI, cs.CR, cs.CY

A First Look at GPT Apps: Landscape and Vulnerability

Posted: May 24, 2024 | Author: jarxiv

Abstract: Following OpenAI's introduction of GPTs, the rapid growth of GPT apps has led to dedicated …

Categories: cs.CL, cs.CR

Towards General Conceptual Model Editing via Adversarial Representation Engineering

Posted: May 24, 2024 | Author: jarxiv

Abstract: Since the development of large language models (LLMs) achieved remarkable success, their internal …

Categories: cs.AI, cs.CL, cs.CR, cs.LG, math.OC

Unified Neural Backdoor Removal with Only Few Clean Samples through Unlearning and Relearning

Posted: May 24, 2024 | Author: jarxiv

Abstract: Deep neural network models, in security-critical …

Categories: cs.AI, cs.CR

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Posted: May 24, 2024 | Author: jarxiv

Abstract: Text-to-image diffusion models have achieved tremendous success in the field of controllable image generation …

Categories: cs.CR, cs.CV

Hacking Predictors Means Hacking Cars: Using Sensitivity Analysis to Identify Trajectory Prediction Vulnerabilities for Autonomous Driving Security

Posted: May 22, 2024 | Author: jarxiv

Abstract: Adversarial attacks on learning-based multi-modal trajectory predictors have already been demonstrated …

Categories: cs.CR, cs.LG, cs.RO, cs.SY, eess.SY

Adversarial Attacks and Defenses in Automated Control Systems: A Comprehensive Benchmark

Posted: May 22, 2024 | Author: jarxiv

Abstract: By integrating machine learning into automated control systems (ACS), industrial process management …

Categories: cs.CR, cs.LG, cs.SY, eess.SY, I.2.1
