「cs.CV」カテゴリーアーカイブ

GSE: Group-wise Sparse and Explainable Adversarial Attacks

投稿日: 2024年11月26日作成者: jarxiv

要約まばらな敵対的攻撃は、多くの場合 $\ell_0$ ノルムによって正規化さ … 続きを読む →

カテゴリー: cs.CR, cs.CV, cs.LG, math.OC | コメントを受け付けていません

Generating Out-Of-Distribution Scenarios Using Language Models

投稿日: 2024年11月26日作成者: jarxiv

要約機械学習技術によって制御される自動運転車の導入には、現実世界の多様な環境で … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

J-CaPA : Joint Channel and Pyramid Attention Improves Medical Image Segmentation

投稿日: 2024年11月26日作成者: jarxiv

要約医療画像のセグメンテーションは、診断と治療計画に不可欠です。 U-Net … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Review of Mechanistic Models of Event Comprehension

投稿日: 2024年11月26日作成者: jarxiv

要約このレビューでは、談話理解理論から現代の出来事認識フレームワークへの進化を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Rethinking Diffusion for Text-Driven Human Motion Generation

投稿日: 2024年11月26日作成者: jarxiv

要約 2023 年以降、ベクトル量子化 (VQ) ベースの離散生成手法が人間のモ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

投稿日: 2024年11月26日作成者: jarxiv

要約 CLIP のようなマルチモーダルエンコーダは、ゼロショット画像分類やクロ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation

投稿日: 2024年11月26日作成者: jarxiv

要約新しく提案された Generalized Referring Expres … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification

投稿日: 2024年11月26日作成者: jarxiv

要約拡散ベースの浄化 (DBP) は、敵対的例 (AE) に対する防御であり、 … 続きを読む →

カテゴリー: cs.CR, cs.CV, cs.LG | コメントを受け付けていません

Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

投稿日: 2024年11月26日作成者: jarxiv

要約スケーラブルベクターグラフィックス (SVG) は、デジタルデザイン … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

GeoFormer: A Multi-Polygon Segmentation Transformer

投稿日: 2024年11月26日作成者: jarxiv

要約リモートセンシングでは、建物などのオブジェクトのスケール不変の形状を学習 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

GSE: Group-wise Sparse and Explainable Adversarial Attacks

Generating Out-Of-Distribution Scenarios Using Language Models

J-CaPA : Joint Channel and Pyramid Attention Improves Medical Image Segmentation

A Review of Mechanistic Models of Event Comprehension

Rethinking Diffusion for Text-Driven Human Motion Generation

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation

Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification

Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

GeoFormer: A Multi-Polygon Segmentation Transformer

最近の投稿

最近のコメント

アーカイブ

カテゴリー