Monthly Archives: June 2022

Identification via Retinal Vessels Combining LBP and HOG

Posted on: June 12, 2022 | Author: jarxiv

Abstract: With the development of information technology and the need for high security, various identification methods are used …

Categories: cs.CV

Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image Synthesis

Posted on: June 12, 2022 | Author: jarxiv

Abstract: In this work, for training-free zero-shot sketch-to-image synthesis …

Categories: cs.CV

Egocentric Video-Language Pretraining

Posted on: June 12, 2022 | Author: jarxiv

Abstract: To advance a wide range of video-text downstream tasks, transferable representations …

Categories: cs.AI, cs.CV

Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learning

Posted on: June 12, 2022 | Author: jarxiv

Abstract: Gradient-based meta-learning methods tend to overfit to the meta-training set, and …

Categories: cs.CV, cs.LG

Gradient Obfuscation Checklist Test Gives a False Sense of Security

Posted on: June 12, 2022 | Author: jarxiv

Abstract: One common group of defense techniques against adversarial attacks: in the network, stochastic …

Categories: cs.CV

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Posted on: June 12, 2022 | Author: jarxiv

Abstract: Transformer, the latest technological advance in deep learning, natural lang …

Categories: cs.CV

Exploring Visual Prompts for Adapting Large-Scale Models

Posted on: June 12, 2022 | Author: jarxiv

Abstract: We investigate the efficacy of visual prompts for adapting large-scale models in vision. …

Categories: cs.CV

A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge

Posted on: June 12, 2022 | Author: jarxiv

Abstract: The Visual Question Answering (VQA) task, jointly reasoning over visual and natural language inputs …

Categories: cs.CL, cs.CV

Revisiting the ‘Video’ in Video-Language Understanding

Posted on: June 12, 2022 | Author: jarxiv

Abstract: Beyond what can be understood from a single image, a video task is uniquely suited to video …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

SNAKE: Shape-aware Neural 3D Keypoint Field

Posted on: June 12, 2022 | Author: jarxiv

Abstract: Detecting 3D keypoints from point clouds is important for shape reconstruction, but …

Categories: cs.AI, cs.CV
