Monthly Archive: May 2024

OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better Practicality

Posted on May 16, 2024 by jarxiv

Abstract: Gait recognition is a rapidly advancing vision technology for identifying people from a distance …

Categories: cs.CV

Revisiting the Role of Language Priors in Vision-Language Models

Posted on May 16, 2024 by jarxiv

Abstract: One reason vision-language models (VLMs) are influential is that, without fine-tuning, …

Categories: cs.AI, cs.CL, cs.CV

A Hierarchically Feature Reconstructed Autoencoder for Unsupervised Anomaly Detection

Posted on May 16, 2024 by jarxiv

Abstract: Detecting and localizing anomalies without manual annotation or prior knowledge is an unsupervised …

Categories: 68T01, cs.CV, I.2.10

Curriculum Dataset Distillation

Posted on May 16, 2024 by jarxiv

Abstract: Because of their enormous computation and memory requirements, most dataset distillation methods, large-scale …

Categories: cs.CV

Scalable Image Coding for Humans and Machines Using Feature Fusion Network

Posted on May 16, 2024 by jarxiv

Abstract: As image recognition models become widespread, scalable coding for machines and humans …

Categories: cs.CV, cs.MM

Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions

Posted on May 16, 2024 by jarxiv

Abstract: A challenge in fine-grained visual categorization is investigating the subtle differences between subclasses …

Categories: cs.CV

Vector-Symbolic Architecture for Event-Based Optical Flow

Posted on May 16, 2024 by jarxiv

Abstract: From the perspective of feature matching, optical flow estimation for event cameras …

Categories: cs.CV, cs.SC

Transforming gradient-based techniques into interpretable methods

Posted on May 16, 2024 by jarxiv

Abstract: When convolutional neural networks (CNNs) are explained with xAI techniques, …

Categories: cs.AI, cs.CV, cs.LG

Flexible image analysis for law enforcement agencies with deep neural networks to determine: where, who and what

Posted on May 16, 2024 by jarxiv

Abstract: With the growing need for effective security measures and the integration of cameras into commercial products, …

Categories: cs.CV

OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Posted on May 16, 2024 by jarxiv

Abstract: Camera-only Bird’s-Eye-View (BEV) segmentation …

Categories: cs.CV, cs.LG
