"cs.CV" Category Archive

Style-Editor: Text-driven object-centric style editing

Posted: April 9, 2025 | Author: jarxiv

Abstract: Using text input to guide style editing at the object-centric level, a new …

Categories: cs.CV

OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model

Posted: April 9, 2025 | Author: jarxiv

Abstract: Multimodal remote sensing image registration, for the purpose of data fusion and analysis, …

Categories: cs.CV, eess.IV

Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks

Posted: April 9, 2025 | Author: jarxiv

Abstract: Human face data, for face recognition, age estimation, gender identification, emotion analysis, race classification, and similar tasks, …

Categories: cs.CV, cs.LG

Enhanced Anomaly Detection for Capsule Endoscopy Using Ensemble Learning Strategies

Posted: April 9, 2025 | Author: jarxiv

Abstract: Capsule endoscopy captures images of the gastrointestinal tract and, examined with standard endoscopes, …

Categories: cs.CV

Self-Supervised Siamese Autoencoders

Posted: April 9, 2025 | Author: jarxiv

Abstract: In contrast to fully supervised models, self-supervised representation learning, with respect to labeling, …

Categories: cs.CV, cs.LG, stat.ML

A Multi-Scale Feature Fusion Framework Integrating Frequency Domain and Cross-View Attention for Dual-View X-ray Security Inspections

Posted: April 9, 2025 | Author: jarxiv

Abstract: With the rapid development of modern transportation systems and the exponential growth of logistics volumes, intelli …

Categories: cs.CV

MAPLE: Encoding Dexterous Robotic Manipulation Priors Learned From Egocentric Videos

Posted: April 9, 2025 | Author: jarxiv

Abstract: Large-scale egocentric video datasets, across a wide range of scenarios, diverse …

Categories: cs.CV, cs.RO

MCAT: Visual Query-Based Localization of Standard Anatomical Clips in Fetal Ultrasound Videos Using Multi-Tier Class-Aware Token Transformer

Posted: April 9, 2025 | Author: jarxiv

Abstract: Accurate standard plane acquisition in fetal ultrasound (US) videos, for fetal growth assessment, anomaly …

Categories: cs.AI, cs.CV, cs.LG

Towards Varroa destructor mite detection using a narrow spectra illumination

Posted: April 9, 2025 | Author: jarxiv

Abstract: In this paper, U-Net, a semantic segmentation architec …

Categories: cs.AI, cs.CV, cs.LG

VIRES: Video Instance Repainting via Sketch and Text Guided Generation

Posted: April 9, 2025 | Author: jarxiv

Abstract: A video instance repainting method with sketch and text guidance, …

Categories: cs.CV

