Monthly Archive: May 2025

SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training

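Posted: May 6, 2025 by jarxiv

Summary: This paper presents SiMHand, which draws on in-the-wild hand images that share similar hand characteristics … Continue reading →

Categories: cs.CV | Comments are closed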

Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal Large Language Models

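Posted: May 6, 2025 by jarxiv

Summary: With recent advances in industrial visual anomaly detection, while maintaining fast inference speeds, anomalies … Continue reading →

Categories: cs.CV | Comments are closed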

DeepSparse: A Foundation Model for Sparse-View CBCT Reconstruction

Posted: May 6, 2025 by jarxiv

Summary: Cone-beam computed tomography (CBCT) is an important 3D imaging technique in the medical field … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed

MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Posted: May 6, 2025 by jarxiv

Summary: Diffusion models have shown excellent performance in text-to-image generation. However, … Continue reading →

Categories: cs.CV | Comments are closed

FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition

Posted: May 6, 2025 by jarxiv

Summary: In federated learning, decentralized clients keep all training data local while … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments are closed

Sim2Real in endoscopy segmentation with a novel structure aware image translation

Posted: May 6, 2025 by jarxiv

Summary: Automatic segmentation of anatomical landmarks in endoscopic images, for doctors and … Continue reading →

Categories: cs.CV, I.2.10 | Comments are closed

Grasp the Graph (GtG) 2.0: Ensemble of GNNs for High-Precision Grasp Pose Detection in Clutter

Posted: May 6, 2025 by jarxiv

Summary: Grasp pose detection in cluttered real-world environments, with noisy and incomplete sensory data and … Continue reading →

Categories: cs.CV, cs.LG, cs.RO | Comments are closed

Multimodal Deep Learning for Stroke Prediction and Detection using Retinal Imaging and Clinical Data

Posted: May 6, 2025 by jarxiv

Summary: Stroke is a major public health problem that affects millions of people worldwide. … Continue reading →

Categories: cs.CV | Comments are closed

Enhancing person re-identification via Uncertainty Feature Fusion Method and Auto-weighted Measure Combination

Posted: May 6, 2025 by jarxiv

Summary: Person re-identification (Re-ID), in surveillance systems, across different camera views … Continue reading →

Categories: cs.CV | Comments are closed

Active Data Curation Effectively Distills Large-Scale Multimodal Models

Posted: May 6, 2025 by jarxiv

Summary: Knowledge distillation (KD), for compressing large models into smaller ones, is a … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed
