月別アーカイブ: 2024年8月

Caltech Aerial RGB-Thermal Dataset in the Wild

投稿日: 2024年8月2日作成者: jarxiv

要約私たちは、自然環境で動作する航空ロボット用に設計された、初の一般公開された … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Outlier Detection in Large Radiological Datasets using UMAP

投稿日: 2024年8月2日作成者: jarxiv

要約機械学習アルゴリズムの成功は、サンプルの品質とそれに対応するラベルの精度に … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

DDU-Net: A Domain Decomposition-based CNN for High-Resolution Image Segmentation on Multiple GPUs

投稿日: 2024年8月2日作成者: jarxiv

要約超高解像度画像のセグメンテーションには、空間情報の損失や計算効率の低下など … 続きを読む →

カテゴリー: 65N55, 68T07, 68U10, 68W10, 68W15, cs.CV, cs.DC, cs.LG, I.2.6 | コメントを受け付けていません

Quantum Hamiltonian Embedding of Images for Data Reuploading Classifiers

投稿日: 2024年8月2日作成者: jarxiv

要約量子コンピューティングを機械学習タスクに適用する場合、最初に考慮すべきこと … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, quant-ph | コメントを受け付けていません

A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking

投稿日: 2024年8月2日作成者: jarxiv

要約マルチモーダルエンティティリンク (MEL) は、マルチモーダル情報を … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network

投稿日: 2024年8月2日作成者: jarxiv

要約超解像度再構成技術では、ソフトウェアアルゴリズムを使用して、同じシーンか … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

投稿日: 2024年8月2日作成者: jarxiv

要約シーングラフ生成 (SGG) モデルは、長い尾の述語分布やアノテーション … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models

投稿日: 2024年8月2日作成者: jarxiv

要約最近、Web 画像の台頭により、大規模な画像データセットの管理と理解の重要 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets

投稿日: 2024年8月2日作成者: jarxiv

要約 Vision Transformer (ViT) は、Transforme … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SpatialBot: Precise Spatial Understanding with Vision Language Models

投稿日: 2024年8月2日作成者: jarxiv

要約ビジョン言語モデル (VLM) は、2D 画像理解において目覚ましいパフォ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年8月

Caltech Aerial RGB-Thermal Dataset in the Wild

Outlier Detection in Large Radiological Datasets using UMAP

DDU-Net: A Domain Decomposition-based CNN for High-Resolution Image Segmentation on Multiple GPUs

Quantum Hamiltonian Embedding of Images for Data Reuploading Classifiers

A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking

Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models

Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets

SpatialBot: Precise Spatial Understanding with Vision Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー