月別アーカイブ: 2024年8月

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities

投稿日: 2024年8月15日作成者: jarxiv

要約モデルのマージは、機械学習コミュニティにおける効率的なエンパワーメント手法 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Data Science for Geographic Information Systems

投稿日: 2024年8月15日作成者: jarxiv

要約データサイエンスを地理情報システム (GIS) に統合することで、これら … 続きを読む →

カテゴリー: cs.CV, eess.IV, I.2.10, physics.geo-ph | コメントを受け付けていません

G$^2$V$^2$former: Graph Guided Video Vision Transformer for Face Anti-Spoofing

投稿日: 2024年8月15日作成者: jarxiv

要約なりすましの顔を含むビデオでは、測光異常または動的異常のいずれか、あるいは … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Robust Curve Detection in Volumetric Medical Imaging via Attraction Field

投稿日: 2024年8月15日作成者: jarxiv

要約身体部分の形状を理解することは、正確な医療診断にとって非常に重要です。曲 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Spitting Image: Modular Superpixel Tokenization in Vision Transformers

投稿日: 2024年8月15日作成者: jarxiv

要約 Vision Transformer (ViT) アーキテクチャは伝統的に … 続きを読む →

カテゴリー: 68T45, cs.AI, cs.CV, cs.LG, I.2.10 | コメントを受け付けていません

NIGHT — Non-Line-of-Sight Imaging from Indirect Time of Flight Data

投稿日: 2024年8月15日作成者: jarxiv

要約カメラの視線外の物体の取得は、非常に興味深いものですが、非常に挑戦的な研究 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

RSD-DOG : A New Image Descriptor based on Second Order Derivatives

投稿日: 2024年8月15日作成者: jarxiv

要約この論文では、二次画像統計/導関数に基づいた新しい強力な画像パッチ記述子を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

投稿日: 2024年8月15日作成者: jarxiv

要約 GPT-4V(ision)による画像設計・自動生成を用いたマルチモーダルな … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Detecting Near-Duplicate Face Images

投稿日: 2024年8月15日作成者: jarxiv

要約フォトメトリックおよび幾何学的な変換を繰り返し適用すると、元のイメージの知 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

End-to-end Semantic-centric Video-based Multimodal Affective Computing

投稿日: 2024年8月15日作成者: jarxiv

要約汎用人工知能 (AGI) への道において、人間の愛情を理解することは、機械 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

月別アーカイブ: 2024年8月

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities

Data Science for Geographic Information Systems

G$^2$V$^2$former: Graph Guided Video Vision Transformer for Face Anti-Spoofing

Robust Curve Detection in Volumetric Medical Imaging via Attraction Field

A Spitting Image: Modular Superpixel Tokenization in Vision Transformers

NIGHT — Non-Line-of-Sight Imaging from Indirect Time of Flight Data

RSD-DOG : A New Image Descriptor based on Second Order Derivatives

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

Detecting Near-Duplicate Face Images

End-to-end Semantic-centric Video-based Multimodal Affective Computing

最近の投稿

最近のコメント

アーカイブ

カテゴリー