Monthly Archives: February 2025

Enhancing Power Grid Inspections with Machine Learning

Posted on: February 19, 2025 | Author: jarxiv

Summary: As global energy demand continues to grow, the safety and reliability of the power grid …

Categories: cs.CV

LieRE: Generalizing Rotary Position Encodings

Posted on: February 19, 2025 | Author: jarxiv

Summary: Transformer architectures capture token dependencies through positional …

Categories: cs.CV, cs.LG

VLMaterial: Procedural Material Generation with Large Vision-Language Models

Posted on: February 19, 2025 | Author: jarxiv

Summary: Procedural materials, represented as functional node graphs, and the appearance of photorealistic materials …

Categories: cs.CV, cs.GR

Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection

Posted on: February 19, 2025 | Author: jarxiv

Summary: Hateful memes have become a significant concern on the internet, and robust automated detection …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

A Unified Framework for Event-based Frame Interpolation with Ad-hoc Deblurring in the Wild

Posted on: February 19, 2025 | Author: jarxiv

Summary: Effective video frame interpolation depends on skillful handling of motion in the input scene …

Categories: cs.CV

RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird’s Eye View for 3D Object Detection

Posted on: February 19, 2025 | Author: jarxiv

Summary: Recent low-cost radar-camera approaches in multimodal 3D object …

Categories: cs.CV

Multi-scale Attention Guided Pose Transfer

Posted on: February 19, 2025 | Author: jarxiv

Summary: Pose transfer refers to the task of, given another image of a person in a different pose, a previously unseen …

Categories: cs.CV, cs.MM

BenthicNet: A global compilation of seafloor images for deep learning applications

Posted on: February 19, 2025 | Author: jarxiv

Summary: With advances in underwater imaging, the extensive seafloor imagery needed to monitor critical benthic ecosystems …

Categories: cs.CV, cs.LG

TIPS: Text-Induced Pose Synthesis

Posted on: February 19, 2025 | Author: jarxiv

Summary: In computer vision, human pose synthesis and transfer, given the person's already available …

Categories: cs.CV, cs.MM

L4P: Low-Level 4D Vision Perception Unified

Posted on: February 19, 2025 | Author: jarxiv

Summary: The spatio-temporal relationships between the pixels of a video carry critical information for low-level 4D perception …

Categories: cs.CV
