投稿者「jarxiv」のアーカイブ

Unwarping Screen Content Images via Structure-texture Enhancement Network and Transformation Self-estimation

投稿日: 2025年4月22日作成者: jarxiv

要約既存の暗黙的なニューラルネットワークベースの画像の巻き上げメソッドは、自然 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Improving Sound Source Localization with Joint Slot Attention on Image and Audio

投稿日: 2025年4月22日作成者: jarxiv

要約サウンドソースのローカリゼーション（SSL）は、画像内の音源を見つけるタス … 続きを読む →

カテゴリー: cs.CV, cs.SD | コメントを受け付けていません

Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations

投稿日: 2025年4月22日作成者: jarxiv

要約この作業では、整流されたステレオ画像ペアからの表面正常推定の新しい方法を導 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video

投稿日: 2025年4月22日作成者: jarxiv

要約私たちは、エンドツーエンドの方法でぼやけた単眼動画からのシャープで高品質の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A General Infrastructure and Workflow for Quadrotor Deep Reinforcement Learning and Reality Deployment

投稿日: 2025年4月22日作成者: jarxiv

要約構造化されていない屋外環境でロボット学習方法を四輪に展開することはエキサイ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

投稿日: 2025年4月22日作成者: jarxiv

要約表現セグメンテーション（RES）を参照するなどのタスクを含むピクセルの接地 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models

投稿日: 2025年4月22日作成者: jarxiv

要約このペーパーでは、EasyEdit2を紹介します。これは、大規模な言語モデ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.HC, cs.LG | コメントを受け付けていません

Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation

投稿日: 2025年4月22日作成者: jarxiv

要約カテゴリレベルのオブジェクトのポーズ推定は、事前に定義されたカテゴリから以 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

投稿日: 2025年4月22日作成者: jarxiv

要約高解像度の拡散モデルを加速するための自動エンコーダーモデルの新しいファミリ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

‘I Know It When I See It’: Mood Spaces for Connecting and Expressing Visual Concepts

投稿日: 2025年4月22日作成者: jarxiv

要約複雑な概念を表現することは、ラベル付けまたは定量化できる場合は簡単ですが、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Unwarping Screen Content Images via Structure-texture Enhancement Network and Transformation Self-estimation

Improving Sound Source Localization with Joint Slot Attention on Image and Audio

Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations

MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video

A General Infrastructure and Workflow for Quadrotor Deep Reinforcement Learning and Reality Deployment

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models

Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

‘I Know It When I See It’: Mood Spaces for Connecting and Expressing Visual Concepts

最近の投稿

最近のコメント

アーカイブ

カテゴリー