月別アーカイブ: 2025年4月

OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation

投稿日: 2025年4月30日作成者: jarxiv

要約テーブル構造の認識は、ドキュメント分析の重要なタスクです。ただし、変形テ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction

投稿日: 2025年4月30日作成者: jarxiv

要約 RGB入力のみを使用して、高速かつ正確な単眼シーンの再構成を実現する幾何学 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion

投稿日: 2025年4月30日作成者: jarxiv

要約ダイアディックな会話における現実的なリスナーの顔の動きを生成することは、高 … 続きを読む →

カテゴリー: cs.CV, cs.HC | コメントを受け付けていません

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

投稿日: 2025年4月30日作成者: jarxiv

要約命令ベースの画像編集により、自然言語プロンプトを介した堅牢な画像変更が可能 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Exploring AI-based System Design for Pixel-level Protected Health Information Detection in Medical Images

投稿日: 2025年4月30日作成者: jarxiv

要約医療画像の識別は、研究および臨床環境でのデータ共有中にプライバシーを確 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions

投稿日: 2025年4月30日作成者: jarxiv

要約人間の知性は、視覚と言語の両方の説明に従って、すべての人を取得できます。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SemEval-2025 Task 1: AdMIRe — Advancing Multimodal Idiomaticity Representation

投稿日: 2025年4月30日作成者: jarxiv

要約慣用的な表現は、NLPにユニークな課題を提示します。その意味は、構成要素の … 続きを読む →

カテゴリー: cs.CL, cs.CV, I.2.7 | コメントを受け付けていません

Practical solutions to the relative pose of three calibrated cameras

投稿日: 2025年4月30日作成者: jarxiv

要約 4つのポイント通信から3つの較正カメラの相対的なポーズを推定するという挑戦 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Learning a General Model: Folding Clothing with Topological Dynamics

投稿日: 2025年4月30日作成者: jarxiv

要約高度の自由度と衣服の複雑な構造は、衣服の操作に大きな課題をもたらします。 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

4D mmWave Radar for Sensing Enhancement in Adverse Environments: Advances and Challenges

投稿日: 2025年4月30日作成者: jarxiv

要約インテリジェントな輸送システムには、正確で信頼できるセンシングが必要です。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年4月

OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation

HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction

Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

Exploring AI-based System Design for Pixel-level Protected Health Information Detection in Medical Images

Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions

SemEval-2025 Task 1: AdMIRe — Advancing Multimodal Idiomaticity Representation

Practical solutions to the relative pose of three calibrated cameras

Learning a General Model: Folding Clothing with Topological Dynamics

4D mmWave Radar for Sensing Enhancement in Adverse Environments: Advances and Challenges

最近の投稿

最近のコメント

アーカイブ

カテゴリー