Monthly Archives: April 2025

CLIP-SLA: Parameter-Efficient CLIP Adaptation for Continuous Sign Language Recognition

Posted on: April 3, 2025 | Author: jarxiv

Summary: Continuous sign language recognition (CSLR) is the interpretation and transcription of sign language gestures in video …

Category: cs.CV

Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation

Posted on: April 3, 2025 | Author: jarxiv

Summary: 3D point cloud semantic segmentation (PCSS) is, in robot …

Categories: cs.CV, cs.RO

FriendNet: Detection-Friendly Dehazing Network

Posted on: April 3, 2025 | Author: jarxiv

Summary: Adverse weather conditions often degrade the quality of captured images, inevitably …

Category: cs.CV

InvFussion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems

Posted on: April 3, 2025 | Author: jarxiv

Summary: Diffusion models have shown remarkable capabilities in handling inverse problems, with high-quality posterior samp …

Category: cs.CV

Towards Physically Plausible Video Generation via VLM Planning

Posted on: April 3, 2025 | Author: jarxiv

Summary: Video diffusion models (VDMs) have advanced significantly in recent years, with highly realistic video …

Categories: cs.AI, cs.CV

Why Autonomous Vehicles Are Not Ready Yet: A Multi-Disciplinary Review of Problems, Attempted Solutions, and Future Directions

Posted on: April 3, 2025 | Author: jarxiv

Summary: Personal autonomous vehicles sense their surrounding environment, plan their routes, and, of human drivers, …

Categories: cs.CV, cs.RO

DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation

Posted on: April 3, 2025 | Author: jarxiv

Summary: In this paper, a training-free pa … that can exploit adaptive temporal compression in latent space …

Categories: cs.AI, cs.CV

GSR4B: Biomass Map Super-Resolution with Sentinel-1/2 Guidance

Posted on: April 3, 2025 | Author: jarxiv

Summary: Accurate above-ground biomass (AGB) mapping at both large scale and high spatial resolution …

Category: cs.CV

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

Posted on: April 3, 2025 | Author: jarxiv

Summary: Interleaved image-text generation, given a query, interleaved vis …

Category: cs.CV

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Posted on: April 3, 2025 | Author: jarxiv

Summary: Recent image-based human animation methods, in realistic body and facial motion …

Categories: cs.AI, cs.CV
