「cs.CV」カテゴリーアーカイブ

SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction

投稿日: 2024年12月5日作成者: jarxiv

要約近年、低解像度の入力から高解像度の画像を生成する超解像（SR）に大きな関心 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

A Bidirectional Siamese Recurrent Neural Network for Accurate Gait Recognition Using Body Landmarks

投稿日: 2024年12月5日作成者: jarxiv

要約歩行認識は、特に他の生理学的バイオメトリクスが実用的でない、あるいは有効で … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Distillation of Diffusion Features for Semantic Correspondence

投稿日: 2024年12月5日作成者: jarxiv

要約画像の異なる部分間の関係を決定するタスクである意味的対応は、3D再構成、画 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

KKLIP: Knowledge Distillation Exploiting K-means Clustering for Language-Image Pre-Training

投稿日: 2024年12月5日作成者: jarxiv

要約近年、CLIPは、マルチモーダルなシナリオにおいて画像とテキスト情報を整合 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion

投稿日: 2024年12月5日作成者: jarxiv

要約拡散モデルは、その強い学習安定性と高い補完品質により、3D LiDARシー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

投稿日: 2024年12月5日作成者: jarxiv

要約我々は、視覚中心のアプローチで設計されたマルチモーダルLLM（MLLM）フ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

投稿日: 2024年12月5日作成者: jarxiv

要約近年の生成モデルの進歩により、マルチビューデータからの新規ビュー合成（NV … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Boosting Latent Diffusion with Flow Matching

投稿日: 2024年12月5日作成者: jarxiv

要約近年、視覚合成の性能は大きく飛躍しているが、これは主に生成モデルの飛躍的な … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter

投稿日: 2024年12月5日作成者: jarxiv

要約本論文では、強いローリングシャッター（RS）効果を補正できる、ライトフィー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention

投稿日: 2024年12月5日作成者: jarxiv

要約自律走行トレーニングのためのマルチビュー映像の生成は最近注目を集めており、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction

A Bidirectional Siamese Recurrent Neural Network for Accurate Gait Recognition Using Body Landmarks

Distillation of Diffusion Features for Semantic Correspondence

KKLIP: Knowledge Distillation Exploiting K-means Clustering for Language-Image Pre-Training

Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Boosting Latent Diffusion with Flow Matching

Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter

Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention

最近の投稿

最近のコメント

アーカイブ

カテゴリー