「cs.CV」カテゴリーアーカイブ

GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection

投稿日: 2025年3月12日作成者: jarxiv

要約 LIDARベースの3D検出器には、トレーニングのために大きなデータセットが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark

投稿日: 2025年3月12日作成者: jarxiv

要約大規模なマルチモーダルモデル（LMM）は、単一の画像に対して視覚的な質問を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Rethinking Diffusion Model in High Dimension

投稿日: 2025年3月12日作成者: jarxiv

要約次元の呪いは、統計的確率モデルでは避けられない課題ですが、拡散モデルはこの … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input

投稿日: 2025年3月12日作成者: jarxiv

要約 Virtual Try-On（VITON）の最近の進歩により、強力なテキス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On

投稿日: 2025年3月12日作成者: jarxiv

要約 Virtual Try-On（VTO）の最近の進歩は、現実的な画像を生成し … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Task-Oriented Co-Design of Communication, Computing, and Control for Edge-Enabled Industrial Cyber-Physical Systems

投稿日: 2025年3月12日作成者: jarxiv

要約このペーパーでは、ミッションクリティカルな産業サイバー物理システム（CPS … 続きを読む →

カテゴリー: cs.CV, cs.IT, eess.IV, math.IT | コメントを受け付けていません

Generating Robot Constitutions & Benchmarks for Semantic Safety

投稿日: 2025年3月12日作成者: jarxiv

要約最近まで、ロボットの安全研究は、主に衝突回避とロボットのすぐ近くの危険の減 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.CY, cs.HC, cs.RO | コメントを受け付けていません

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

投稿日: 2025年3月12日作成者: jarxiv

要約マルチビュー拡散モデルは、一般的なオブジェクトの画像から3Dの生成でかなり … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

投稿日: 2025年3月12日作成者: jarxiv

要約生成モデリングのためのビデオ埋め込み装置の学習に関する新しい視点を提示しま … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting

投稿日: 2025年3月12日作成者: jarxiv

要約ベクター量子化（VQ）は、特に極端な圧縮シナリオでは、多様なモデル全体で均 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection

Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark

Rethinking Diffusion Model in High Dimension

MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input

TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On

Task-Oriented Co-Design of Communication, Computing, and Control for Edge-Enabled Industrial Cyber-Physical Systems

Generating Robot Constitutions & Benchmarks for Semantic Safety

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting

最近の投稿

最近のコメント

アーカイブ

カテゴリー