月別アーカイブ: 2025年2月

L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration

投稿日: 2025年2月6日作成者: jarxiv

要約ポイントクラウド登録は、コンピュータービジョンとロボット工学の多くのアプリ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer

投稿日: 2025年2月6日作成者: jarxiv

要約ポーズガイド付きの個人画像合成（PGPI）は、指定されたターゲットポーズ（ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

A Temporal Convolutional Network-Based Approach and a Benchmark Dataset for Colonoscopy Video Temporal Segmentation

投稿日: 2025年2月6日作成者: jarxiv

要約大腸内視鏡検査のコンピューター支援検出および診断システムの最近の進歩に続い … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites

投稿日: 2025年2月6日作成者: jarxiv

要約 Tsetlinマシン（TM）は、MNIST、K-MNIST、F-MNIST … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models

投稿日: 2025年2月6日作成者: jarxiv

要約視力を脅かす眼疾患の有病率は重大な世界的な負担であり、多くの場合、診断され … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Masked Autoencoders Are Effective Tokenizers for Diffusion Models

投稿日: 2025年2月6日作成者: jarxiv

要約潜在的な拡散モデルの最近の進歩により、高解像度の画像合成に対する有効性が実 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics

投稿日: 2025年2月6日作成者: jarxiv

要約大規模なモデルの最近の進歩により、画像から3Dの再構成が大幅に進歩していま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living

投稿日: 2025年2月6日作成者: jarxiv

要約 Clipのようなビジョン言語モデルの導入により、目に見えないビデオや人間の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Seeing World Dynamics in a Nutshell

投稿日: 2025年2月6日作成者: jarxiv

要約私たちは、空間的に一時的に一貫した方法で、さりげなくキャプチャされたモノク … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.MM | コメントを受け付けていません

OverThink: Slowdown Attacks on Reasoning LLMs

投稿日: 2025年2月6日作成者: jarxiv

要約 LLMS-We Forceモデルの推論に依存しているアプリケーションのオー … 続きを読む →

カテゴリー: cs.CR, cs.LG | コメントを受け付けていません

月別アーカイブ: 2025年2月

L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration

TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer

A Temporal Convolutional Network-Based Approach and a Benchmark Dataset for Colonoscopy Video Temporal Segmentation

An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models

Masked Autoencoders Are Effective Tokenizers for Diffusion Models

Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics

SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living

Seeing World Dynamics in a Nutshell

OverThink: Slowdown Attacks on Reasoning LLMs

最近の投稿

最近のコメント

アーカイブ

カテゴリー