「cs.CV」カテゴリーアーカイブ

UNB StepUP: A footStep database for gait analysis and recognition using Underfoot Pressure

投稿日: 2025年2月27日作成者: jarxiv

要約歩行とは、歩行中に生成される四肢の動きのパターンを指します。これは、物理的 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

投稿日: 2025年2月27日作成者: jarxiv

要約高解像度の拡散モデルを加速するための自動エンコーダーモデルの新しいファミリ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes

投稿日: 2025年2月27日作成者: jarxiv

要約テキストからイメージ（T2I）モデルによって生成された画像は、しばしば文化 … 続きを読む →

カテゴリー: cs.CV, cs.CY, cs.LG | コメントを受け付けていません

ARCON: Advancing Auto-Regressive Continuation for Driving Videos

投稿日: 2025年2月27日作成者: jarxiv

要約オートエレクッシブ大型言語モデル（LLMS）の最近の進歩により、ビデオ生成 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3D ReX: Causal Explanations in 3D Neuroimaging Classification

投稿日: 2025年2月27日作成者: jarxiv

要約説明可能性は、医療イメージングにおけるAIモデルにとって重要な問題のままで … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis

投稿日: 2025年2月27日作成者: jarxiv

要約マルチモーダル磁気共鳴画像（MRI）は、脳の解剖学と病理に関する補完的な情 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

投稿日: 2025年2月27日作成者: jarxiv

要約ドメイン固有の定理を理解するには、多くの場合、単なるテキストベースの推論以 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models

投稿日: 2025年2月27日作成者: jarxiv

要約画像のシーケンス上の推論は、マルチモーダルの大手言語モデル（MLLMS）に … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Aligned Datasets Improve Detection of Latent Diffusion-Generated Images

投稿日: 2025年2月27日作成者: jarxiv

要約潜在的な拡散モデル（LDM）が画像生成機能を民主化するにつれて、偽の画像を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GHOST 2.0: generative high-fidelity one shot transfer of heads

投稿日: 2025年2月27日作成者: jarxiv

要約フェイススワッピングのタスクは最近、研究コミュニティで注目を集めていますが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

UNB StepUP: A footStep database for gait analysis and recognition using Underfoot Pressure

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes

ARCON: Advancing Auto-Regressive Continuation for Driving Videos

3D ReX: Causal Explanations in 3D Neuroimaging Classification

Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models

Aligned Datasets Improve Detection of Latent Diffusion-Generated Images

GHOST 2.0: generative high-fidelity one shot transfer of heads

最近の投稿

最近のコメント

アーカイブ

カテゴリー