投稿者「jarxiv」のアーカイブ

Targeted Forgetting of Image Subgroups in CLIP Models

投稿日: 2025年6月4日作成者: jarxiv

要約 CLIPのような基盤モデル(FM)は、大規模な教師なし事前学習を活用するこ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Controllable Human-centric Keyframe Interpolation with Generative Prior

投稿日: 2025年6月4日作成者: jarxiv

要約既存の補間手法は、疎にサンプリングされたキーフレーム間の中間フレームを生成 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers

投稿日: 2025年6月4日作成者: jarxiv

要約人間の3D再構成とアニメーションは、コンピュータグラフィックスとビジョンに … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation

投稿日: 2025年6月4日作成者: jarxiv

要約最近のAI生成コンテンツ（AIGC）の進歩により、アニメーション制作が大幅 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Native-Resolution Image Synthesis

投稿日: 2025年6月4日作成者: jarxiv

要約任意の解像度とアスペクト比の画像合成を可能にする、新しい生成モデリングパラ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding

投稿日: 2025年6月4日作成者: jarxiv

要約我々は、SA-Radar (Simulate Any Radar)を発表す … 続きを読む →

カテゴリー: cs.CV, eess.SP | コメントを受け付けていません

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

投稿日: 2025年6月4日作成者: jarxiv

要約空間推論は認知心理学の重要な側面であり、現在の視覚言語モデル（VLM）の大 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation

投稿日: 2025年6月4日作成者: jarxiv

要約ラージ・ランゲージ・モデル（LLM）とマルチモーダルLLMはSVG処理に有 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

CamCloneMaster: Enabling Reference-based Camera Control for Video Generation

投稿日: 2025年6月4日作成者: jarxiv

要約表現力豊かで映画的な映像を生成するためには、カメラの制御が重要である。既存 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval

投稿日: 2025年6月4日作成者: jarxiv

要約近年のインタラクティブビデオ生成の進歩は有望な結果を示しているが、既存のア … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Targeted Forgetting of Image Subgroups in CLIP Models

Controllable Human-centric Keyframe Interpolation with Generative Prior

HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers

AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation

Native-Resolution Image Synthesis

Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation

CamCloneMaster: Enabling Reference-based Camera Control for Video Generation

Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval

最近の投稿

最近のコメント

アーカイブ

カテゴリー