月別アーカイブ: 2024年6月

Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust Classifiers

投稿日: 2024年6月27日作成者: jarxiv

要約堅牢性を向上させるための敵対的トレーニング戦略に関する広範な研究にもかかわ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference

投稿日: 2024年6月27日作成者: jarxiv

要約人間は、単一のクエリと参照画像のペアだけが与えられれば、ラベルやトレーニン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

投稿日: 2024年6月27日作成者: jarxiv

要約最近の大規模生成モデルの急増により、コンピュータービジョンの広大な分野の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality

投稿日: 2024年6月27日作成者: jarxiv

要約最近、3D ガウススプラッティング (3D-GS) は、現実世界のシーン … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Unsupervised Open-Vocabulary Object Localization in Videos

投稿日: 2024年6月27日作成者: jarxiv

要約この論文では、ビデオ表現学習と事前トレーニングされた視覚言語モデルの最近の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Robust Surgical Phase Recognition From Annotation Efficient Supervision

投稿日: 2024年6月27日作成者: jarxiv

要約手術段階認識は、コンピューター支援手術における重要なタスクであり、手術手順 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration

投稿日: 2024年6月27日作成者: jarxiv

要約深層学習ベースの画像復元手法は大幅な進歩を遂げていますが、合成データでのト … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

投稿日: 2024年6月27日作成者: jarxiv

要約科学論文や財務レポートの分析など、現実世界のタスクにマルチモーダル大規模言 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

投稿日: 2024年6月27日作成者: jarxiv

要約私たちは、タイムラプスビデオ生成における T2V モデル (Sora や … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

MultiDiff: Consistent Novel View Synthesis from a Single Image

投稿日: 2024年6月27日作成者: jarxiv

要約単一の RGB 画像からシーンを一貫して新しいビュー合成するための新しいア … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年6月

Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust Classifiers

Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference

DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality

Unsupervised Open-Vocabulary Object Localization in Videos

Robust Surgical Phase Recognition From Annotation Efficient Supervision

Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

MultiDiff: Consistent Novel View Synthesis from a Single Image

最近の投稿

最近のコメント

アーカイブ

カテゴリー