月別アーカイブ: 2025年1月

Causal Deep Learning

投稿日: 2025年1月3日作成者: jarxiv

要約私たちは、因果的ディープニューラルネットワークのセットを導出します。そ … 続きを読む →

カテゴリー: (Primary), 15A09, 15A69, 15A72, 62D20, 62H25, 62H30, 62H35, 62J10, 68T45, cs.AI, cs.CV, cs.LG, G.3, stat.ML | コメントを受け付けていません

World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving

投稿日: 2025年1月3日作成者: jarxiv

要約広範な世界知識を備えたマルチモーダル大規模言語モデル (MLLM) は、特 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

投稿日: 2025年1月3日作成者: jarxiv

要約埋め込みモデルは、意味的類似性、情報検索、クラスタリングなどのさまざまな下 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Photoacoustic Iterative Optimization Algorithm with Shape Prior Regularization

投稿日: 2025年1月3日作成者: jarxiv

要約光音響イメージング (PAI) には、ノイズ、アーティファクト、まばらなサ … 続きを読む →

カテゴリー: cs.CV, physics.optics | コメントを受け付けていません

Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models

投稿日: 2025年1月3日作成者: jarxiv

要約 Midjourney や DALLE 3 などのイノベーションに代表される … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes

投稿日: 2025年1月3日作成者: jarxiv

要約ビデオデータとアルゴリズムは、マルチオブジェクトトラッキング (MOT … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation

投稿日: 2025年1月3日作成者: jarxiv

要約 Direct Preference Optimization (DPO) … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HunyuanVideo: A Systematic Framework For Large Video Generative Models

投稿日: 2025年1月3日作成者: jarxiv

要約ビデオ生成における最近の進歩は、個人と業界の両方の日常生活に大きな影響を与 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling

投稿日: 2025年1月3日作成者: jarxiv

要約 Fine-Grained Sketch-Based Image Retri … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Dynamic Negative Guidance of Diffusion Models

投稿日: 2025年1月3日作成者: jarxiv

要約ネガティブプロンプティング (NP) は、拡散モデル、特にテキストから画 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年1月

Causal Deep Learning

World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

Photoacoustic Iterative Optimization Algorithm with Shape Prior Regularization

Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models

TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes

Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation

HunyuanVideo: A Systematic Framework For Large Video Generative Models

ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling

Dynamic Negative Guidance of Diffusion Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー