月別アーカイブ: 2025年2月

ARCON: Advancing Auto-Regressive Continuation for Driving Videos

投稿日: 2025年2月27日作成者: jarxiv

要約オートエレクッシブ大型言語モデル（LLMS）の最近の進歩により、ビデオ生成 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3D ReX: Causal Explanations in 3D Neuroimaging Classification

投稿日: 2025年2月27日作成者: jarxiv

要約説明可能性は、医療イメージングにおけるAIモデルにとって重要な問題のままで … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis

投稿日: 2025年2月27日作成者: jarxiv

要約マルチモーダル磁気共鳴画像（MRI）は、脳の解剖学と病理に関する補完的な情 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

投稿日: 2025年2月27日作成者: jarxiv

要約ドメイン固有の定理を理解するには、多くの場合、単なるテキストベースの推論以 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models

投稿日: 2025年2月27日作成者: jarxiv

要約画像のシーケンス上の推論は、マルチモーダルの大手言語モデル（MLLMS）に … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Learning Decentralized Swarms Using Rotation Equivariant Graph Neural Networks

投稿日: 2025年2月27日作成者: jarxiv

要約集中制御なしで集合的な目標を最適化するエージェントのオーケストレーションは … 続きを読む →

カテゴリー: (Primary), 68Q32, 68T42, cs.LG, cs.RO | コメントを受け付けていません

Aligned Datasets Improve Detection of Latent Diffusion-Generated Images

投稿日: 2025年2月27日作成者: jarxiv

要約潜在的な拡散モデル（LDM）が画像生成機能を民主化するにつれて、偽の画像を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GHOST 2.0: generative high-fidelity one shot transfer of heads

投稿日: 2025年2月27日作成者: jarxiv

要約フェイススワッピングのタスクは最近、研究コミュニティで注目を集めていますが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

The FFT Strikes Back: An Efficient Alternative to Self-Attention

投稿日: 2025年2月27日作成者: jarxiv

要約従来の自己関節メカニズムには二次の複雑さが発生し、長いシーケンスでのスケー … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation

投稿日: 2025年2月27日作成者: jarxiv

要約少数の3Dポイントクラウドセグメンテーション（FS-PCS）は、最小限の注 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年2月

ARCON: Advancing Auto-Regressive Continuation for Driving Videos

3D ReX: Causal Explanations in 3D Neuroimaging Classification

Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models

Learning Decentralized Swarms Using Rotation Equivariant Graph Neural Networks

Aligned Datasets Improve Detection of Latent Diffusion-Generated Images

GHOST 2.0: generative high-fidelity one shot transfer of heads

The FFT Strikes Back: An Efficient Alternative to Self-Attention

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation

最近の投稿

最近のコメント

アーカイブ

カテゴリー