月別アーカイブ: 2025年3月

RELD: Regularization by Latent Diffusion Models for Image Restoration

投稿日: 2025年3月31日作成者: jarxiv

要約近年、拡散モデルは深い生成モデリングにおける新しい最先端のモデルになり、生 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Advancing the Biological Plausibility and Efficacy of Hebbian Convolutional Neural Networks

投稿日: 2025年3月31日作成者: jarxiv

要約このペーパーで提示された研究は、イメージ処理のためのヘビアン学習の畳み込み … 続きを読む →

カテゴリー: cs.CV, cs.NE, I.2.6 | コメントを受け付けていません

Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints

投稿日: 2025年3月31日作成者: jarxiv

要約拡散トランス（DIT）は、画像とビデオ生成の強力なアーキテクチャとして浮上 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization

投稿日: 2025年3月31日作成者: jarxiv

要約視覚言語モデル（VLM）の急速な進歩は、マルチモーダルの理解を変えましたが … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Next-Best-Trajectory Planning of Robot Manipulators for Effective Observation and Exploration

投稿日: 2025年3月31日作成者: jarxiv

要約オブジェクトの視覚的観測は、オブジェクトの再構築と操作、ナビゲーション、シ … 続きを読む →

カテゴリー: cs.CV, cs.HC, cs.RO | コメントを受け付けていません

Using AI to Summarize US Presidential Campaign TV Advertisement Videos, 1952-2012

投稿日: 2025年3月31日作成者: jarxiv

要約このペーパーでは、デジタル形式で入手可能な米国大統領キャンペーンテレビ広告 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation

投稿日: 2025年3月31日作成者: jarxiv

要約目的：膀胱切除患者における内臓脂肪組織（VAT）の分布は、術後合併症の発生 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

USC: Uncompromising Spatial Constraints for Safety-Oriented 3D Object Detectors in Autonomous Driving

投稿日: 2025年3月31日作成者: jarxiv

要約この作業では、自律運転コンテキストでの3Dオブジェクト検出器の安全指向のパ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

TULIP: Token-length Upgraded CLIP

投稿日: 2025年3月31日作成者: jarxiv

要約クリップなどのビジョン言語モデルで長いキャプションを表現するという課題に対 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis

投稿日: 2025年3月31日作成者: jarxiv

要約トーキングヘッド合成は、コンピューターグラフィックスとマルチメディアの重要 … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.SD, eess.AS | コメントを受け付けていません

月別アーカイブ: 2025年3月

RELD: Regularization by Latent Diffusion Models for Image Restoration

Advancing the Biological Plausibility and Efficacy of Hebbian Convolutional Neural Networks

Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints

Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization

Next-Best-Trajectory Planning of Robot Manipulators for Effective Observation and Exploration

Using AI to Summarize US Presidential Campaign TV Advertisement Videos, 1952-2012

KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation

USC: Uncompromising Spatial Constraints for Safety-Oriented 3D Object Detectors in Autonomous Driving

TULIP: Token-length Upgraded CLIP

Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis

最近の投稿

最近のコメント

アーカイブ

カテゴリー