Monthly Archives: June 2024

Enhancing Vision Models for Text-Heavy Content Understanding and Interaction

Posted on June 3, 2024 by jarxiv

Summary: Interacting with and understanding text-heavy visual content that contains multiple images … Continue reading →

Categories: cs.AI, cs.CL, cs.CV | Comments are closed

Predicting ptychography probe positions using single-shot phase retrieval neural network

Posted on June 3, 2024 by jarxiv

Summary: Ptychography, across fields such as materials science, biology, and nanotechnology, … Continue reading →

Categories: 94A08, cs.AI, cs.CV, I.4.0, physics.app-ph, physics.data-an | Comments are closed

Fast yet Safe: Early-Exiting with Risk Control

Posted on June 3, 2024 by jarxiv

Summary: Scaling machine learning models significantly improves their performance. … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, stat.ML | Comments are closed

Memory Consolidation Enables Long-Context Video Understanding

Posted on June 3, 2024 by jarxiv

Summary: Most transformer-based video encoders, with their quadratic complexity, … Continue reading →

Categories: cs.CV | Comments are closed

HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization

Posted on June 3, 2024 by jarxiv

Summary: Diffusion Transformers (DiT) surpass conventional diffusion models that use U-Net … Continue reading →

Categories: cs.AI, cs.CV | Comments are closed

The Victim and The Beneficiary: Exploiting a Poisoned Model to Train a Clean Model on Poisoned Data

Posted on June 3, 2024 by jarxiv

Summary: Recently, backdoor attacks against deep neural networks (DNNs) … Continue reading →

Categories: cs.CV | Comments are closed

Towards Imbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation

Posted on June 3, 2024 by jarxiv

Summary: Video portrait segmentation (VPS), video frames … Continue reading →

Categories: cs.CV | Comments are closed

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation

Posted on June 3, 2024 by jarxiv

Summary: In this paper, under-studied because of the following two key challenges, panoramic … Continue reading →

Categories: cs.CV, cs.RO, eess.IV | Comments are closed

Pre- to Post-Contrast Breast MRI Synthesis for Enhanced Tumour Segmentation

Posted on June 3, 2024 by jarxiv

Summary: The administration of contrast agents in dynamic contrast-enhanced MRI (DCE-MRI), tumour … Continue reading →

Categories: cs.CV, cs.LG, eess.IV | Comments are closed

Amortizing intractable inference in diffusion models for vision, language, and control

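Posted on June 3, 2024 by jarxiv

Summary: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed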

