月別アーカイブ: 2024年4月

DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset

投稿日: 2024年4月1日作成者: jarxiv

要約インスタントメッセージで画像を共有することは重要な要素であるため、画像と … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos

投稿日: 2024年4月1日作成者: jarxiv

要約この論文では、自然言語の質問応答と検索を通じて長文の自己中心的なビデオグラ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Relation Rectification in Diffusion Model

投稿日: 2024年4月1日作成者: jarxiv

要約並外れた生成能力にもかかわらず、大規模なテキストから画像への拡散モデルは、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Descriptor and Word Soups: Overcoming the Parameter Efficiency Accuracy Tradeoff for Out-of-Distribution Few-shot Learning

投稿日: 2024年4月1日作成者: jarxiv

要約過去 1 年間にわたり、GPT 記述子を使用したゼロショット評価を中心とし … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Latent Embedding Clustering for Occlusion Robust Head Pose Estimation

投稿日: 2024年4月1日作成者: jarxiv

要約頭姿勢推定は、ロボット工学、監視、ドライバーの注意監視などの幅広い用途で有 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation

投稿日: 2024年4月1日作成者: jarxiv

要約解剖学的構造と病理の医療画像セグメンテーションは、現代の臨床診断、疾患研究 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions

投稿日: 2024年4月1日作成者: jarxiv

要約時間的アクション検出 (TAD) は、トリミングされていない長期間のビデオ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Prototype-based Interpretable Breast Cancer Prediction Models: Analysis and Challenges

投稿日: 2024年4月1日作成者: jarxiv

要約深層学習モデルは医療アプリケーションで高いパフォーマンスを実現していますが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

投稿日: 2024年4月1日作成者: jarxiv

要約セマンティックセグメンテーションは本質的に広範なピクセルレベルの注釈付 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Joint chest X-ray diagnosis and clinical visual attention prediction with multi-stage cooperative learning: enhancing interpretability

投稿日: 2024年4月1日作成者: jarxiv

要約ディープラーニングはコンピューター支援診断の最先端技術となっているため、臨 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

月別アーカイブ: 2024年4月

DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset

LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos

Relation Rectification in Diffusion Model

Descriptor and Word Soups: Overcoming the Parameter Efficiency Accuracy Tradeoff for Out-of-Distribution Few-shot Learning

Latent Embedding Clustering for Occlusion Robust Head Pose Estimation

MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation

Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions

Prototype-based Interpretable Breast Cancer Prediction Models: Analysis and Challenges

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

Joint chest X-ray diagnosis and clinical visual attention prediction with multi-stage cooperative learning: enhancing interpretability

最近の投稿

最近のコメント

アーカイブ

カテゴリー