月別アーカイブ: 2024年1月

Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond

投稿日: 2024年1月3日作成者: jarxiv

要約マルチタスク学習 (MTL) は、複数の関連タスクが共同で学習され、共有表 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Exploiting Causality Signals in Medical Images: A Pilot Study with Empirical Results

投稿日: 2024年1月3日作成者: jarxiv

要約我々は、分類目的でニューラルネットワークを介して画像から弱い因果信号を直接 … 続きを読む →

カテゴリー: cs.AI, cs.CV, I.2.6 | コメントを受け付けていません

PointDC:Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering

投稿日: 2024年1月3日作成者: jarxiv

要約点群のセマンティックセグメンテーションには、通常、人間による注釈の骨の折 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

IdentiFace : A VGG Based Multimodal Facial Biometric System

投稿日: 2024年1月3日作成者: jarxiv

要約顔生体認証システムの開発は、コンピュータビジョン分野の発展に大きく貢献し … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Data-Efficient Multimodal Fusion on a Single GPU

投稿日: 2024年1月3日作成者: jarxiv

要約マルチモーダルアライメントの目標は、マルチモーダル入力間で共有される単一 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Temporal Adaptive RGBT Tracking with Modality Prompt

投稿日: 2024年1月3日作成者: jarxiv

要約 RGBT トラッキングは、ロボット工学、監視処理、自動運転などのさまざまな … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Deep Learning-Based Computational Model for Disease Identification in Cocoa Pods (Theobroma cacao L.)

投稿日: 2024年1月3日作成者: jarxiv

要約カカオの実の病気を早期に特定することは、高品質のカカオの生産を保証するため … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter

投稿日: 2024年1月3日作成者: jarxiv

要約近年、大規模言語モデルの進歩は目覚ましく、ChatGPT などのモデルはさ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Recovering 3D Human Mesh from Monocular Images: A Survey

投稿日: 2024年1月3日作成者: jarxiv

要約単眼画像から人間の姿勢や形状を推定することは、コンピュータービジョンにお … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM

投稿日: 2024年1月3日作成者: jarxiv

要約拡散モデルにおける最近の技術革新と画期的な進歩により、指定されたプロンプト … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年1月

Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond

Exploiting Causality Signals in Medical Images: A Pilot Study with Empirical Results

PointDC:Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering

IdentiFace : A VGG Based Multimodal Facial Biometric System

Data-Efficient Multimodal Fusion on a Single GPU

Temporal Adaptive RGBT Tracking with Modality Prompt

Deep Learning-Based Computational Model for Disease Identification in Cocoa Pods (Theobroma cacao L.)

ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter

Recovering 3D Human Mesh from Monocular Images: A Survey

VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM

最近の投稿

最近のコメント

アーカイブ

カテゴリー