月別アーカイブ: 2024年8月

RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling

投稿日: 2024年8月7日作成者: jarxiv

要約日常的に広く応用されている 3D 人間の姿勢推定の分野では、便利な取得装置 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers

投稿日: 2024年8月7日作成者: jarxiv

要約ビジョントランスフォーマー (ViT) は、ビジョンタスクにおけるパフ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images

投稿日: 2024年8月7日作成者: jarxiv

要約単一の RGB 画像から 3D 形状を生成することは、ロボット工学などのさ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TextIM: Part-aware Interactive Motion Synthesis from Text

投稿日: 2024年8月7日作成者: jarxiv

要約この研究では、パーツレベルのセマンティクスの正確な調整に焦点を当て、TEX … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Deep-learning Assisted Detection and Quantification of (oo)cysts of Giardia and Cryptosporidium on Smartphone Microscopy Images

投稿日: 2024年8月7日作成者: jarxiv

要約微生物に汚染された食品や水の摂取は、毎年何百万人もの人々の命を奪っています … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Fusing Forces: Deep-Human-Guided Refinement of Segmentation Masks

投稿日: 2024年8月7日作成者: jarxiv

要約エトルリアの鏡はエトルリア美術の重要なカテゴリーを構成しており、裏面に描か … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.LG | コメントを受け付けていません

Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

投稿日: 2024年8月7日作成者: jarxiv

要約深視野モデルの人気が急速に高まるにつれ、モデル予測の説明がますます重要視さ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation

投稿日: 2024年8月7日作成者: jarxiv

要約拡散トランスの分野における最近の進歩により、高品質の 2D 画像、3D ビ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation

投稿日: 2024年8月7日作成者: jarxiv

要約このペーパーでは、オブジェクトとその色を一致させるマルチモーダルモデルの … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Robustness Assessment of a Runway Object Classifier for Safe Aircraft Taxiing

投稿日: 2024年8月7日作成者: jarxiv

要約ディープニューラルネットワーク (DNN) が多くの計算問題に対する有 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.LO | コメントを受け付けていません

月別アーカイブ: 2024年8月

RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling

DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers

PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images

TextIM: Part-aware Interactive Motion Synthesis from Text

Deep-learning Assisted Detection and Quantification of (oo)cysts of Giardia and Cryptosporidium on Smartphone Microscopy Images

Fusing Forces: Deep-Human-Guided Refinement of Segmentation Masks

Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation

ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation

Robustness Assessment of a Runway Object Classifier for Safe Aircraft Taxiing

最近の投稿

最近のコメント

アーカイブ

カテゴリー