Monthly Archive: March 2024

Robust Shape Fitting for 3D Scene Abstraction

Posted: March 18, 2024 | Author: jarxiv

Abstract: Humans perceive and construct the world as an arrangement of simple parametric models. …

Category: cs.CV

SimPLR: A Simple and Plain Transformer for Scaling-Efficient Object Detection and Segmentation

Posted: March 18, 2024 | Author: jarxiv

Abstract: In the design of modern object detectors, the ability to detect objects in images at various scales …

Category: cs.CV

Understanding the Double Descent Phenomenon in Deep Learning

Posted: March 18, 2024 | Author: jarxiv

Abstract: As the capacity of a model class grows, controlling the generalization gap to avoid overfitting …

Categories: cs.CV, cs.LG, stat.ML

DeepRepViz: Identifying Confounders in Deep Learning Model Predictions

Posted: March 18, 2024 | Author: jarxiv

Abstract: To predict psychological behaviors, cognitive traits, and brain pathologies, deep learning (DL) models …

Categories: cs.AI, cs.CV, cs.LG

Approximate Nullspace Augmented Finetuning for Robust Vision Transformers

Posted: March 18, 2024 | Author: jarxiv

Abstract: Enhancing the robustness of deep learning models, particularly vision transformers …

Categories: cs.CV, cs.LG

MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation

Posted: March 18, 2024 | Author: jarxiv

Abstract: From a single RGB image, with an available 3D computer-aided design ( …

Category: cs.CV

Geometry of the Visual Cortex with Applications to Image Inpainting and Enhancement

Posted: March 18, 2024 | Author: jarxiv

Abstract: On the roto-translation group $SE(2)$, inspired by the visual cortex V1, …

Category: cs.CV

Joint Multimodal Transformer for Dimensional Emotional Recognition in the Wild

Posted: March 18, 2024 | Author: jarxiv

Abstract: Audiovisual emotion recognition (ER) in videos, compared with unimodal performance, …

Categories: cs.CV, cs.LG, cs.SD, eess.AS

HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models

Posted: March 18, 2024 | Author: jarxiv

Abstract: Driven by text prompts, realistic 3D human and object …

Category: cs.CV

Mitigating Dialogue Hallucination for Large Multi-modal Models via Adversarial Instruction Tuning

Posted: March 18, 2024 | Author: jarxiv

Abstract: For general-purpose assistants, mitigating hallucinations in large multimodal models (LMMs) …

Category: cs.CV
