月別アーカイブ: 2024年6月

Evaluating Task-based Effectiveness of MLLMs on Charts

投稿日: 2024年6月18日作成者: jarxiv

要約このペーパーでは、GPT-4V はチャート上の低レベルのデータ分析タスクに … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding

投稿日: 2024年6月18日作成者: jarxiv

要約ビジョン言語モデル (VLM) は、多くの言語の画像に関するクエリに応答で … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting

投稿日: 2024年6月18日作成者: jarxiv

要約マルチビュー画像からの 3D 再構成は、コンピュータビジョンとグラフィッ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Lightweight Model Pre-training via Language Guided Knowledge Distillation

投稿日: 2024年6月18日作成者: jarxiv

要約この論文では、多くのモバイルデバイスにとって不可欠な、小規模モデルの事前 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving

投稿日: 2024年6月18日作成者: jarxiv

要約軌道予測は、過去の軌道に基づいて近くのエージェントの動きを予測します。自 … 続きを読む →

カテゴリー: cs.CR, cs.CV, cs.LG | コメントを受け付けていません

OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations

投稿日: 2024年6月18日作成者: jarxiv

要約深度補完は、画像と疎な深度マップを入力として与えられて、密な深度マップを生 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Latent Denoising Diffusion GAN: Faster sampling, Higher image quality

投稿日: 2024年6月18日作成者: jarxiv

要約拡散モデルは、高忠実度で多様な画像を生成するための強力なソリューションとし … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

投稿日: 2024年6月18日作成者: jarxiv

要約このペーパーでは、ビデオおよびオーディオ指向のタスクにおける時空間モデリン … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Novel Fundus Image Preprocessing for Retcam Images to Improve Deep Learning Classification of Retinopathy of Prematurity

投稿日: 2024年6月18日作成者: jarxiv

要約未熟児網膜症（ROP）は、目の網膜が損傷するため、未熟児で生まれた赤ちゃん … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV, I.2.1 | コメントを受け付けていません

Correspondence Free Multivector Cloud Registration using Conformal Geometric Algebra

投稿日: 2024年6月18日作成者: jarxiv

要約我々は、等角幾何代数における対応自由マルチベクトルクラウド登録の問題に対処 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年6月

Evaluating Task-based Effectiveness of MLLMs on Charts

See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding

Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting

Lightweight Model Pre-training via Language Guided Knowledge Distillation

A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving

OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations

Latent Denoising Diffusion GAN: Faster sampling, Higher image quality

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Novel Fundus Image Preprocessing for Retcam Images to Improve Deep Learning Classification of Retinopathy of Prematurity

Correspondence Free Multivector Cloud Registration using Conformal Geometric Algebra

最近の投稿

最近のコメント

アーカイブ

カテゴリー