月別アーカイブ: 2024年7月

Multi-label Cluster Discrimination for Visual Representation Learning

投稿日: 2024年7月25日作成者: jarxiv

要約対照言語画像事前トレーニング (CLIP) は、画像とテキストの対照学習に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Cascaded Light Propagation Volumes using Spherical Radial Basis Functions

投稿日: 2024年7月25日作成者: jarxiv

要約このペーパーでは、動的シーンで間接照明をシミュレートするための最新の方法の … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Preliminary study on artificial intelligence methods for cybersecurity threat detection in computer networks based on raw data packets

投稿日: 2024年7月25日作成者: jarxiv

要約コンピュータネットワークにおける侵入検出方法のほとんどは、トラフィック … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CV, I.2.1 | コメントを受け付けていません

MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms

投稿日: 2024年7月25日作成者: jarxiv

要約ソーシャルメディアプラットフォームは、テキスト、画像、ビデオを含むマル … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.CY | コメントを受け付けていません

Deep Spherical Superpixels

投稿日: 2024年7月25日作成者: jarxiv

要約長年にわたり、スーパーピクセルセグメンテーションの使用はさまざまなアプリ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MuST: Multi-Scale Transformers for Surgical Phase Recognition

投稿日: 2024年7月25日作成者: jarxiv

要約手術ビデオにおける位相認識は、一連の手術段階の自動理解を可能にするため、コ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

投稿日: 2024年7月25日作成者: jarxiv

要約異なるユーザーは、同じプロンプトに対して生成された異なる画像が望ましいと感 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction

投稿日: 2024年7月25日作成者: jarxiv

要約時間情報は、遮蔽されたインスタンスを検出するために重要です。既存の時間表 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association

投稿日: 2024年7月25日作成者: jarxiv

要約大規模視覚言語モデル (LVLM) が画像認識タスクで達成した目覚ましい成 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

投稿日: 2024年7月25日作成者: jarxiv

要約人間のパフォーマンスを忠実にキャプチャし、まばらな RGB 観察からフリー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年7月

Multi-label Cluster Discrimination for Visual Representation Learning

Cascaded Light Propagation Volumes using Spherical Radial Basis Functions

Preliminary study on artificial intelligence methods for cybersecurity threat detection in computer networks based on raw data packets

MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms

Deep Spherical Superpixels

MuST: Multi-Scale Transformers for Surgical Phase Recognition

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

最近の投稿

最近のコメント

アーカイブ

カテゴリー