月別アーカイブ: 2024年9月

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

投稿日: 2024年9月13日作成者: jarxiv

要約大規模言語モデル (LLM) は、計画と推論を必要とするマルチモーダルなタ … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

AnySkin: Plug-and-play Skin Sensing for Robotic Touch

投稿日: 2024年9月13日作成者: jarxiv

要約触覚センシングは重要かつ有用なセンシングモダリティとして広く受け入れられて … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

DEAR: Depth-Enhanced Action Recognition

投稿日: 2024年9月13日作成者: jarxiv

要約ビデオ、特に乱雑なシーン内のアクションを検出することは、カメラの観点から見 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis

投稿日: 2024年9月13日作成者: jarxiv

要約可視光に基づいた斬新な視点の合成は広く研究されています。可視光イメージン … 続きを読む →

カテゴリー: cs.CV, cs.GR, I.3.3 | コメントを受け付けていません

GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning

投稿日: 2024年9月13日作成者: jarxiv

要約フォトリアリスティックジェネレーターの急速な進歩により、本物の画像と加工 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Learning Video Context as Interleaved Multimodal Sequences

投稿日: 2024年9月13日作成者: jarxiv

要約映画などのナラティブビデオは、その豊富なコンテキスト (キャラクター、会話 … 続きを読む →

カテゴリー: cs.CV, cs.MM | コメントを受け付けていません

Expansive Supervision for Neural Radiance Field

投稿日: 2024年9月13日作成者: jarxiv

要約 Neural Radiance Fields は、その優れた再構成機能によ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

NITRO-D: Native Integer-only Training of Deep Convolutional Neural Networks

投稿日: 2024年9月13日作成者: jarxiv

要約量子化は、ディープニューラルネットワーク (DNN) の着実に増加する … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.NE, I.2.6 | コメントを受け付けていません

AutoPET Challenge: Tumour Synthesis for Data Augmentation

投稿日: 2024年9月13日作成者: jarxiv

要約全身 PET/CT スキャンにおける正確な病変セグメンテーションは、がんの … 続きを読む →

カテゴリー: cs.CV, eess.IV, physics.med-ph | コメントを受け付けていません

Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

投稿日: 2024年9月13日作成者: jarxiv

要約私たちは、拡散ベースの画像間の変換に合わせた、シンプルだが効果的なトレーニ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年9月

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

AnySkin: Plug-and-play Skin Sensing for Robotic Touch

DEAR: Depth-Enhanced Action Recognition

Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis

GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning

Learning Video Context as Interleaved Multimodal Sequences

Expansive Supervision for Neural Radiance Field

NITRO-D: Native Integer-only Training of Deep Convolutional Neural Networks

AutoPET Challenge: Tumour Synthesis for Data Augmentation

Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

最近の投稿

最近のコメント

アーカイブ

カテゴリー