「cs.AI」カテゴリーアーカイブ

Training-Free Consistency Pipeline for Fashion Repose

投稿日: 2025年1月24日作成者: jarxiv

要約拡散モデルの最近の進歩により、実際のオブジェクトの画像を編集する可能性が大 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.SE | コメントを受け付けていません

EventVL: Understand Event Streams via Multimodal Large Language Model

投稿日: 2025年1月24日作成者: jarxiv

要約イベントベースのビジョン言語モデル（VLM）は、最近、実用的なビジョンタス … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID

投稿日: 2025年1月24日作成者: jarxiv

要約リアルタイムオブジェクトの検出と自己監視の再識別（REID）を組み合わせた … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Skin Disease Detection and Classification of Actinic Keratosis and Psoriasis Utilizing Deep Transfer Learning

投稿日: 2025年1月24日作成者: jarxiv

要約皮膚疾患は、感染症、アレルギー、遺伝的要因、自己免疫疾患、ホルモンの不均衡 … 続きを読む →

カテゴリー: 68T07, cs.AI, cs.CV, J.3 | コメントを受け付けていません

You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain

投稿日: 2025年1月24日作成者: jarxiv

要約惑星、月、および小体の表面地形の現場検出は、学習ベースのコンピュータービジ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Solving the long-tailed distribution problem by exploiting the synergies and balance of different techniques

投稿日: 2025年1月24日作成者: jarxiv

要約現実世界のデータでは、ロングテールのデータ分布が一般的であるため、経験に基 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

投稿日: 2025年1月24日作成者: jarxiv

要約複雑な現実世界のシナリオで人々の社会的相互作用を理解することは、しばしば複 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Ensuring Medical AI Safety: Explainable AI-Driven Detection and Mitigation of Spurious Model Behavior and Associated Data

投稿日: 2025年1月24日作成者: jarxiv

要約ディープニューラルネットワークは、実際には致命的な結果をもたらす可能性 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Where Do You Go? Pedestrian Trajectory Prediction using Scene Features

投稿日: 2025年1月24日作成者: jarxiv

要約歩行者の軌跡を正確に予測することは、自動運転車の安全性を高め、歩行者が巻き … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning

投稿日: 2025年1月24日作成者: jarxiv

要約私たちは、きめ細かい視覚的理解を促進するために設計された初のパノプティック … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Training-Free Consistency Pipeline for Fashion Repose

EventVL: Understand Event Streams via Multimodal Large Language Model

YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID

Skin Disease Detection and Classification of Actinic Keratosis and Psoriasis Utilizing Deep Transfer Learning

You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain

Solving the long-tailed distribution problem by exploiting the synergies and balance of different techniques

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

Ensuring Medical AI Safety: Explainable AI-Driven Detection and Mitigation of Spurious Model Behavior and Associated Data

Where Do You Go? Pedestrian Trajectory Prediction using Scene Features

Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning

最近の投稿

最近のコメント

アーカイブ

カテゴリー