「cs.AI」カテゴリーアーカイブ

PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting

投稿日: 2024年6月6日作成者: jarxiv

要約テキスト条件付き拡散モデル (DM) が画像、ビデオ、および 3D 生成に … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models

投稿日: 2024年6月6日作成者: jarxiv

要約変圧器ベースの物体検出モデルが進歩するにつれて、自動運転車や航空などの重要 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?

投稿日: 2024年6月6日作成者: jarxiv

要約 Timeseries Anomaly Detection (TAD) にお … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Prompt-based Visual Alignment for Zero-shot Policy Transfer

投稿日: 2024年6月6日作成者: jarxiv

要約 RL の過学習は、強化学習 (RL) への応用に対する主な障害の 1 つと … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Deep Generative Models for Proton Zero Degree Calorimeter Simulations in ALICE, CERN

投稿日: 2024年6月6日作成者: jarxiv

要約検出器の応答をシミュレーションすることは、CERN の大型ハドロン衝突型加 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

FindingEmo: An Image Dataset for Emotion Recognition in the Wild

投稿日: 2024年6月6日作成者: jarxiv

要約 FindingEmo は、感情認識に特化した 25,000 画像の注釈を含 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding

投稿日: 2024年6月6日作成者: jarxiv

要約大規模視覚言語モデル (LVLM) は、視覚入力から状況に応じて詳細で一貫 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models

投稿日: 2024年6月6日作成者: jarxiv

要約 OpenFlamingo、LLaVA、GPT-4 などのマルチモーダル基盤 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors

投稿日: 2024年6月6日作成者: jarxiv

要約民生用センサーによって生成された深度マップには、不正確な測定値や、システム … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC | コメントを受け付けていません

Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input

投稿日: 2024年6月6日作成者: jarxiv

要約イベントカメラは、低遅延で出力応答がまばらなビジョンセンサーを必要とす … 続きを読む →

カテゴリー: 68T99, cs.AI, cs.CV, I.2.10 | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting

Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models

Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?

Prompt-based Visual Alignment for Zero-shot Policy Transfer

Deep Generative Models for Proton Zero Degree Calorimeter Simulations in ALICE, CERN

FindingEmo: An Image Dataset for Emotion Recognition in the Wild

Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding

Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models

SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors

Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input

最近の投稿

最近のコメント

アーカイブ

カテゴリー