「cs.AI」カテゴリーアーカイブ

Evaluating Modern Visual Anomaly Detection Approaches in Semiconductor Manufacturing: A Comparative Study

投稿日: 2025年5月13日作成者: jarxiv

要約半導体製造は、複雑で多段階のプロセスです。走査型電子顕微鏡（SEM）画像 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Neural Brain: A Neuroscience-inspired Framework for Embodied Agents

投稿日: 2025年5月13日作成者: jarxiv

要約人工知能（AI）の急速な進化は、静的なデータ駆動型モデルから、実際の環境を … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Simple Semi-supervised Knowledge Distillation from Vision-Language Models via $\mathbf{\texttt{D}}$ual-$\mathbf{\texttt{H}}$ead $\mathbf{\texttt{O}}$ptimization

投稿日: 2025年5月13日作成者: jarxiv

要約 Vision-Language Models（VLMS）は、最小限のラベル … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Hybrid Spiking Vision Transformer for Object Detection with Event Cameras

投稿日: 2025年5月13日作成者: jarxiv

要約イベントベースのオブジェクト検出は、高い時間分解能、広いダイナミックレンジ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

GP-GS: Gaussian Processes for Enhanced Gaussian Splatting

投稿日: 2025年5月13日作成者: jarxiv

要約 3Dガウスのスプラッティングは、効率的なフォトリアリスティックな新規ビュー … 続きを読む →

カテゴリー: 68T45, cs.AI, cs.CV | コメントを受け付けていません

DexWild: Dexterous Human Interactions for In-the-Wild Robot Policies

投稿日: 2025年5月13日作成者: jarxiv

要約大規模で多様なロボットデータセットは、目覚る操作ポリシーが新しい環境に一般 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models

投稿日: 2025年5月13日作成者: jarxiv

要約テキストの説明から現実的な医療画像の生成は、患者のプライバシーを維持しなが … 続きを読む →

カテゴリー: 68T07, 68U10, 92C55, cs.AI, cs.CV, I.2.10 | コメントを受け付けていません

H$^{\mathbf{3}}$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning

投稿日: 2025年5月13日作成者: jarxiv

要約視覚運動の政策学習は、ロボット操作の大きな進歩を目撃しており、最近のアプロ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

LLMs Outperform Experts on Challenging Biology Benchmarks

投稿日: 2025年5月13日作成者: jarxiv

要約この研究では、分子生物学、遺伝学、クローニング、ウイルス学、およびバイオセ … 続きを読む →

カテゴリー: cs.AI, cs.LG, q-bio.QM | コメントを受け付けていません

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

投稿日: 2025年5月12日作成者: jarxiv

要約自然言語の指示を解釈し、複雑な都市環境をナビゲートするためにドローンを要求 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Evaluating Modern Visual Anomaly Detection Approaches in Semiconductor Manufacturing: A Comparative Study

Neural Brain: A Neuroscience-inspired Framework for Embodied Agents

Simple Semi-supervised Knowledge Distillation from Vision-Language Models via $\mathbf{\texttt{D}}$ual-$\mathbf{\texttt{H}}$ead $\mathbf{\texttt{O}}$ptimization

Hybrid Spiking Vision Transformer for Object Detection with Event Cameras

GP-GS: Gaussian Processes for Enhanced Gaussian Splatting

DexWild: Dexterous Human Interactions for In-the-Wild Robot Policies

Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models

H$^{\mathbf{3}}$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning

LLMs Outperform Experts on Challenging Biology Benchmarks

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

最近の投稿

最近のコメント

アーカイブ

カテゴリー