"cs.AI" Category Archive

DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm$^2$ Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion

Posted on October 16, 2024 by jarxiv

Abstract: In the latest communication systems, deep neural network (DNN) … Continue reading →

Categories: cs.AI, cs.AR, cs.CV | Comments are closed

U-MedSAM: Uncertainty-aware MedSAM for Medical Image Segmentation

Posted on October 16, 2024 by jarxiv

Abstract: Medical image foundation models, across various … Continue reading →

Categories: cs.AI, cs.CV, eess.IV | Comments are closed

MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Posted on October 16, 2024 by jarxiv

Abstract: Multimodal large language models (MLLMs) frequently exhibit hallucination phenomena, but … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM | Comments are closed

OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation

Posted on October 16, 2024 by jarxiv

Abstract: By imitating a single video demonstration, humanoid robot manipulation skills … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, cs.RO | Comments are closed

VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing

Posted on October 16, 2024 by jarxiv

Abstract: Video editing, from entertainment and education to professional … Continue reading →

Categories: cs.AI, cs.CV, cs.MM | Comments are closed

MoH: Multi-Head Attention as Mixture-of-Head Attention

Posted on October 16, 2024 by jarxiv

Abstract: In this work, the multi-head attention at the core of the Transformer model … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments are closed

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Posted on October 16, 2024 by jarxiv

Abstract: For multimodal video understanding and generation, understanding fine-grained temporal dynamics … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG | Comments are closed

Learning Quadruped Locomotion Using Differentiable Simulation

Posted on October 16, 2024 by jarxiv

Abstract: In this work, the potential of using differentiable simulation for learning quadruped locomotion … Continue reading →

Categories: cs.AI, cs.RO | Comments are closed

Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions

Posted on October 15, 2024 by jarxiv

Abstract: With recent advances in model-based reinforcement learning (MBRL), MBRL in visual … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, cs.RO | Comments are closed

Input-to-State Stable Coupled Oscillator Networks for Closed-form Model-based Control in Latent Space

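Posted on October 15, 2024 by jarxiv

Abstract: Although various methods have been proposed in the literature, for physical systems, efficient and effective … Continue reading →

Categories: cs.AI, cs.LG, cs.RO, cs.SY, eess.SY | Comments are closed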
