月別アーカイブ: 2024年9月

LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba

投稿日: 2024年9月19日作成者: jarxiv

要約最近の Transformer ベースの拡散モデルは、顕著なパフォーマンス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Massively Multi-Person 3D Human Motion Forecasting with Scene Context

投稿日: 2024年9月19日作成者: jarxiv

要約長期的な 3D 人間の動きを予測することは困難です。人間の行動には確率性が … 続きを読む →

カテゴリー: cs.CV, cs.LG, I.2 | コメントを受け付けていません

Bundle Adjustment in the Eager Mode

投稿日: 2024年9月19日作成者: jarxiv

要約バンドル調整 (BA) は、同時位置特定とマッピング (SLAM)、拡張現 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution

投稿日: 2024年9月19日作成者: jarxiv

要約我々は、視覚処理における従来の所定解像度アプローチを再定義する、以前の Q … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control

投稿日: 2024年9月19日作成者: jarxiv

要約模倣学習は、複雑な視覚運動ポリシーをトレーニングするための強力なツールであ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Vista3D: Unravel the 3D Darkside of a Single Image

投稿日: 2024年9月19日作成者: jarxiv

要約私たちは、目に見える部分を垣間見るだけで、オブジェクトの隠された次元を明ら … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GT, cs.MM | コメントを受け付けていません

Autonomous Navigation in Ice-Covered Waters with Learned Predictions on Ship-Ice Interactions

投稿日: 2024年9月19日作成者: jarxiv

要約氷に覆われた水域での自律航行は、実行可能な衝突のない軌道が頻繁に欠如してい … 続きを読む →

カテゴリー: cs.RO | コメントを受け付けていません

BEATLE — Self-Reconfigurable Aerial Robot: Design, Control and Experimental Validation

投稿日: 2024年9月19日作成者: jarxiv

要約モジュール式自己再構成ロボット (MSRR) は、各タスクに適したさまざま … 続きを読む →

カテゴリー: cs.RO | コメントを受け付けていません

The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives

投稿日: 2024年9月19日作成者: jarxiv

要約この論文では、子供向けのストーリーテリングを強化するために生成人工知能 ( … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

LOLA — An Open-Source Massively Multilingual Large Language Model

投稿日: 2024年9月19日作成者: jarxiv

要約この論文では、疎な専門家混合トランスフォーマーアーキテクチャを使用して … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年9月

LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba

Massively Multi-Person 3D Human Motion Forecasting with Scene Context

Bundle Adjustment in the Eager Mode

Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution

DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control

Vista3D: Unravel the 3D Darkside of a Single Image

Autonomous Navigation in Ice-Covered Waters with Learned Predictions on Ship-Ice Interactions

BEATLE — Self-Reconfigurable Aerial Robot: Design, Control and Experimental Validation

The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives

LOLA — An Open-Source Massively Multilingual Large Language Model

最近の投稿

最近のコメント

アーカイブ

カテゴリー