月別アーカイブ: 2024年9月

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

投稿日: 2024年9月27日作成者: jarxiv

要約大規模マルチモーダルモデル (LMM) の最近の進歩により、2D 視覚理 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

EgoLM: Multi-Modal Language Model of Egocentric Motions

投稿日: 2024年9月27日作成者: jarxiv

要約ウェアラブルデバイスの普及に伴い、コンテキストAIの開発には自己中心的な動 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner

投稿日: 2024年9月27日作成者: jarxiv

要約ビジュアル生成における拡散モデルの成功を基礎として、フローベースのモデルは … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Force-Guided Bridge Matching for Full-Atom Time-Coarsened Dynamics of Peptides

投稿日: 2024年9月27日作成者: jarxiv

要約分子動力学 (MD) は、いくつか例を挙げると、材料科学、化学、薬学などの … 続きを読む →

カテゴリー: cs.LG, physics.chem-ph, physics.comp-ph, q-bio.BM | コメントを受け付けていません

Characterizing stable regions in the residual stream of LLMs

投稿日: 2024年9月27日作成者: jarxiv

要約トランスフォーマーの残留ストリーム内の「安定領域」を特定します。この領域で … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents

投稿日: 2024年9月27日作成者: jarxiv

要約この論文では、Transformer アーキテクチャ内の OCR 感受性ニ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

LingoQA: Visual Question Answering for Autonomous Driving

投稿日: 2024年9月27日作成者: jarxiv

要約自動運転における視覚的な質問応答のための新しいデータセットおよびベンチマー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models

投稿日: 2024年9月27日作成者: jarxiv

要約自動運転における共変量シフト問題に対処するために、潜在空間生成世界モデルの … 続きを読む →

カテゴリー: (Primary), 68T45, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY, I.2.10 | コメントを受け付けていません

INT-FlashAttention: Enabling Flash Attention for INT8 Quantization

投稿日: 2024年9月27日作成者: jarxiv

要約大規模言語モデル (LLM) の基礎として、セルフアテンションモジュー … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

ManiFoundation Model for General-Purpose Robotic Manipulation of Contact Synthesis with Arbitrary Objects and Robots

投稿日: 2024年9月26日作成者: jarxiv

要約ロボットの知能を大幅に強化するには、LLM が示す多用途のタスク計画能力と … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

月別アーカイブ: 2024年9月

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

EgoLM: Multi-Modal Language Model of Egocentric Motions

FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner

Force-Guided Bridge Matching for Full-Atom Time-Coarsened Dynamics of Peptides

Characterizing stable regions in the residual stream of LLMs

Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents

LingoQA: Visual Question Answering for Autonomous Driving

Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models

INT-FlashAttention: Enabling Flash Attention for INT8 Quantization

ManiFoundation Model for General-Purpose Robotic Manipulation of Contact Synthesis with Arbitrary Objects and Robots

最近の投稿

最近のコメント

アーカイブ

カテゴリー