「cs.AI」カテゴリーアーカイブ

Vision-Language Models under Cultural and Inclusive Considerations

投稿日: 2024年7月9日作成者: jarxiv

要約大規模視覚言語モデル (VLM) は、視覚障害のある人々の日常生活の画像を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.CY | コメントを受け付けていません

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

投稿日: 2024年7月9日作成者: jarxiv

要約 Large Vision Language Model (LVLM) のパ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Multi-Object Hallucination in Vision-Language Models

投稿日: 2024年7月9日作成者: jarxiv

要約 Large Vision Language Model (LVLM) は、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

EventChat: Implementation and user-centric evaluation of a large language model-driven conversational recommender system for exploring leisure events in an SME context

投稿日: 2024年7月9日作成者: jarxiv

要約大規模言語モデル (LLM) は、会話型レコメンダーシステム (CRS) … 続きを読む →

カテゴリー: 68T50, cs.AI, cs.CL, cs.IR, cs.LG, H.5.2 | コメントを受け付けていません

PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers

投稿日: 2024年7月9日作成者: jarxiv

要約オブジェクトの部分を明示的に検出し、それを基に推論するコンピュータービジ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning

投稿日: 2024年7月8日作成者: jarxiv

要約惑星探査では、起伏の激しい地形での移動が必要となる。さらに、火星探査機やそ … 続きを読む →

カテゴリー: cs.AI, cs.RO, I.2.9 | コメントを受け付けていません

DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation

投稿日: 2024年7月8日作成者: jarxiv

要約人間の手の動きデータからの模倣学習は、実世界の操作タスクにおいて人間のよう … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions

投稿日: 2024年7月8日作成者: jarxiv

要約 Vision-and-Language Navigation (VLN)は … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks

投稿日: 2024年7月8日作成者: jarxiv

要約マルチモーダルモデルの汎化能力を、分布外データに対する性能のみに基づいて評 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.RO | コメントを受け付けていません

ROER: Regularized Optimal Experience Replay

投稿日: 2024年7月8日作成者: jarxiv

要約経験再生はオンライン強化学習(RL)の成功の鍵となる要素である。優先経験再 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Vision-Language Models under Cultural and Inclusive Considerations

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Multi-Object Hallucination in Vision-Language Models

EventChat: Implementation and user-centric evaluation of a large language model-driven conversational recommender system for exploring leisure events in an SME context

PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers

Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning

DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation

Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions

Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks

ROER: Regularized Optimal Experience Replay

最近の投稿

最近のコメント

アーカイブ

カテゴリー