「cs.AI」カテゴリーアーカイブ

Theia: Distilling Diverse Vision Foundation Models for Robot Learning

投稿日: 2024年7月30日作成者: jarxiv

要約視覚入力をアクションにマッピングする視覚ベースのロボットポリシー学習では … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction

投稿日: 2024年7月30日作成者: jarxiv

要約グラフベースの全体的なシーン表現は、外科ワークフローの理解を容易にし、最近 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Matryoshka Multimodal Models

投稿日: 2024年7月30日作成者: jarxiv

要約 LLaVA などの大規模マルチモーダルモデル (LMM) は、視覚言語推 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

SAPG: Split and Aggregate Policy Gradients

投稿日: 2024年7月30日作成者: jarxiv

要約極端なサンプルの非効率にもかかわらず、ポリシーに基づく強化学習、別名ポリシ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing

投稿日: 2024年7月30日作成者: jarxiv

要約テキストベースの編集普及モデルは、ユーザーの入力指示があいまいな場合、パフ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training

投稿日: 2024年7月30日作成者: jarxiv

要約正確なリアルタイムの物体検出は、安全監視から品質管理に至るまで、数多くの産 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network

投稿日: 2024年7月30日作成者: jarxiv

要約軌道予測は、交通参加者の将来の動きを予測することを目的としているため、自動 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

A Role-specific Guided Large Language Model for Ophthalmic Consultation Based on Stylistic Differentiation

投稿日: 2024年7月30日作成者: jarxiv

要約眼科の診察は、目の病気の診断、治療、予防にとって非常に重要です。しかし、 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery

投稿日: 2024年7月30日作成者: jarxiv

要約因果関係発見は、観測データに基づいて変数間の因果構造を推定することを目的と … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving

投稿日: 2024年7月29日作成者: jarxiv

要約現実世界の複雑なシナリオを安全にナビゲートするには、自動運転車はさまざまな … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Theia: Distilling Diverse Vision Foundation Models for Robot Learning

SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction

Matryoshka Multimodal Models

SAPG: Split and Aggregate Policy Gradients

Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing

DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training

Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network

A Role-specific Guided Large Language Model for Ophthalmic Consultation Based on Stylistic Differentiation

Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery

CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving

最近の投稿

最近のコメント

アーカイブ

カテゴリー