「cs.CV」カテゴリーアーカイブ

AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation

投稿日: 2024年12月20日作成者: jarxiv

要約 Text-to-Image (T2I) 拡散モデルは、画像生成において目覚 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling

投稿日: 2024年12月20日作成者: jarxiv

要約画像生成用の連続トークンを使用した自己回帰 (AR) モデルの最近の進歩に … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding

投稿日: 2024年12月19日作成者: jarxiv

要約交通事故の認識は、自動運転システムや道路監視システムにとって不可欠な部分で … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction

投稿日: 2024年12月19日作成者: jarxiv

要約安全で効率的な自動運転を確保するには、正確な車両軌道予測が不可欠です。こ … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?

投稿日: 2024年12月19日作成者: jarxiv

要約視覚的に実行可能なアフォーダンスは、ロボット工学における革新的なアプローチ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Semantics-Aware Next-best-view Planning for Efficient Search and Detection of Task-relevant Plant Parts

投稿日: 2024年12月19日作成者: jarxiv

要約植物のタスクに関連する部分を検索および検出することは、ロボットを使用してト … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?

投稿日: 2024年12月19日作成者: jarxiv

要約ピクセルや点群などの高次元の視覚入力からポリシーを学習することは、さまざま … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Unified Understanding of Environment, Task, and Human for Human-Robot Interaction in Real-World Environments

投稿日: 2024年12月19日作成者: jarxiv

要約現実世界のシナリオで人間とロボットの相互作用 (HRI) タスクを促進する … 続きを読む →

カテゴリー: cs.CV, cs.HC, cs.RO | コメントを受け付けていません

CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers?

投稿日: 2024年12月19日作成者: jarxiv

要約 AI支援設計のための汎用CADエージェント「CAD-Assistant」を … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Signal Reconstruction from Samples at Unknown Locations with Application to 2D Unknown View Tomography

投稿日: 2024年12月19日作成者: jarxiv

要約サンプリングレートが十分に高い場合、帯域制限された信号を均一間隔のサンプ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling

Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding

Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction

ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?

Semantics-Aware Next-best-view Planning for Efficient Search and Detection of Task-relevant Plant Parts

When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?

Unified Understanding of Environment, Task, and Human for Human-Robot Interaction in Real-World Environments

CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers?

Signal Reconstruction from Samples at Unknown Locations with Application to 2D Unknown View Tomography

最近の投稿

最近のコメント

アーカイブ

カテゴリー