「cs.CV」カテゴリーアーカイブ

Corn Ear Detection and Orientation Estimation Using Deep Learning

投稿日: 2024年12月20日作成者: jarxiv

要約穂の発達などのトウモロコシ植物の成長挙動を監視すると、植物の健康状態と発育 … 続きを読む →

カテゴリー: (Primary), 68T45, cs.CV, cs.LG, I.2.10 | コメントを受け付けていません

G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o

投稿日: 2024年12月20日作成者: jarxiv

要約視覚的なキャプションの評価指標は重要ですが、十分に検討されていません。 B … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination

投稿日: 2024年12月20日作成者: jarxiv

要約世界モデルは、エージェントにその環境の表現を提供し、エージェントがその行動 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

TDCNet: Transparent Objects Depth Completion with CNN-Transformer Dual-Branch Parallel Network

投稿日: 2024年12月20日作成者: jarxiv

要約透明な物体の感知と操作は、産業用および実験用ロボット工学において重大な課題 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

IDOL: Instant Photorealistic 3D Human Creation from a Single Image

投稿日: 2024年12月20日作成者: jarxiv

要約人間の外観やポーズは多様であり、利用できる高品質のトレーニングデータが限 … 続きを読む →

カテゴリー: 68T07, 68T45, 68U05, cs.CV, cs.GR, cs.LG, I.2.10 | コメントを受け付けていません

Movie2Story: A framework for understanding videos and telling stories in the form of novel text

投稿日: 2024年12月20日作成者: jarxiv

要約マルチモーダルビデオからテキストへのモデルは、主にビデオコンテンツの簡単な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

PhotoHolmes: a Python library for forgery detection in digital images

投稿日: 2024年12月20日作成者: jarxiv

要約このペーパーでは、デジタル画像に対する偽造検出方法を簡単に実行してベンチマ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations

投稿日: 2024年12月20日作成者: jarxiv

要約相当量の 3D 多関節オブジェクトデータの取得には費用と時間がかかり、そ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Optimized Gradient Clipping for Noisy Label Learning

投稿日: 2024年12月20日作成者: jarxiv

要約これまでの研究では、モデルの予測確率に関して損失関数の勾配を制約すると、ノ … 続きを読む →

カテゴリー: 68T07, 68T10, cs.CV, cs.LG, I.2.6 | コメントを受け付けていません

Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos

投稿日: 2024年12月20日作成者: jarxiv

要約既存のスケルトンベースの人間の行動分類モデルは、トレーニングとテストの両方 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Corn Ear Detection and Orientation Estimation Using Deep Learning

G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o

Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination

TDCNet: Transparent Objects Depth Completion with CNN-Transformer Dual-Branch Parallel Network

IDOL: Instant Photorealistic 3D Human Creation from a Single Image

Movie2Story: A framework for understanding videos and telling stories in the form of novel text

PhotoHolmes: a Python library for forgery detection in digital images

Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations

Optimized Gradient Clipping for Noisy Label Learning

Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos

最近の投稿

最近のコメント

アーカイブ

カテゴリー