「cs.AI」カテゴリーアーカイブ

Adaptive Visual Imitation Learning for Robotic Assisted Feeding Across Varied Bowl Configurations and Food Types

投稿日: 2024年3月20日作成者: jarxiv

要約この研究では、ロボット支援給餌 (RAF) のための空間注意モジュールを備 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

SynCDR : Training Cross Domain Retrieval Models with Synthetic Data

投稿日: 2024年3月20日作成者: jarxiv

要約クロスドメイン検索では、2 つの視覚ドメインにわたって同じ意味カテゴリから … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Vertical Federated Image Segmentation

投稿日: 2024年3月20日作成者: jarxiv

要約画像ベースの問題に対する AI ソリューションの普及に伴い、データのプライ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.DC, cs.LG, I.2.8 | コメントを受け付けていません

Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

投稿日: 2024年3月20日作成者: jarxiv

要約大規模な視覚言語の事前トレーニング済みモデルは、さまざまなビデオタスクで … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling

投稿日: 2024年3月20日作成者: jarxiv

要約顔表情認識 (FER) は、コンピュータビジョンにおいて重要な役割を果た … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models

投稿日: 2024年3月20日作成者: jarxiv

要約ビジョン言語モデル (VLM) の進歩により、特にゼロショット学習設定にお … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

WHAC: World-grounded Humans and Cameras

投稿日: 2024年3月20日作成者: jarxiv

要約単眼ビデオからワールド座標系で正確なスケールで人間とカメラの軌跡を推定する … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG, cs.RO | コメントを受け付けていません

TexTile: A Differentiable Metric for Texture Tileability

投稿日: 2024年3月20日作成者: jarxiv

要約我々は、繰り返しアーティファクトを導入することなくテクスチャ画像をそれ自体 … 続きを読む →

カテゴリー: 68T07, 68U05, cs.AI, cs.CV, cs.GR, cs.LG, I.2.10 | コメントを受け付けていません

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

投稿日: 2024年3月20日作成者: jarxiv

要約自動運転車 (AV) が動的で人間とロボットが混在する環境で安全に動作する … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

投稿日: 2024年3月20日作成者: jarxiv

要約大規模言語モデル (LLM) で生成されたカテゴリ固有のプロンプトのプロン … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Adaptive Visual Imitation Learning for Robotic Assisted Feeding Across Varied Bowl Configurations and Food Types

SynCDR : Training Cross Domain Retrieval Models with Synthetic Data

Vertical Federated Image Segmentation

Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling

Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models

WHAC: World-grounded Humans and Cameras

TexTile: A Differentiable Metric for Texture Tileability

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

最近の投稿

最近のコメント

アーカイブ

カテゴリー