「cs.AI」カテゴリーアーカイブ

PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning

投稿日: 2024年5月17日作成者: jarxiv

要約リモートセンシングによる画像とテキストの検索は、リモートセンシングによ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features

投稿日: 2024年5月17日作成者: jarxiv

要約この論文では、ディープニューラルネットワーク (DNN) 学習相互作用 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

投稿日: 2024年5月17日作成者: jarxiv

要約この作業の目標は、自然な話し顔とテキストからの音声出力を同時に生成すること … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.SD, eess.AS, eess.IV | コメントを受け付けていません

FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models

投稿日: 2024年5月17日作成者: jarxiv

要約ノイズとキャプションの品質は視覚言語対比事前トレーニングに影響を与える重要 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

投稿日: 2024年5月17日作成者: jarxiv

要約特殊な視覚指示に従うデータに基づいて微調整された大規模なビジョン言語モデル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

4D Panoptic Scene Graph Generation

投稿日: 2024年5月17日作成者: jarxiv

要約私たちは 3 次元空間に生きながら、時間という 4 次元を進んでいます。 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

投稿日: 2024年5月17日作成者: jarxiv

要約ロボットに幅広いスキルのレパートリーを持たせるには、報酬関数の学習が依然と … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration

投稿日: 2024年5月16日作成者: jarxiv

要約画像からポイントクラウド (I2P) への登録は、ロボットや自動運転車が … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving

投稿日: 2024年5月16日作成者: jarxiv

要約この研究では、自動運転機械学習タスクにおける効率的なデータキュレーションの … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Explainable AI for Ship Collision Avoidance: Decoding Decision-Making Processes and Behavioral Intentions

投稿日: 2024年5月16日作成者: jarxiv

要約この研究では、船舶の衝突回避のための説明可能な AI を開発しました。当 … 続きを読む →

カテゴリー: cs.AI, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning

Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features

Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

4D Panoptic Scene Graph Generation

Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration

Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving

Explainable AI for Ship Collision Avoidance: Decoding Decision-Making Processes and Behavioral Intentions

最近の投稿

最近のコメント

アーカイブ

カテゴリー