「cs.AI」カテゴリーアーカイブ

MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces

投稿日: 2024年7月12日作成者: jarxiv

要約街並みや広場などの公共の都市空間は、住民にサービスを提供し、あらゆる活気に … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Video Diffusion Alignment via Reward Gradients

投稿日: 2024年7月12日作成者: jarxiv

要約私たちは、基礎的なビデオ普及モデルの構築に向けて大きな進歩を遂げました。 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

投稿日: 2024年7月12日作成者: jarxiv

要約大規模言語モデル (LLM) は、自然言語理解において大幅な進歩をもたらし … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Toto: Time Series Optimized Transformer for Observability

投稿日: 2024年7月12日作成者: jarxiv

要約この技術レポートでは、Datadog によって開発された時系列予測のための … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark

投稿日: 2024年7月12日作成者: jarxiv

要約モバイル双手動デモ駆動ロボット操作のための新しいベンチマークおよび学習環境 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Missile detection and destruction robot using detection algorithm

投稿日: 2024年7月12日作成者: jarxiv

要約この研究は、世界の現在のミサイル探知技術と、バングラデシュでシステムを導入 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Tuning Vision-Language Models with Candidate Labels by Prompt Alignment

投稿日: 2024年7月12日作成者: jarxiv

要約ビジョン言語モデル (VLM) は、画像とテキストのペアの大規模なトレーニ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard

投稿日: 2024年7月12日作成者: jarxiv

要約三目並べ、コネクトフォー、五目並べなどのグリッドベースのゲームを通じて、大 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.NE | コメントを受け付けていません

SaMoye: Zero-shot Singing Voice Conversion Based on Feature Disentanglement and Synthesis

投稿日: 2024年7月12日作成者: jarxiv

要約歌声変換 (SVC) は、元の内容を維持したまま、特定の音楽作品内の歌手の … 続きを読む →

カテゴリー: 68Txx(Primary)14F05, 91Fxx(Secondary), cs.AI, cs.MM, cs.SD, eess.AS, I.2.7 | コメントを受け付けていません

RASP: A Drone-based Reconfigurable Actuation and Sensing Platform for Engaging Physical Environments with Foundation Models

投稿日: 2024年7月11日作成者: jarxiv

要約基礎モデルと大規模言語モデルは、テキストやデジタルメディアを生成するため … 続きを読む →

カテゴリー: cs.AI, cs.HC, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces

Video Diffusion Alignment via Reward Gradients

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

Toto: Time Series Optimized Transformer for Observability

BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark

Missile detection and destruction robot using detection algorithm

Tuning Vision-Language Models with Candidate Labels by Prompt Alignment

Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard

SaMoye: Zero-shot Singing Voice Conversion Based on Feature Disentanglement and Synthesis

RASP: A Drone-based Reconfigurable Actuation and Sensing Platform for Engaging Physical Environments with Foundation Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー