「cs.AI」カテゴリーアーカイブ

Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering

投稿日: 2024年12月24日作成者: jarxiv

要約 Text-to-Image（TTI）生成モデルは目覚ましい成功を収めている … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SCBench: A Sports Commentary Benchmark for Video LLMs

投稿日: 2024年12月24日作成者: jarxiv

要約最近、学術界と産業界の両方でビデオ大規模言語モデル (ビデオ LLM) が … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions

投稿日: 2024年12月24日作成者: jarxiv

要約スパイキングニューラルネットワーク (SNN) は、時空間情報を処理で … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.NE | コメントを受け付けていません

VidTwin: Video VAE with Decoupled Structure and Dynamics

投稿日: 2024年12月24日作成者: jarxiv

要約ビデオオートエンコーダ (ビデオ AE) の最近の進歩により、ビデオ生成 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions

投稿日: 2024年12月24日作成者: jarxiv

要約 3D で自然な手とオブジェクトのインタラクションを生成することは、結果とし … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy

投稿日: 2024年12月24日作成者: jarxiv

要約人工知能の急速に進化している分野であるマルチモーダル学習は、テキスト、画像 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Cross-View Referring Multi-Object Tracking

投稿日: 2024年12月24日作成者: jarxiv

要約マルチオブジェクト追跡 (RMOT) の参照は、現在の追跡分野における重要 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models

投稿日: 2024年12月23日作成者: jarxiv

要約人間の対話を伴う環境に自律エージェントを導入すると、セキュリティ上の懸念が … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models

投稿日: 2024年12月23日作成者: jarxiv

要約 TalkWithMachines は、特に安全性が重要なアプリケーション向 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC, cs.LG, cs.RO | コメントを受け付けていません

System Safety Monitoring of Learned Components Using Temporal Metric Forecasting

投稿日: 2024年12月23日作成者: jarxiv

要約学習可能な自律システムでは、システムの動作コンテキストを考慮して、その出力 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO, cs.SE | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering

SCBench: A Sports Commentary Benchmark for Video LLMs

Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions

VidTwin: Video VAE with Decoupled Structure and Dynamics

DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions

Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy

Cross-View Referring Multi-Object Tracking

Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models

TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models

System Safety Monitoring of Learned Components Using Temporal Metric Forecasting

最近の投稿

最近のコメント

アーカイブ

カテゴリー