「cs.AI」カテゴリーアーカイブ

D$^3$epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes

投稿日: 2024年11月8日作成者: jarxiv

要約深度推定はロボット工学において重要な技術です。最近、自己教師あり深度推定 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

A multi-purpose automatic editing system based on lecture semantics for remote education

投稿日: 2024年11月8日作成者: jarxiv

要約遠隔授業は、その利便性と安全性により、特にパンデミックのような極端な状況下 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM | コメントを受け付けていません

ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset

投稿日: 2024年11月8日作成者: jarxiv

要約ファサードのセマンティックセグメンテーションは、写真測量とコンピュータ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

C3T: Cross-modal Transfer Through Time for Human Action Recognition

投稿日: 2024年11月8日作成者: jarxiv

要約多様なセンサーの可能性を解き放つために、人間行動認識 (HAR) のための … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.LG, eess.SP | コメントを受け付けていません

Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic Analysis

投稿日: 2024年11月8日作成者: jarxiv

要約 QUIC は、ますます使用されている新しいトランスポートプロトコルであり … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.NI | コメントを受け付けていません

StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration

投稿日: 2024年11月8日作成者: jarxiv

要約 AI 生成コンテンツ (AIGC) の出現により、従来のプロセスを合理化す … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MA | コメントを受け付けていません

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

投稿日: 2024年11月8日作成者: jarxiv

要約このペーパーでは、ビデオ拡散を使用して単一の画像からフォトリアリスティック … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR | コメントを受け付けていません

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

投稿日: 2024年11月8日作成者: jarxiv

要約ドキュメントからの質問に答えるドキュメントビジュアル質問応答 (DocV … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification

投稿日: 2024年11月8日作成者: jarxiv

要約潜在ビデオ拡散モデルは、生成された画質と時間的一貫性のおかげで、一般の観察 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

HourVideo: 1-Hour Video-Language Understanding

投稿日: 2024年11月8日作成者: jarxiv

要約 1 時間のビデオ言語理解のためのベンチマークデータセットである Hour … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

D$^3$epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes

A multi-purpose automatic editing system based on lecture semantics for remote education

ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset

C3T: Cross-modal Transfer Through Time for Human Action Recognition

Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic Analysis

StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification

HourVideo: 1-Hour Video-Language Understanding

最近の投稿

最近のコメント

アーカイブ

カテゴリー