月別アーカイブ: 2024年7月

SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

投稿日: 2024年7月2日作成者: jarxiv

要約このペーパーでは、スケッチベースのクエリインターフェイスを使用してビデオ … 続きを読む →

カテゴリー: cs.CV, cs.DB, cs.LG | コメントを受け付けていません

Is Synthetic Data all We Need? Benchmarking the Robustness of Models Trained with Synthetic Images

投稿日: 2024年7月2日作成者: jarxiv

要約機械学習アプローチの開発における長年の課題は、高品質のラベル付きデータが不 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Video Anomaly Detection in 10 Years: A Survey and Outlook

投稿日: 2024年7月2日作成者: jarxiv

要約ビデオ異常検出 (VAD) は、監視、医療、環境モニタリングなどのさまざま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

投稿日: 2024年7月2日作成者: jarxiv

要約このレポートでは、Ego4D チャレンジの 5 トラックと EPIC-Ki … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Long Context Transfer from Language to Vision

投稿日: 2024年7月2日作成者: jarxiv

要約ビデオシーケンスは貴重な時間情報を提供しますが、既存の大規模マルチモーダ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach

投稿日: 2024年7月2日作成者: jarxiv

要約衛星画像 (SAI) におけるシーングラフ生成 (SGG) は、知覚から … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Deep Active Audio Feature Learning in Resource-Constrained Environments

投稿日: 2024年7月2日作成者: jarxiv

要約ラベル付きデータが不足しているため、生体音響アプリケーションでのディープ … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS | コメントを受け付けていません

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

投稿日: 2024年7月2日作成者: jarxiv

要約 Image-to-Video (I2V) 生成は、最初のフレームを (テキ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross-Domain

投稿日: 2024年7月2日作成者: jarxiv

要約この研究では、高い攻撃成功率 (\textbf{ASR}) と優れた汎用性 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge Graphs

投稿日: 2024年7月2日作成者: jarxiv

要約自動運転における軌道予測は、交通参加者、道路トポロジー、交通標識、およびそ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

月別アーカイブ: 2024年7月

SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

Is Synthetic Data all We Need? Benchmarking the Robustness of Models Trained with Synthetic Images

Video Anomaly Detection in 10 Years: A Survey and Outlook

EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

Long Context Transfer from Language to Vision

Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach

Deep Active Audio Feature Learning in Resource-Constrained Environments

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross-Domain

SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge Graphs

最近の投稿

最近のコメント

アーカイブ

カテゴリー