月別アーカイブ: 2024年8月

Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express

投稿日: 2024年8月30日作成者: jarxiv

要約ユーザーのコンテンツとクエリがますますマルチモーダルになるにつれて、効果的 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR | コメントを受け付けていません

Towards Infusing Auxiliary Knowledge for Distracted Driver Detection

投稿日: 2024年8月30日作成者: jarxiv

要約わき見運転は世界的に交通事故の主な原因となっています。注意散漫運転の特定 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, I.2.0 | コメントを受け付けていません

Verification of Geometric Robustness of Neural Networks via Piecewise Linear Approximation and Lipschitz Optimisation

投稿日: 2024年8月30日作成者: jarxiv

要約私たちは、回転、スケーリング、せん断、平行移動などの入力画像の幾何学的変換 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Sparse Signal Reconstruction for Overdispersed Low-photon Count Biomedical Imaging Using $\ell_p$ Total Variation

投稿日: 2024年8月30日作成者: jarxiv

要約ポアソン分布モデルを一般化した負の二項モデルは、医療用画像処理などの低光子 … 続きを読む →

カテゴリー: cs.CV, eess.IV, eess.SP, math.OC | コメントを受け付けていません

Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning

投稿日: 2024年8月30日作成者: jarxiv

要約長距離から撮影した画像は、温度がランダムな空気セルの乱流により、屈折率が変 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Trajectory Forecasting through Low-Rank Adaptation of Discrete Latent Codes

投稿日: 2024年8月30日作成者: jarxiv

要約軌跡予測は、一連のエージェント (例: エージェント) の将来の動きを予測 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach

投稿日: 2024年8月30日作成者: jarxiv

要約ビデオから人間の行動を理解することは、スポーツを含む多くの分野で不可欠です … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection

投稿日: 2024年8月30日作成者: jarxiv

要約顕著なオブジェクト検出 (SOD) は従来、ImageNet の事前トレー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving

投稿日: 2024年8月30日作成者: jarxiv

要約自動運転技術の進歩により、現実世界のシナリオを理解して予測するための、ます … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition

投稿日: 2024年8月30日作成者: jarxiv

要約視覚的なストーリーテリングは、時間的に順序付けられた一連の画像を与えられて … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年8月

Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express

Towards Infusing Auxiliary Knowledge for Distracted Driver Detection

Verification of Geometric Robustness of Neural Networks via Piecewise Linear Approximation and Lipschitz Optimisation

Sparse Signal Reconstruction for Overdispersed Low-photon Count Biomedical Imaging Using $\ell_p$ Total Variation

Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning

Trajectory Forecasting through Low-Rank Adaptation of Discrete Latent Codes

3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach

SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection

DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving

Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition

最近の投稿

最近のコメント

アーカイブ

カテゴリー