Monthly Archives: June 2024

D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video

Posted on: June 17, 2024 | Author: jarxiv

Summary: Dynamic reconstruction and spatiotemporal novel-view synthesis of non-rigidly deforming scenes have recently been drawing increasing …

Categories: cs.CV, cs.GR, cs.LG

Localizing Events in Videos with Multimodal Queries

Posted on: June 17, 2024 | Author: jarxiv

Summary: Video understanding is a pivotal task in the digital era, yet the dynamic …

Categories: cs.AI, cs.CV

Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation

Posted on: June 17, 2024 | Author: jarxiv

Summary: Audio-Visual Speech Recognition (AVSR …

Categories: cs.CV, cs.SD, eess.AS

Retraining-free Model Quantization via One-Shot Weight-Coupling Learning

Posted on: June 17, 2024 | Author: jarxiv

Summary: Quantization compresses over-parameterized deep neural models and …

Categories: cs.CV

SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding

Posted on: June 17, 2024 | Author: jarxiv

Summary: Remote sensing large multimodal models (RSLMMs) are rapidly …

Categories: cs.AI, cs.CV

Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval

Posted on: June 17, 2024 | Author: jarxiv

Summary: Deep metric learning (DML), in remote sensing (RS) …

Categories: cs.CV

GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors

Posted on: June 17, 2024 | Author: jarxiv

Summary: Achieving high-resolution novel view synthesis (HRNVS) from low-resolution input views …

Categories: cs.CV

Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations

Posted on: June 17, 2024 | Author: jarxiv

Summary: In part-aware panoptic segmentation (PPS), (a) image …

Categories: cs.CV

Shelf-Supervised Multi-Modal Pre-Training for 3D Object Detection

Posted on: June 17, 2024 | Author: jarxiv

Summary: For state-of-the-art 3D object detectors, large labeled datasets are often …

Categories: cs.CV, cs.LG, cs.RO

Modified Risk Formulation for Improving the Prediction of Knee Osteoarthritis Progression

Posted on: June 17, 2024 | Author: jarxiv

Summary: In current methods for predicting osteoarthritis (OA) outcomes, improving the outcome prediction model …

Categories: cs.CV, eess.IV, q-bio.QM
