月別アーカイブ: 2024年1月

Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction

投稿日: 2024年1月8日作成者: jarxiv

要約人間の軌跡予測は、ロボット工学や自動運転などの分野において重要な課題です。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Language-free Compositional Action Generation via Decoupling Refinement

投稿日: 2024年1月8日作成者: jarxiv

要約単純な要素を複雑なコンセプトに組み込むことは、特に 3D アクション生成の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SPFormer: Enhancing Vision Transformer with Superpixel Representation

投稿日: 2024年1月8日作成者: jarxiv

要約この作品では、スーパーピクセル表現によって強化された新しいビジョントラン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhancing Network Initialization for Medical AI Models Using Large-Scale, Unlabeled Natural Images

投稿日: 2024年1月8日作成者: jarxiv

要約 ImageNet のような事前トレーニングデータセットは、医療画像分析の … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Locally Adaptive Neural 3D Morphable Models

投稿日: 2024年1月8日作成者: jarxiv

要約 3D メッシュの生成と操作を学習するための柔軟性の高い自動エンコーダー ( … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TreeLearn: A Comprehensive Deep Learning Method for Segmenting Individual Trees from Ground-Based LiDAR Forest Point Clouds

投稿日: 2024年1月8日作成者: jarxiv

要約レーザースキャンされた森林の点群により、森林管理のための貴重な情報を抽出す … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively

投稿日: 2024年1月8日作成者: jarxiv

要約 CLIP および Segment Anything Model (SAM) … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MC-ViViT: Multi-branch Classifier-ViViT to detect Mild Cognitive Impairment in older adults using facial videos

投稿日: 2024年1月8日作成者: jarxiv

要約畳み込みニューラルネットワーク (CNN) を含む深層機械学習モデルは、 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Denoising Vision Transformers

投稿日: 2024年1月8日作成者: jarxiv

要約私たちは、ビジョントランスフォーマー (ViT) に固有の微妙だが重要な … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Retrieval-Augmented Generation for Large Language Models: A Survey

投稿日: 2024年1月8日作成者: jarxiv

要約大規模言語モデル (LLM) は重要な機能を実証していますが、幻覚、古い知 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

月別アーカイブ: 2024年1月

Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction

Language-free Compositional Action Generation via Decoupling Refinement

SPFormer: Enhancing Vision Transformer with Superpixel Representation

Enhancing Network Initialization for Medical AI Models Using Large-Scale, Unlabeled Natural Images

Locally Adaptive Neural 3D Morphable Models

TreeLearn: A Comprehensive Deep Learning Method for Segmenting Individual Trees from Ground-Based LiDAR Forest Point Clouds

Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively

MC-ViViT: Multi-branch Classifier-ViViT to detect Mild Cognitive Impairment in older adults using facial videos

Denoising Vision Transformers

Retrieval-Augmented Generation for Large Language Models: A Survey

最近の投稿

最近のコメント

アーカイブ

カテゴリー