"cs.CV" Category Archive

PACE: marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization

Posted: September 26, 2024 | Author: jarxiv

Abstract: Parameter-Efficient Fine-Tuning (PEFT …

Categories: cs.CV, cs.LG

Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation

Posted: September 26, 2024 | Author: jarxiv

Abstract: With recent advances in image tokenizers such as VQ-VAE, language modeling and …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

Attention Prompting on Image for Large Vision-Language Models

Posted: September 26, 2024 | Author: jarxiv

Abstract: Compared with Large Language Models (LLMs), Large Vision-Language Models (LVLM …

Categories: cs.AI, cs.CV

DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

Posted: September 26, 2024 | Author: jarxiv

Abstract: Pre-trained 2D diffusion models and score distillation sampling (SDS …

Categories: cs.CV, cs.GR, cs.LG

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Posted: September 26, 2024 | Author: jarxiv

Abstract: Today's most advanced multimodal models remain proprietary. The strongest …

Categories: cs.CL, cs.CV, cs.LG

Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed

Posted: September 26, 2024 | Author: jarxiv

Abstract: For efficient and safe autonomous driving, autonomous vehicles must … the motion of other traffic agents …

Categories: cs.CV, cs.RO

A Computer Vision Approach for Autonomous Cars to Drive Safe at Construction Zone

Posted: September 26, 2024 | Author: jarxiv

Abstract: To build smarter and safer cities, safe, efficient, and sustainable transportation systems …

Categories: cs.CV, cs.RO

Toward Unified Practices in Trajectory Prediction Research on Drone Datasets

Posted: September 26, 2024 | Author: jarxiv

Abstract: For the development of behavior prediction algorithms for autonomous vehicles, the availability of high-quality datasets …

Categories: cs.CV, cs.RO

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

Posted: September 26, 2024 | Author: jarxiv

Abstract: Vision-Language-Action (VLA) models, through an end-to-end learning process, …

Categories: cs.CV, cs.RO

Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients

Posted: September 26, 2024 | Author: jarxiv

Abstract: Image-to-image translation, while preserving core content and structure, converts an image from one …

Categories: cs.CV, eess.IV
