月別アーカイブ: 2024年1月

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

投稿日: 2024年1月5日作成者: jarxiv

要約 3Dパノプティックセグメンテーションは、シーン内の3D点に対する意味的注釈 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs

投稿日: 2024年1月5日作成者: jarxiv

要約 3D対応Generative Adversarial Networks（G … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

LLM Augmented LLMs: Expanding Capabilities through Composition

投稿日: 2024年1月5日作成者: jarxiv

要約大規模なデータ・コーパスで学習された数十億のパラメータを持つ基礎モデルは、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Bring Metric Functions into Diffusion Models

投稿日: 2024年1月5日作成者: jarxiv

要約本論文では、学習において付加的なメトリック関数を効果的に組み込むことにより … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ODIN: A Single Model for 2D and 3D Perception

投稿日: 2024年1月5日作成者: jarxiv

要約 ScanNetのような現代の3D知覚ベンチマークにおける最先端のモデルは、 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Learning to Prompt with Text Only Supervision for Vision-Language Models

投稿日: 2024年1月5日作成者: jarxiv

要約 CLIPのような基礎的な視覚言語モデルは、その優れた汎化能力により、視覚の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Anatomy-aware and acquisition-agnostic joint registration with SynthMorph

投稿日: 2024年1月5日作成者: jarxiv

要約アフィン画像レジストレーションは医用画像解析の要である。古典的なアルゴリズ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Vietnamese Poem Generation & The Prospect Of Cross-Language Poem-To-Poem Translation

投稿日: 2024年1月5日作成者: jarxiv

要約詩の生成は、言語、感情、文体のニュアンスを理解するモデルを必要とするため、 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Few-shot Adaptation of Multi-modal Foundation Models: A Survey

投稿日: 2024年1月5日作成者: jarxiv

要約 CLIPのようなマルチモーダル（視覚言語）モデルは、新世代の視覚基盤モデル … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HawkRover: An Autonomous mmWave Vehicular Communication Testbed with Multi-sensor Fusion and Deep Learning

投稿日: 2024年1月5日作成者: jarxiv

要約コネクテッドカーと自動運転車（CAV）は、私たちの日常生活を一変させる技術 … 続きを読む →

カテゴリー: cs.CV, cs.IT, math.IT | コメントを受け付けていません

月別アーカイブ: 2024年1月

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs

LLM Augmented LLMs: Expanding Capabilities through Composition

Bring Metric Functions into Diffusion Models

ODIN: A Single Model for 2D and 3D Perception

Learning to Prompt with Text Only Supervision for Vision-Language Models

Anatomy-aware and acquisition-agnostic joint registration with SynthMorph

Vietnamese Poem Generation & The Prospect Of Cross-Language Poem-To-Poem Translation

Few-shot Adaptation of Multi-modal Foundation Models: A Survey

HawkRover: An Autonomous mmWave Vehicular Communication Testbed with Multi-sensor Fusion and Deep Learning

最近の投稿

最近のコメント

アーカイブ

カテゴリー