「cs.AI」カテゴリーアーカイブ

Hallucination Benchmark in Medical Visual Question Answering

投稿日: 2024年1月12日作成者: jarxiv

要約視覚質問応答 (VQA) に関する大規模な言語および視覚モデルの最近の成功 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models

投稿日: 2024年1月12日作成者: jarxiv

要約 Arbitrary Style Transfer (AST) の目標は、ス … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Machine Learning Applications in Traumatic Brain Injury: A Spotlight on Mild TBI

投稿日: 2024年1月12日作成者: jarxiv

要約外傷性脳損傷（TBI）は、世界的な公衆衛生上の重大な課題を引き起こしており … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians

投稿日: 2024年1月12日作成者: jarxiv

要約我々は、RGB画像のみを入力して、高速なレンダリング速度でコンパクトな3D … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Heterogeneous Generative Knowledge Distillation with Masked Image Modeling

投稿日: 2024年1月12日作成者: jarxiv

要約通常、小規模な CNN ベースのモデルは、計算リソースが制限されたエッジ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

An attempt to generate new bridge types from latent space of PixelCNN

投稿日: 2024年1月12日作成者: jarxiv

要約生成人工知能テクノロジーを使用して、新しい種類の橋を生成してみます。 Py … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

How does the primate brain combine generative and discriminative computations in vision?

投稿日: 2024年1月12日作成者: jarxiv

要約ビジョンは推論問題として広く理解されています。しかし、推論プロセスの 2 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, q-bio.NC | コメントを受け付けていません

Surgical-DINO: Adapter Learning of Foundation Model for Depth Estimation in Endoscopic Surgery

投稿日: 2024年1月12日作成者: jarxiv

要約目的: ロボット手術における深さの推定は、3D 再構成、手術ナビゲーション … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Linear Spaces of Meanings: Compositional Structures in Vision-Language Models

投稿日: 2024年1月12日作成者: jarxiv

要約私たちは、事前にトレーニングされたビジョン言語モデル (VLM) からデー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Manipulating Feature Visualizations with Gradient Slingshots

投稿日: 2024年1月12日作成者: jarxiv

要約ディープニューラルネットワーク (DNN) は、複雑で多彩な表現を学習 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Hallucination Benchmark in Medical Visual Question Answering

HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models

Machine Learning Applications in Traumatic Brain Injury: A Spotlight on Mild TBI

CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians

Heterogeneous Generative Knowledge Distillation with Masked Image Modeling

An attempt to generate new bridge types from latent space of PixelCNN

How does the primate brain combine generative and discriminative computations in vision?

Surgical-DINO: Adapter Learning of Foundation Model for Depth Estimation in Endoscopic Surgery

Linear Spaces of Meanings: Compositional Structures in Vision-Language Models

Manipulating Feature Visualizations with Gradient Slingshots

最近の投稿

最近のコメント

アーカイブ

カテゴリー