月別アーカイブ: 2024年8月

microYOLO: Towards Single-Shot Object Detection on Microcontrollers

投稿日: 2024年8月29日作成者: jarxiv

要約この進行中の論文では、YOLO を使用したマイクロコントローラーでのシング … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model

投稿日: 2024年8月29日作成者: jarxiv

要約自動運転トレーニングには、さまざまな交通状況、気象シナリオ、道路の種類を含 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

投稿日: 2024年8月29日作成者: jarxiv

要約このペーパーでは、オーディオおよび言語参照ビデオオブジェクトセグメンテ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

投稿日: 2024年8月29日作成者: jarxiv

要約大規模な MLLM (l-MLLM) から知識を抽出することで、小規模なマ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors

投稿日: 2024年8月29日作成者: jarxiv

要約 3D 臨床医療画像の正確なセグメンテーションは、脊椎疾患の診断と治療におい … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data

投稿日: 2024年8月29日作成者: jarxiv

要約複数の部位とスキャナーからの神経画像データセットを組み合わせると、統計的検 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones

投稿日: 2024年8月29日作成者: jarxiv

要約 Gen-Swarms は、ディープ生成モデルの機能を活用し、リアクティブ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Infusion: internal diffusion for inpainting of dynamic textures and complex motion

投稿日: 2024年8月29日作成者: jarxiv

要約ビデオ修復は、視覚的に説得力のある方法でビデオ内の領域を塗りつぶすタスクで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization

投稿日: 2024年8月29日作成者: jarxiv

要約テキストから画像へのパーソナライゼーションの最近の進歩により、ユーザーが提 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

投稿日: 2024年8月29日作成者: jarxiv

要約特定領域のタスクを解決するための大規模言語モデル (LLM) の専門知識を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年8月

microYOLO: Towards Single-Shot Object Detection on Microcontrollers

GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors

Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data

Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones

Infusion: internal diffusion for inpainting of dynamic textures and complex motion

CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー