Monthly Archives: May 2024

Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes

Posted on May 20, 2024 by jarxiv

Summary: Scaling NeRF to learn a large number of semantically similar scenes …

Categories: cs.CV, cs.LG

EasyGen: Easing Multimodal Generation with BiDiffuser and LLMs

Posted on May 20, 2024 by jarxiv

Summary: We, relying mainly on encoders such as CLIP and ImageBind, …

Categories: cs.AI, cs.CL, cs.CV

Dual-band feature selection for maturity classification of specialty crops by hyperspectral imaging

Posted on May 20, 2024 by jarxiv

Summary: Maturity classification of specialty crops such as strawberries and tomatoes, selective harvesting at production and packaging sites …

Categories: cs.CV

DSD-DA: Distillation-based Source Debiasing for Domain Adaptive Object Detection

Posted on May 20, 2024 by jarxiv

Summary: Domain adaptive object detection (DAOD) methods based on feature alignment have remarkable …

Categories: cs.CV

From Sora What We Can See: A Survey of Text-to-Video Generation

Posted on May 20, 2024 by jarxiv

Summary: With remarkable achievements, artificial intelligence has set out on the path toward artificial general intelligence. O …

Categories: cs.AI, cs.CV

Pose2Gest: A Few-Shot Model-Free Approach Applied In South Indian Classical Dance Gesture Recognition

Posted on May 20, 2024 by jarxiv

Summary: Indian classical dance uses a set of hand gestures known as mudras, and posture …

Categories: cs.CL, cs.CV

Prospective Role of Foundation Models in Advancing Autonomous Vehicles

Posted on May 20, 2024 by jarxiv

Summary: With the development of artificial intelligence and advances in deep learning, large foundation models such as GPT and Sora …

Categories: cs.AI, cs.CV, cs.RO

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Posted on May 20, 2024 by jarxiv

Summary: Weakly supervised audio-visual video parsing (AVVP) methods, video-level …

Categories: cs.CV

LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion

Posted on May 20, 2024 by jarxiv

Summary: The infant brain develops rapidly during the first few years after birth. Compared with cross-sectional studies, longitudinal …

Categories: cs.CV, eess.IV

Autonomous AI-enabled Industrial Sorting Pipeline for Advanced Textile Recycling

Posted on May 20, 2024 by jarxiv

Summary: As the volume of textile waste grows worldwide, reducing environmental impact and fashion …

Categories: cs.CV
