「cs.AI」カテゴリーアーカイブ

Conformal Predictions for Human Action Recognition with Vision-Language Models

投稿日: 2025年2月11日作成者: jarxiv

要約 Human-in-the-Loop（HITL）フレームワークは、多くの現実 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Few-Shot Classification and Anatomical Localization of Tissues in SPECT Imaging

投稿日: 2025年2月11日作成者: jarxiv

要約正確な分類と解剖学的局在は、効果的な医療診断と研究に不可欠であり、深い学習 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Do generative video models learn physical principles from watching videos?

投稿日: 2025年2月11日作成者: jarxiv

要約 AIビデオ生成は革命を起こしており、品質とリアリズムが急速に進歩しています … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale Analysis

投稿日: 2025年2月11日作成者: jarxiv

要約人の再識別（REID）は、コンピュータービジョンの重要な課題であり、さまざ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

GHOST: Gaussian Hypothesis Open-Set Technique

投稿日: 2025年2月11日作成者: jarxiv

要約大規模な認識方法の評価は、通常、全体的なパフォーマンスに焦点を当てています … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning

投稿日: 2025年2月11日作成者: jarxiv

要約このペーパーでは、生の波形に直接適用されるリアルタイムの因果オーディオ除去 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.SD, eess.AS | コメントを受け付けていません

DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm$^2$ Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion

投稿日: 2025年2月11日作成者: jarxiv

要約最新の通信システムにおけるDeep Neural Network（DNN） … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.CV | コメントを受け付けていません

Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation

投稿日: 2025年2月11日作成者: jarxiv

要約テキストからイメージ（T2I）生成拡散モデルは、テキストキャプションから多 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation

投稿日: 2025年2月11日作成者: jarxiv

要約小さな拡散モデルを使用して大きな画像を生成すると、大規模なモデルのトレーニ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification

投稿日: 2025年2月11日作成者: jarxiv

要約特定のタスクの事前訓練を受けたビジョンモデルを微調整することは、コンピュー … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Conformal Predictions for Human Action Recognition with Vision-Language Models

Few-Shot Classification and Anatomical Localization of Tissues in SPECT Imaging

Do generative video models learn physical principles from watching videos?

CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale Analysis

GHOST: Gaussian Hypothesis Open-Set Technique

CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning

DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm$^2$ Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion

Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation

Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation

KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification

最近の投稿

最近のコメント

アーカイブ

カテゴリー