「cs.AI」カテゴリーアーカイブ

Everyone Can Be Picasso? A Computational Framework into the Myth of Human versus AI Painting

投稿日: 2024年2月23日作成者: jarxiv

要約最近の AI テクノロジー、特に AI 生成コンテンツ (AIGC) の進 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, H.5.2 | コメントを受け付けていません

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation

投稿日: 2024年2月23日作成者: jarxiv

要約オープンワールドのロボット操作のための高レベルのタスク計画とコード生成の急 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.RO, I.2.10 | コメントを受け付けていません

Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data

投稿日: 2024年2月23日作成者: jarxiv

要約このペーパーでは、さまざまなプロンプト戦略とデータ形式を通じて表形式データ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Visual Hallucinations of Multi-modal Large Language Models

投稿日: 2024年2月23日作成者: jarxiv

要約幻視 (VH) とは、マルチモーダル LLM (MLLM) が視覚的な質問 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Mitigating Gender Bias in Face Recognition Using the von Mises-Fisher Mixture Model

投稿日: 2024年2月23日作成者: jarxiv

要約日常の幅広い用途における深層学習アルゴリズムの高いパフォーマンスと信頼性に … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

投稿日: 2024年2月23日作成者: jarxiv

要約画像を生成するための現代のモデルは、驚くべき品質と多用途性を示しています。 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset

投稿日: 2024年2月23日作成者: jarxiv

要約大規模マルチモーダルモデル (LMM) の最近の進歩により、MathVi … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, math.HO | コメントを受け付けていません

AesFA: An Aesthetic Feature-Aware Arbitrary Neural Style Transfer

投稿日: 2024年2月23日作成者: jarxiv

要約ニューラルスタイルトランスファー (NST) は近年大幅に進化しました … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion

投稿日: 2024年2月23日作成者: jarxiv

要約この研究では、手とオブジェクトのインタラクション (HOI) のノイズを除 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition

投稿日: 2024年2月23日作成者: jarxiv

要約不正確な監視を使用した弱い監視による視覚認識は、重要かつ困難な学習問題です … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Everyone Can Be Picasso? A Computational Framework into the Myth of Human versus AI Painting

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation

Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data

Visual Hallucinations of Multi-modal Large Language Models

Mitigating Gender Bias in Face Recognition Using the von Mises-Fisher Mixture Model

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset

AesFA: An Aesthetic Feature-Aware Arbitrary Neural Style Transfer

GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion

WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition

最近の投稿

最近のコメント

アーカイブ

カテゴリー