「cs.AI」カテゴリーアーカイブ

Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models

投稿日: 2024年7月29日作成者: jarxiv

要約テキストから画像への拡散モデルは、印象的でリアルな画像を生成しますが、2D … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Quality Assured: Rethinking Annotation Strategies in Imaging AI

投稿日: 2024年7月29日作成者: jarxiv

要約この論文では新しい方法については説明しません。その代わりに、信頼性の高い … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Learning to Visually Connect Actions and their Effects

投稿日: 2024年7月29日作成者: jarxiv

要約ビデオ理解におけるアクションとその効果の視覚的接続 (CATE) という新 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment

投稿日: 2024年7月29日作成者: jarxiv

要約画像分類モデルは、対象オブジェクトの視覚的な視点の違いや照明の不一致によっ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence

投稿日: 2024年7月29日作成者: jarxiv

要約ドメインアダプテーション (DA) は、ソースドメインから関連するター … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

A Scalable Quantum Non-local Neural Network for Image Classification

投稿日: 2024年7月29日作成者: jarxiv

要約非ローカル演算はコンピュータビジョンにおいて重要な役割を果たし、入力全体 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.IT, cs.LG, math.IT, quant-ph | コメントを受け付けていません

Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic

投稿日: 2024年7月29日作成者: jarxiv

要約最近の進歩により、画像からテキストへのコンテンツの生成と理解におけるマルチ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Exploring Scaling Trends in LLM Robustness

投稿日: 2024年7月29日作成者: jarxiv

要約言語モデルの機能は、モデルのサイズとトレーニングデータをスケーリングする … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG, I.2.7 | コメントを受け付けていません

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

投稿日: 2024年7月29日作成者: jarxiv

要約基礎モデルでインテリジェントなエージェントの動作を可能にするための中心的な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

AutoRE: Document-Level Relation Extraction with Large Language Models

投稿日: 2024年7月29日作成者: jarxiv

要約大規模言語モデル (LLM) は、テキストの理解と生成において優れた能力を … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Learning to Visually Connect Actions and their Effects

Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence

A Scalable Quantum Non-local Neural Network for Image Classification

Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic

Exploring Scaling Trends in LLM Robustness

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

AutoRE: Document-Level Relation Extraction with Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー