「cs.AI」カテゴリーアーカイブ

Baichuan-Omni Technical Report

投稿日: 2024年12月30日作成者: jarxiv

要約 GPT-4o の顕著なマルチモーダル機能とインタラクティブなエクスペリエン … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization

投稿日: 2024年12月30日作成者: jarxiv

要約画像内の改ざんされたピクセルを特定することに重点を置いた画像偽造位置特定は … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

A Review on the Integration of Artificial Intelligence and Medical Imaging in IVF Ovarian Stimulation

投稿日: 2024年12月30日作成者: jarxiv

要約人工知能 (AI) は、体外受精 (IVF) における意思決定を強化し、治 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

投稿日: 2024年12月30日作成者: jarxiv

要約ビジョン言語モデル (VLM) を活用したグラフィカルユーザーインター … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.HC | コメントを受け付けていません

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

投稿日: 2024年12月30日作成者: jarxiv

要約ラージビジョン言語モデル (LVLM) は文書理解機能を大幅に向上させ、 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

TableRAG: Million-Token Table Understanding with Language Models

投稿日: 2024年12月30日作成者: jarxiv

要約言語モデル (LM) の最近の進歩により、主に表を操作および分析するプログ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR, cs.LG | コメントを受け付けていません

ChaI-TeA: A Benchmark for Evaluating Autocompletion of Interactions with LLM-based Chatbots

投稿日: 2024年12月30日作成者: jarxiv

要約 LLM の台頭により、人間とコンピューターのやり取りの一部が LLM ベー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

SoK: On the Offensive Potential of AI

投稿日: 2024年12月30日作成者: jarxiv

要約私たちの社会は人工知能 (AI) の恩恵をますます受けています。残念なこ … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CY, cs.LG | コメントを受け付けていません

RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction

投稿日: 2024年12月30日作成者: jarxiv

要約拡散確率モデル (DPM) は、高忠実度画像合成の事実上のアプローチとして … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

Causal Composition Diffusion Model for Closed-loop Traffic Generation

投稿日: 2024年12月25日作成者: jarxiv

要約シミュレーションは、自動運転における安全性評価、特に複雑なインタラクティブ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Baichuan-Omni Technical Report

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization

A Review on the Integration of Artificial Intelligence and Medical Imaging in IVF Ovarian Stimulation

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

TableRAG: Million-Token Table Understanding with Language Models

ChaI-TeA: A Benchmark for Evaluating Autocompletion of Interactions with LLM-based Chatbots

SoK: On the Offensive Potential of AI

RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction

Causal Composition Diffusion Model for Closed-loop Traffic Generation

最近の投稿

最近のコメント

アーカイブ

カテゴリー