Monthly Archives: May 2025

Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?

Posted on May 23, 2025 by jarxiv

Abstract: Large language models (LLMs) have shown groundbreaking performance on complex logical reasoning tasks … Continue reading →

Categories: cs.AI, cs.CL | Comments Off

Guided Diffusion Sampling on Function Spaces with Applications to PDEs

Posted on May 23, 2025 by jarxiv

Abstract: A general framework for conditional sampling in PDE-based inverse problems … Continue reading →

Categories: cs.AI, cs.LG, cs.NA, math.NA, stat.ML | Comments Off

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Posted on May 23, 2025 by jarxiv

Abstract: Large language models (LLMs) are powerful but prone to hallucination because of their static knowledge … Continue reading →

Categories: cs.AI, cs.CL, cs.IR | Comments Off

Understanding Prompt Tuning and In-Context Learning via Meta-Learning

Posted on May 23, 2025 by jarxiv

Abstract: Prompting is a main way to adapt a pretrained model to target tasks … Continue reading →

Categories: cs.AI, cs.LG, stat.ML | Comments Off

InSTA: Towards Internet-Scale Training For Agents

Posted on May 23, 2025 by jarxiv

Abstract: The main approach to training web navigation agents is … Continue reading →

Categories: cs.AI, cs.LG | Comments Off

Perceptual Quality Assessment for Embodied AI

Posted on May 23, 2025 by jarxiv

Abstract: Embodied AI has developed rapidly in recent years, but it is still mainly deployed in laboratories … Continue reading →

Categories: cs.CV, cs.RO | Comments Off

Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts

Posted on May 23, 2025 by jarxiv

Abstract: Recent advances in scene-based video generation mean that systems, given structured prompts, … Continue reading →

Categories: cs.CV | Comments Off

Retrieval-Augmented Perception: High-Resolution Image Perception Meets Visual RAG

Posted on May 23, 2025 by jarxiv

Abstract: For multimodal large language models (MLLMs), high-resolution (HR) image perception is an important … Continue reading →

Categories: cs.CL, cs.CV | Comments Off

DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms

Posted on May 23, 2025 by jarxiv

Abstract: Dongba pictographs are the only pictographic script still in use in the world … Continue reading →

Categories: cs.CV | Comments Off

From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization

Posted on May 23, 2025 by jarxiv

Abstract: Foundation models (FMs) such as diffusion models and large vision-language models (LVLMs) … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG | Comments Off
