Category archive: "cs.CL"

OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics

Posted: May 28, 2025 | Author: jarxiv

Abstract: Charts in scientific, business, and communication contexts …

Categories: cs.AI, cs.CL, cs.CV

Can Large Language Models Understand Symbolic Graphics Programs?

Posted: May 28, 2025 | Author: jarxiv

Abstract: Against the backdrop of enthusiasm for large language models (LLMs), scientifically evaluating their capabilities and shortcomings …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

ID-Align: RoPE-Conscious Position Remapping for Dynamic High-Resolution Adaptation in Vision-Language Models

Posted: May 28, 2025 | Author: jarxiv

Abstract: Currently, to enhance the performance of vision-language models (VLMs), a common …

Categories: cs.CL, cs.CV

Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration

Posted: May 28, 2025 | Author: jarxiv

Abstract: Large vision-language models (LVLMs), on multimodal tasks, impressive …

Categories: cs.CL, cs.CV

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

Posted: May 28, 2025 | Author: jarxiv

Abstract: In this paper, addressing two key challenges of GUI agents, a self-improving …

Categories: cs.CL, cs.CV, cs.LG

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Posted: May 28, 2025 | Author: jarxiv

Abstract: Academic poster generation is important in scientific communication, yet …

Categories: cs.AI, cs.CL, cs.CV, cs.MA

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

Posted: May 28, 2025 | Author: jarxiv

Abstract: Vision-language models (VLMs), in understanding and reasoning about visual content, …

Categories: cs.AI, cs.CL, cs.CV

Adaptive Deep Reasoning: Triggering Deep Thinking When Needed

Posted: May 28, 2025 | Author: jarxiv

Abstract: Large language models (LLMs), in handling complex tasks through long-chain reasoning, …

Categories: cs.CL

SCIRGC: Multi-Granularity Citation Recommendation and Citation Sentence Preference Alignment

Posted: May 28, 2025 | Author: jarxiv

Abstract: In scientific research articles, because they highlight the relationship between the current study and prior work, citations …

Categories: cs.CL, cs.DL

TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent

Posted: May 28, 2025 | Author: jarxiv

Abstract: As large language models (LLMs) are integrated into sensitive workflows, concerns …

Categories: cs.CL, cs.CR
