「cs.AI」カテゴリーアーカイブ

Title block detection and information extraction for enhanced building drawings search

投稿日: 2025年4月14日作成者: jarxiv

要約建築、エンジニアリング、および建設（AEC）業界は、建物の建設、メンテナン … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Fine-Grained Retrieval-Augmented Generation for Visual Question Answering

投稿日: 2025年4月14日作成者: jarxiv

要約視覚的な質問回答（VQA）は、画像からの情報を利用することにより、自然言語 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

投稿日: 2025年4月14日作成者: jarxiv

要約このテクニカルレポートは、ビデオジェネレーションファンデーションモデルをト … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images

投稿日: 2025年4月14日作成者: jarxiv

要約マルチモーダルLLMS（MLLM）を使用してシステムを提示して、時間的変化 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.CY | コメントを受け付けていません

Steering CLIP’s vision transformer with sparse autoencoders

投稿日: 2025年4月14日作成者: jarxiv

要約ビジョンモデルは非常に有能ですが、内部メカニズムはよく理解されていません。 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

投稿日: 2025年4月14日作成者: jarxiv

要約 1,350億パラメーターとAscend Neural Processing … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations

投稿日: 2025年4月14日作成者: jarxiv

要約 Visual Grounding（VG）は、自然言語の説明に基づいて画像に … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

CORTEX-AVD: A Framework for CORner Case Testing and EXploration in Autonomous Vehicle Development

投稿日: 2025年4月11日作成者: jarxiv

要約自律車（AVS）は、人為的エラーを減らすことにより、交通の安全性と効率を改 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke

投稿日: 2025年4月11日作成者: jarxiv

要約脳卒中患者の手矯正の意図は、データ収集の難しさのために困難です。さらに、 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Learning-Based Approximate Nonlinear Model Predictive Control Motion Cueing

投稿日: 2025年4月11日作成者: jarxiv

要約モーションキューイングアルゴリズム（MCAS）は、シミュレートされた車両の … 続きを読む →

カテゴリー: cs.AI, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Title block detection and information extraction for enhanced building drawings search

Fine-Grained Retrieval-Augmented Generation for Visual Question Answering

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images

Steering CLIP’s vision transformer with sparse autoencoders

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations

CORTEX-AVD: A Framework for CORner Case Testing and EXploration in Autonomous Vehicle Development

ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke

Learning-Based Approximate Nonlinear Model Predictive Control Motion Cueing

最近の投稿

最近のコメント

アーカイブ

カテゴリー