月別アーカイブ: 2025年1月

Long Story Short: Story-level Video Understanding from 20K Short Films

投稿日: 2025年1月13日作成者: jarxiv

要約視覚言語モデルの最近の開発により、ビデオの理解が大幅に進歩しました。ただ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

VLM-driven Behavior Tree for Context-aware Task Planning

投稿日: 2025年1月13日作成者: jarxiv

要約ビヘイビアツリー (BT) を生成するための大規模言語モデル (LLM) … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.RO | コメントを受け付けていません

VideoRAG: Retrieval-Augmented Generation over Video Corpus

投稿日: 2025年1月13日作成者: jarxiv

要約検索拡張生成 (RAG) は、クエリに関連する外部知識を取得し、それを生成 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

Solving nonograms using Neural Networks

投稿日: 2025年1月13日作成者: jarxiv

要約ノノグラムは、ヘッダーにある数字に従って、グリッド内のセルに色を付けるか空 … 続きを読む →

カテゴリー: cs.AI, cs.NE | コメントを受け付けていません

Gender Bias in Text-to-Video Generation Models: A case study of Sora

投稿日: 2025年1月13日作成者: jarxiv

要約テキストからビデオへの生成モデルの出現は、テキストのプロンプトから高品質の … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.CY, cs.LG | コメントを受け付けていません

EDNet: Edge-Optimized Small Target Detection in UAV Imagery — Faster Context Attention, Better Feature Fusion, and Hardware Acceleration

投稿日: 2025年1月13日作成者: jarxiv

要約低解像度、複雑な背景、ダイナミックなシーンのため、ドローン画像内の小さなタ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

投稿日: 2025年1月13日作成者: jarxiv

要約教育においては、大規模言語モデル (LLM) の人間に似たテキストを生成す … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

The New Anticipatory Governance Culture for Innovation: Regulatory Foresight, Regulatory Experimentation and Regulatory Learning

投稿日: 2025年1月13日作成者: jarxiv

要約技術革新の急速なペースに伴い、従来の政策形成と立法方法は著しく時代錯誤にな … 続きを読む →

カテゴリー: cs.AI, cs.CY | コメントを受け付けていません

Towards Backdoor Stealthiness in Model Parameter Space

投稿日: 2025年1月13日作成者: jarxiv

要約バックドアのステルス性に関する最近の研究は、主に入力空間の区別できないトリ … 続きを読む →

カテゴリー: cs.AI, cs.CR | コメントを受け付けていません

DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information

投稿日: 2025年1月13日作成者: jarxiv

要約心臓病は依然として人間の健康に対する重大な脅威です。非侵襲的診断ツールと … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

月別アーカイブ: 2025年1月

Long Story Short: Story-level Video Understanding from 20K Short Films

VLM-driven Behavior Tree for Context-aware Task Planning

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Solving nonograms using Neural Networks

Gender Bias in Text-to-Video Generation Models: A case study of Sora

EDNet: Edge-Optimized Small Target Detection in UAV Imagery — Faster Context Attention, Better Feature Fusion, and Hardware Acceleration

Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

The New Anticipatory Governance Culture for Innovation: Regulatory Foresight, Regulatory Experimentation and Regulatory Learning

Towards Backdoor Stealthiness in Model Parameter Space

DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information

最近の投稿

最近のコメント

アーカイブ

カテゴリー