「cs.AI」カテゴリーアーカイブ

Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding

投稿日: 2024年10月18日作成者: jarxiv

要約この文書の目標は、音声品質の犠牲を最小限に抑えながら、コーデックベースの音 … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

投稿日: 2024年10月18日作成者: jarxiv

要約大規模言語モデル (LLM) の最近の進歩により、長いコンテキストを処理で … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Towards Multilingual LLM Evaluation for European Languages

投稿日: 2024年10月18日作成者: jarxiv

要約大規模言語モデル (LLM) の台頭により、多数の言語やタスクにわたって自 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Influence Functions for Scalable Data Attribution in Diffusion Models

投稿日: 2024年10月18日作成者: jarxiv

要約拡散モデルは生成モデリングに大きな進歩をもたらしました。しかし、それらが … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs

投稿日: 2024年10月18日作成者: jarxiv

要約 Transformer ベースの大規模言語モデル (LLM) はさまざまな … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, stat.ML | コメントを受け付けていません

H2OVL-Mississippi Vision Language Models Technical Report

投稿日: 2024年10月18日作成者: jarxiv

要約小型ビジョン言語モデル (VLM) は、企業の商業文書や画像を処理するため … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring

投稿日: 2024年10月18日作成者: jarxiv

要約この研究では、時空間物体検出モデルの開発を通じて、UAV カメラを使用した … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

投稿日: 2024年10月18日作成者: jarxiv

要約低品質または希少なデータは、実際にディープニューラルネットワークをトレ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Comprehensive Performance Evaluation of YOLO11, YOLOv10, YOLOv9 and YOLOv8 on Detecting and Counting Fruitlet in Complex Orchard Environments

投稿日: 2024年10月18日作成者: jarxiv

要約この研究では、商業果樹園における緑色の果物の検出のために、YOLOv8、Y … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

EchoApex: A General-Purpose Vision Foundation Model for Echocardiography

投稿日: 2024年10月18日作成者: jarxiv

要約心エコー検査の定量的評価は、心臓の状態を正確に評価し、病気の進行を監視し、 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding

SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

Towards Multilingual LLM Evaluation for European Languages

Influence Functions for Scalable Data Attribution in Diffusion Models

How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs

H2OVL-Mississippi Vision Language Models Technical Report

Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring

Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

Comprehensive Performance Evaluation of YOLO11, YOLOv10, YOLOv9 and YOLOv8 on Detecting and Counting Fruitlet in Complex Orchard Environments

EchoApex: A General-Purpose Vision Foundation Model for Echocardiography

最近の投稿

最近のコメント

アーカイブ

カテゴリー