「cs.LG」カテゴリーアーカイブ

Do generative video models understand physical principles?

投稿日: 2025年2月28日作成者: jarxiv

要約 AIビデオ生成は革命を起こしており、品質とリアリズムが急速に進歩しています … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images

投稿日: 2025年2月28日作成者: jarxiv

要約複雑で実世界のシナリオで確実に実行できるエゴセントリック3Dハンドポーズ推 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.LG | コメントを受け付けていません

Deep Convolutional Neural Networks for Palm Fruit Maturity Classification

投稿日: 2025年2月28日作成者: jarxiv

要約パーム油の収量と品質を最大化するには、最適な成熟段階でヤシの果物を収穫する … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Dreamweaver: Learning Compositional World Models from Pixels

投稿日: 2025年2月28日作成者: jarxiv

要約人間は、世界の認識をオブジェクトと、色、形状、運動パターンなどの属性に分解 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic Analysis

投稿日: 2025年2月28日作成者: jarxiv

要約 QUIC Transport Protocolの採用の増加により、暗号化さ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.NI | コメントを受け付けていません

Deep Modeling of Non-Gaussian Aleatoric Uncertainty

投稿日: 2025年2月28日作成者: jarxiv

要約ディープラーニングは、特に不確実性分布が固定およびガウスの伝統的な仮定に適 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

HVI: A New color space for Low-light Image Enhancement

投稿日: 2025年2月28日作成者: jarxiv

要約 Low-light Image Enhancement（LLIE）は、破損 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Visual Adaptive Prompting for Compositional Zero-Shot Learning

投稿日: 2025年2月28日作成者: jarxiv

要約 Vision-Language Models（VLMS）は、視覚データとテ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription

投稿日: 2025年2月28日作成者: jarxiv

要約手書きのテキスト認識（HTR）は、特にページが共通のフォーマットとコンテキ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

A Dataset and Framework for Learning State-invariant Object Representations

投稿日: 2025年2月28日作成者: jarxiv

要約認識と検索のためにオブジェクト表現を学習するために、より一般的に使用される … 続きを読む →

カテゴリー: cs.CV, cs.IR, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Do generative video models understand physical principles?

ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images

Deep Convolutional Neural Networks for Palm Fruit Maturity Classification

Dreamweaver: Learning Compositional World Models from Pixels

Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic Analysis

Deep Modeling of Non-Gaussian Aleatoric Uncertainty

HVI: A New color space for Low-light Image Enhancement

Visual Adaptive Prompting for Compositional Zero-Shot Learning

Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription

A Dataset and Framework for Learning State-invariant Object Representations

最近の投稿

最近のコメント

アーカイブ

カテゴリー