Category archive: cs.CV

Nearest Neighbor Classification for Classical Image Upsampling

Posted: August 16, 2024 | Author: jarxiv

Abstract: Given a sequence of pixel data ordered in the form of an image, our … Continue reading →

Categories: cs.CV, cs.GR | Comments are closed

SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training

Posted: August 16, 2024 | Author: jarxiv

Abstract: In recent years, in place of the traditional focus on training from scratch, pre-training … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments are closed

Towards Flexible Visual Relationship Segmentation

Posted: August 16, 2024 | Author: jarxiv

Abstract: Visual relationship understanding, in human-object interaction (HOI) detection, scene … Continue reading →

Categories: cs.CV | Comments are closed

Understanding the Local Geometry of Generative Model Manifolds

Posted: August 16, 2024 | Author: jarxiv

Abstract: Deep generative models, using a finite number of samples during training, complex data … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed

Can Large Language Models Understand Symbolic Graphics Programs?

Posted: August 16, 2024 | Author: jarxiv

Abstract: Evaluating the capabilities of large language models (LLMs) is often difficult. … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG | Comments are closed

A Spitting Image: Modular Superpixel Tokenization in Vision Transformers

Posted: August 16, 2024 | Author: jarxiv

Abstract: Vision Transformer (ViT) architectures have traditionally … Continue reading →

Categories: 68T45, cs.AI, cs.CV, cs.LG, I.2.10 | Comments are closed

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Posted: August 16, 2024 | Author: jarxiv

Abstract: In this paper, to create strong, interpretable segmentation models … Continue reading →

Categories: cs.CL, cs.CV, cs.LG | Comments are closed

MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark

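Posted: August 16, 2024 | Author: jarxiv

Abstract: With the development of multimodal large language models (MLLMs), concerning mathematical problems … Continue reading →

Categories: cs.CL, cs.CV | Comments are closed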

MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation

Posted: August 16, 2024 | Author: jarxiv

Abstract: Beyond the Transformer, the performance of the Transformer … Continue reading →

Categories: cs.AI, cs.CV | Comments are closed

DiffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution

Posted: August 16, 2024 | Author: jarxiv

Abstract: A pioneering framework for reconstructing real-world stereo images, Di … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed
