"cs.CV" Category Archive

CAT: Circular-Convolutional Attention for Sub-Quadratic Transformers

Posted: April 10, 2025 | Author: jarxiv

Abstract: Transformers have driven remarkable breakthroughs in natural language processing and computer vision …

Categories: cs.CL, cs.CV, cs.LG

LostPaw: Finding Lost Pets using a Contrastive Learning-based Transformer with Visual Input

Posted: April 10, 2025 | Author: jarxiv

Abstract: Losing a pet can be highly distressing for pet owners, and …

Categories: cs.AI, cs.CV

Zero-Shot Image-Based Large Language Model Approach to Road Pavement Monitoring

Posted: April 10, 2025 | Author: jarxiv

Abstract: Effective and rapid assessment of pavement surface condition prioritizes maintenance, …

Categories: cs.AI, cs.CV

ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models

Posted: April 10, 2025 | Author: jarxiv

Abstract: In recent studies, what is called black-box prompt tuning (BBPT) …

Categories: cs.CV, cs.LG

Classifying the Unknown: In-Context Learning for Open-Vocabulary Text and Symbol Recognition

Posted: April 10, 2025 | Author: jarxiv

Abstract: A multimodal model that leverages multimodal in-context learning (MICL) …

Categories: cs.CV

Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi

Posted: April 10, 2025 | Author: jarxiv

Abstract: For convolutional neural networks (CNNs), the input progressing along the layers …

Categories: cs.CV, cs.LG

CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading

Posted: April 10, 2025 | Author: jarxiv

Abstract: In this work, we investigate text-to-texture synthesis using diffusion models, and physically …

Categories: cs.CV

EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation

Posted: April 10, 2025 | Author: jarxiv

Abstract: Zero-shot, training-free, image-based text-to-video generation …

Categories: cs.AI, cs.CV

MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking

Posted: April 10, 2025 | Author: jarxiv

Abstract: Moving object segmentation plays a crucial role in understanding dynamic visual environments …

Categories: cs.CV

GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes

Posted: April 10, 2025 | Author: jarxiv

Abstract: Robust grasping in cluttered environments remains an open challenge in robotics. …

Categories: cs.AI, cs.CV, cs.RO
