「cs.AI」カテゴリーアーカイブ

End-To-End Underwater Video Enhancement: Dataset and Model

投稿日: 2024年3月19日作成者: jarxiv

要約水中ビデオ強化 (UVE) は、水中ビデオの視認性とフレーム品質を向上させ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

LeTO: Learning Constrained Visuomotor Policy with Differentiable Trajectory Optimization

投稿日: 2024年3月19日作成者: jarxiv

要約この論文では、微分可能軌道最適化を介して制約付き視覚運動ポリシーを学習する … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Effectiveness Assessment of Recent Large Vision-Language Models

投稿日: 2024年3月19日作成者: jarxiv

要約大規模ビジョン言語モデル (LVLM) の出現は、汎用人工知能の追求に向け … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System

投稿日: 2024年3月19日作成者: jarxiv

要約自動光学検査 (AOI) は製造プロセスにおいて極めて重要な役割を果たして … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Global $\mathcal{L}^2$ minimization at uniform exponential rate via geometrically adapted gradient descent in Deep Learning

投稿日: 2024年3月19日作成者: jarxiv

要約深層学習ネットワークで $\mathcal{L}^2$ コスト関数の最小化 … 続きを読む →

カテゴリー: 57R70, 62M45, cs.AI, cs.LG, math-ph, math.MP, math.OC, stat.ML | コメントを受け付けていません

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

投稿日: 2024年3月19日作成者: jarxiv

要約書いたり話したりするとき、人は時々立ち止まって考えることがあります。推論 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

LLM^3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning

投稿日: 2024年3月19日作成者: jarxiv

要約従来のタスクおよびモーションプランニング (TAMP) アプローチは、シ … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Reinforcement Learning with Token-level Feedback for Controllable Text Generation

投稿日: 2024年3月19日作成者: jarxiv

要約実際のアプリケーションの要件を満たすには、大規模言語モデル (LLM) の … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

投稿日: 2024年3月19日作成者: jarxiv

要約ラージカーネル畳み込みニューラルネットワーク (ConvNets) は最 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

ProMISe: Promptable Medical Image Segmentation using SAM

投稿日: 2024年3月19日作成者: jarxiv

要約 Segment Anything Model (SAM) の提案により、医 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

End-To-End Underwater Video Enhancement: Dataset and Model

LeTO: Learning Constrained Visuomotor Policy with Differentiable Trajectory Optimization

Effectiveness Assessment of Recent Large Vision-Language Models

OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System

Global $\mathcal{L}^2$ minimization at uniform exponential rate via geometrically adapted gradient descent in Deep Learning

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

LLM^3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning

Reinforcement Learning with Token-level Feedback for Controllable Text Generation

UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

ProMISe: Promptable Medical Image Segmentation using SAM

最近の投稿

最近のコメント

アーカイブ

カテゴリー