"cs.AI" Category Archive

Safe Explicable Planning

Posted: April 1, 2024 | Author: jarxiv

Abstract: Human expectations arise from their understanding of others and the world. Human-AI …

Categories: cs.AI, cs.RO

Gecko: Versatile Text Embeddings Distilled from Large Language Models

Posted: April 1, 2024 | Author: jarxiv

Abstract: We introduce Gecko, a compact and versatile text embedding model …

Categories: cs.AI, cs.CL

ReALM: Reference Resolution As Language Modeling

Posted: April 1, 2024 | Author: jarxiv

Abstract: Reference resolution is an important problem: understanding various kinds of context and appropriately …

Categories: cs.AI, cs.CL, cs.LG

HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM

Posted: April 1, 2024 | Author: jarxiv

Abstract: Wearable sensor human activity recognition (HAR), within activity sensing, …

Categories: cs.AI, cs.CV

Rapid Motor Adaptation for Robotic Manipulator Arms

Posted: April 1, 2024 | Author: jarxiv

Abstract: Developing generalizable manipulation skills is, in embodied AI, a central …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

GlitchBench: Can large multimodal models detect video game glitches?

Posted: April 1, 2024 | Author: jarxiv

Abstract: Large multimodal models (LMMs), with multiple input modalities such as visual input, …

Categories: cs.AI, cs.CL, cs.CV

SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects

Posted: April 1, 2024 | Author: jarxiv

Abstract: Monocular 3D detectors achieve excellent performance on cars and small objects …

Categories: cs.AI, cs.CV

SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks

Posted: April 1, 2024 | Author: jarxiv

Abstract: Improving the efficiency of state-of-the-art methods in semantic segmentation …

Categories: cs.AI, cs.CV

MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning

Posted: April 1, 2024 | Author: jarxiv

Abstract: Models pre-trained on large-scale datasets, for a variety of downstream tasks, …

Categories: cs.AI, cs.CV, cs.LG

Language Model Beats Diffusion — Tokenizer is Key to Visual Generation

Posted: April 1, 2024 | Author: jarxiv

Abstract: Large language models (LLMs) are the dominant models for generative tasks in language, but …

Categories: cs.AI, cs.CV, cs.MM
