月別アーカイブ: 2023年6月

Measured Albedo in the Wild: Filling the Gap in Intrinsics Evaluation

投稿日: 2023年6月28日作成者: jarxiv

要約固有の画像分解と逆レンダリングは、コンピュータービジョンにおける長年の問 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

投稿日: 2023年6月28日作成者: jarxiv

要約カメラの姿勢推定はコンピュータービジョンに関する長年の問題であり、これま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties

投稿日: 2023年6月28日作成者: jarxiv

要約一般的な物理シーンの理解には、単に物体の位置を特定して認識するだけでは不十 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.RO | コメントを受け付けていません

Detector-Free Structure from Motion

投稿日: 2023年6月28日作成者: jarxiv

要約我々は、順序付けされていない画像から正確なカメラのポーズと点群を復元するた … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

投稿日: 2023年6月28日作成者: jarxiv

要約 3D セマンティックシーン補完 (SSC) は、部分的な LiDAR ま … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Towards Language-Based Modulation of Assistive Robots through Multimodal Models

投稿日: 2023年6月28日作成者: jarxiv

要約ジェリアトロニクスの分野では、人間とロボットの間の効果的かつ透過的なコミュ … 続きを読む →

カテゴリー: cs.RO | コメントを受け付けていません

Language Models are Bounded Pragmatic Speakers

投稿日: 2023年6月28日作成者: jarxiv

要約言語モデルはどのように「考える」のでしょうか? この論文は、言語モデルのさ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Data-Driven Approach for Formality-Sensitive Machine Translation: Language-Specific Handling and Synthetic Data Generation

投稿日: 2023年6月28日作成者: jarxiv

要約このペーパーでは、4 つのターゲット言語の固有の言語特性に対応する、形式依 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Kosmos-2: Grounding Multimodal Large Language Models to the World

投稿日: 2023年6月28日作成者: jarxiv

要約私たちは、マルチモーダル大規模言語モデル (MLLM) である Kosmo … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

PMaF: Deep Declarative Layers for Principal Matrix Features

投稿日: 2023年6月28日作成者: jarxiv

要約主行列特徴量 (PMaF) を学習するために、2 つの微分可能な深い宣言層 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

月別アーカイブ: 2023年6月

Measured Albedo in the Wild: Filling the Gap in Intrinsics Evaluation

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties

Detector-Free Structure from Motion

Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

Towards Language-Based Modulation of Assistive Robots through Multimodal Models

Language Models are Bounded Pragmatic Speakers

Data-Driven Approach for Formality-Sensitive Machine Translation: Language-Specific Handling and Synthetic Data Generation

Kosmos-2: Grounding Multimodal Large Language Models to the World

PMaF: Deep Declarative Layers for Principal Matrix Features

最近の投稿

最近のコメント

アーカイブ

カテゴリー