Monthly Archives: April 2025

Elucidating the Design Space of Multimodal Protein Language Models

Posted on April 16, 2025 by jarxiv

Abstract: Multimodal protein language models (PLMs), sequence and token-based … Continue reading →

Categories: cs.AI, cs.LG, q-bio.QM | Comments are closed

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

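Posted on April 16, 2025 by jarxiv

Abstract: The capacity for complex mathematical reasoning is a key benchmark for artificial intelligence. For LLMs … Continue reading →

Categories: cs.AI, cs.CL | Comments are closed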

CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image

Posted on April 16, 2025 by jarxiv

Abstract: This paper concerns articulated objects in robotic manipulation tasks, category-level … Continue reading →

Categories: cs.CV, cs.RO | Comments are closed

Leveraging multimodal explanatory annotations for video interpretation with Modality Specific Dataset

Posted on April 16, 2025 by jarxiv

Abstract: Using Mobygaze, a dataset containing human-annotated explanatory concepts, … Continue reading →

Categories: cs.CV, cs.MM | Comments are closed

Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks

Posted on April 16, 2025 by jarxiv

Abstract: Hindsight Experience Replay (HER), binary … Continue reading →

Categories: cs.CV, cs.LG, cs.RO | Comments are closed

Cryo-em images are intrinsically low dimensional

Posted on April 16, 2025 by jarxiv

Abstract: Simulation-based inference, in methods such as CryoSBI, neural network … Continue reading →

Categories: cs.CV, cs.LG, q-bio.BM, q-bio.QM, stat.ML | Comments are closed

UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis

Posted on April 16, 2025 by jarxiv

Abstract: Recent advances in large vision-language models, enhancing productivity on digital devices … Continue reading →

Categories: cs.CL, cs.CV, cs.HC | Comments are closed

Enhanced Small Target Detection via Multi-Modal Fusion and Attention Mechanisms: A YOLOv5 Approach

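Posted on April 16, 2025 by jarxiv

Abstract: With the rapid development of information technology, modern warfare increasingly relies on intelligence, and military … Continue reading →

Categories: cs.CV | Comments are closed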

Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning

Posted on April 16, 2025 by jarxiv

Abstract: Model merging, fusing single-task checkpoints into a multi-task model … Continue reading →

Categories: cs.AI, cs.CV | Comments are closed

Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution

Posted on April 16, 2025 by jarxiv

Abstract: Convolutional neural networks (CNNs) are widely used in efficient image super-resolution … Continue reading →

Categories: cs.CV | Comments are closed
