投稿者「jarxiv」のアーカイブ

Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback

投稿日: 2025年6月5日作成者: jarxiv

要約スカラー報酬のような数値フィードバックを用いた強化学習(RL)の最近の進歩 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

投稿日: 2025年6月5日作成者: jarxiv

要約既存の統一モデルは、視覚言語理解やテキストから画像への生成では高い性能を発 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

FlySearch: Exploring how vision-language models explore

投稿日: 2025年6月5日作成者: jarxiv

要約現実の世界は混乱しており、構造化されていない。重要な情報を発見するためには … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

On the class of coding optimality of human languages and the origins of Zipf’s law

投稿日: 2025年6月5日作成者: jarxiv

要約ここでは、符号化システムの最適性に関する新しいクラスを提示する。そのクラス … 続きを読む →

カテゴリー: cs.CL, physics.soc-ph | コメントを受け付けていません

Multi Layered Autonomy and AI Ecologies in Robotic Art Installations

投稿日: 2025年6月5日作成者: jarxiv

要約バオヤン・チェン（baoyangchen.com）による大規模なインスタレ … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

GL-LowPopArt: A Nearly Instance-Wise Minimax-Optimal Estimator for Generalized Low-Rank Trace Regression

投稿日: 2025年6月5日作成者: jarxiv

要約我々は、一般化された低ランクのトレース回帰のための新しいCatoniスタイ … 続きを読む →

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

投稿日: 2025年6月5日作成者: jarxiv

要約オープンベンチマークは、再現性と透明性を提供し、大規模な言語モデルの評価と … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

MobCLIP: Learning General-purpose Geospatial Representation at Scale

投稿日: 2025年6月5日作成者: jarxiv

要約地理空間上の位置の表現学習は、一般的な地理空間知能を実現する上で、依然とし … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning

投稿日: 2025年6月4日作成者: jarxiv

要約灌流-吸引プロセスは、低侵襲手術（MIS）において術野をすすぎ、清潔にする … 続きを読む →

カテゴリー: cs.RO | コメントを受け付けていません

One Policy but Many Worlds: A Scalable Unified Policy for Versatile Humanoid Locomotion

投稿日: 2025年6月4日作成者: jarxiv

要約従来の強化学習（RL）手法では、タスク固有の報酬が必要であり、訓練地形が増 … 続きを読む →

カテゴリー: cs.LG, cs.RO | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

FlySearch: Exploring how vision-language models explore

On the class of coding optimality of human languages and the origins of Zipf’s law

Multi Layered Autonomy and AI Ecologies in Robotic Art Installations

GL-LowPopArt: A Nearly Instance-Wise Minimax-Optimal Estimator for Generalized Low-Rank Trace Regression

DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

MobCLIP: Learning General-purpose Geospatial Representation at Scale

Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning

One Policy but Many Worlds: A Scalable Unified Policy for Versatile Humanoid Locomotion

最近の投稿

最近のコメント

アーカイブ

カテゴリー