jarxiv | Japanese arxiv | ページ 573

Fast Whole-Body Strain Regulation in Continuum Robots

投稿日: 2025年5月9日作成者: jarxiv

要約

多目的、マルチスケール連続ソフトロボットのリアルタイムひずみ制御に向けて到達するステップを提案します。
この問題を根本的に研究するために、ソフトロボットのプロトタイプの数学的に正確なダイナミクスによって有効なモデルベースの制御設定に自分自身を接地します。
具体化されたコンプライアンスの固有の機械的非線形性を拒否するのではなく、統合する態勢を整え、最初に元のロボットダイナミクスを別々の亜鉛剤に分離します。
第二に、結果のサブシステムのひずみダイナミクスを調整するために、安定化非線形バックステップコントローラーのセットを処方します。
第三に、その安定性を分析および確立することにより、相互接続された特異な摂動システムを研究します。
第四に、私たちの理論は、タコロボットアームの片方の腕の高速数値結果によってバックアップされています。
無限の程度のフリードムソフトロボットの全身減少したダイナミクスの大幅に短縮された時間の平衡状態へのひずみ調節を示します。
このペーパーは、具体化された知性を背景に私たちの思考を伝えます。それは、概念化、定式化、計算セットアップを通知し、無限のフリードムのソフトロボットのコントロールパフォーマンスの改善をもたらします。

要約(オリジナル)

We propose reaching steps towards the real-time strain control of multiphysics, multiscale continuum soft robots. To study this problem fundamentally, we ground ourselves in a model-based control setting enabled by mathematically precise dynamics of a soft robot prototype. Poised to integrate, rather than reject, inherent mechanical nonlinearities for embodied compliance, we first separate the original robot dynamics into separate subdynamics — aided by a perturbing time-scale separation parameter. Second, we prescribe a set of stabilizing nonlinear backstepping controllers for regulating the resulting subsystems’ strain dynamics. Third, we study the interconnected singularly perturbed system by analyzing and establishing its stability. Fourth, our theories are backed up by fast numerical results on a single arm of the Octopus robot arm. We demonstrate strain regulation to equilibrium, in a significantly reduced time, of the whole-body reduced-order dynamics of an infinite degrees-of-freedom soft robot. This paper communicates our thinking within the backdrop of embodied intelligence: it informs our conceptualization, formulation, computational setup, and yields improved control performance for infinite degrees-of-freedom soft robots.

arxiv情報

著者	Lekan Molu
発行日	2025-05-08 15:54:41+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

An End-to-End Framework for Optimizing Foot Trajectory and Force in Dry Adhesion Legged Wall-Climbing Robots

投稿日: 2025年5月9日作成者: jarxiv

要約

乾燥接着の脚の脚の軌跡計画は、足の剥離、スイング、および接着の段階が安定した登山に不可欠な接着力と剥離力に大きく影響するため、課題を提示します。
これに取り組むために、エンドツーエンドの足の軌跡と力最適化フレームワーク（FTFOF）が提案されており、軌道調整により足の接着力と剥離力を最適化します。
このフレームワークは、一般的な足の軌跡の制約とユーザー定義のパラメーターを入力として受け入れ、最終的には最適な単一足の軌跡を生成します。
さまざまな足構造に合わせて調整された3セグメント$ c^2 $連続ベジエ曲線を統合し、効果的な登山軌道の生成を可能にします。
拡張ベースのGRU予測モデルは、足の軌跡と対応する足の力との関係を確立します。
冗長性階層戦略と組み合わせた多目的最適化アルゴリズムは、特定のタスクに最も適した足の軌跡を識別し、それにより、剥離力、接着力、振動振幅全体の最適なパフォーマンスを確保します。
四足クライミングロボットMST-M3Fの実験的検証は、既存の脚のクライミングロボットで一般的に使用される軌跡と比較して、提案されたフレームワークが最大剥離力の削減を28 \％、振動振幅を82 \％で達成することを示しました。

要約(オリジナル)

Foot trajectory planning for dry adhesion legged climbing robots presents challenges, as the phases of foot detachment, swing, and adhesion significantly influence the adhesion and detachment forces essential for stable climbing. To tackle this, an end-to-end foot trajectory and force optimization framework (FTFOF) is proposed, which optimizes foot adhesion and detachment forces through trajectory adjustments. This framework accepts general foot trajectory constraints and user-defined parameters as input, ultimately producing an optimal single foot trajectory. It integrates three-segment $C^2$ continuous Bezier curves, tailored to various foot structures, enabling the generation of effective climbing trajectories. A dilate-based GRU predictive model establishes the relationship between foot trajectories and the corresponding foot forces. Multi-objective optimization algorithms, combined with a redundancy hierarchical strategy, identify the most suitable foot trajectory for specific tasks, thereby ensuring optimal performance across detachment force, adhesion force and vibration amplitude. Experimental validation on the quadruped climbing robot MST-M3F showed that, compared to commonly used trajectories in existing legged climbing robots, the proposed framework achieved reductions in maximum detachment force by 28 \%, vibration amplitude by 82 \%, which ensures the stable climbing of dry adhesion legged climbing robots.

arxiv情報

著者	Jichun Xiao,Jiawei Nie,Lina Hao,Zhi Li
発行日	2025-05-08 17:06:08+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

Efficient Estimation of Relaxed Model Parameters for Robust UAV Trajectory Optimization

投稿日: 2025年5月9日作成者: jarxiv

要約

オンラインの軌道最適化と最適な制御方法は、農業、環境監視、輸送などの持続可能な無人航空機（UAV）サービスを可能にするために重要です。
ただし、最適なコントローラーは、モデルの不一致に非常に敏感です。これは、装備された機器、配信されるパッケージ、または基本的な構造およびスラスト関連のパラメーターの既存のばらつきが原因で発生する可能性があります。
この問題を回避するために、最適なコントローラーをパラメーター推定器とペアにして、軌道計画のパフォーマンスを改善し、適応制御を実行できます。
ただし、UAVプラットフォームは、オンボード処理能力の点で制限されており、多くの場合、非線形パラメーターの推定が考慮するのに計算的に費用がかかりすぎます。
これらの問題に対処するために、効率的な最適なパラメーター推定器とともに、リラックスした、アフラメーターマルチローターモデルを提案します。
非線形ダイナミクスでのアフィナインパラメーター緩和を介して、公称移動ホライズンパラメーター推定（MHPE）問題を線形二次形態（LQ-MHPE）に凸に導き、リアルタイムで適応モデル予測コントロール（MPC）を促進する高速二次プログラム（QPS）を実現します。
このアプローチをモンテカルロシミュレーションの同等の非線形推定器と比較し、それぞれ平均解決時間と軌道最適性コストの減少を示しています。

要約(オリジナル)

Online trajectory optimization and optimal control methods are crucial for enabling sustainable unmanned aerial vehicle (UAV) services, such as agriculture, environmental monitoring, and transportation, where available actuation and energy are limited. However, optimal controllers are highly sensitive to model mismatch, which can occur due to loaded equipment, packages to be delivered, or pre-existing variability in fundamental structural and thrust-related parameters. To circumvent this problem, optimal controllers can be paired with parameter estimators to improve their trajectory planning performance and perform adaptive control. However, UAV platforms are limited in terms of onboard processing power, oftentimes making nonlinear parameter estimation too computationally expensive to consider. To address these issues, we propose a relaxed, affine-in-parameters multirotor model along with an efficient optimal parameter estimator. We convexify the nominal Moving Horizon Parameter Estimation (MHPE) problem into a linear-quadratic form (LQ-MHPE) via an affine-in-parameter relaxation on the nonlinear dynamics, resulting in fast quadratic programs (QPs) that facilitate adaptive Model Predictve Control (MPC) in real time. We compare this approach to the equivalent nonlinear estimator in Monte Carlo simulations, demonstrating a decrease in average solve time and trajectory optimality cost by 98.2% and 23.9-56.2%, respectively.

arxiv情報

著者	Derek Fan,David A. Copp
発行日	2025-05-08 17:28:26+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: (Primary), cs.RO, cs.SY, eess.SY, math.OC | コメントを受け付けていません

Physics-Learning AI Datamodel (PLAID) datasets: a collection of physics simulations for machine learning

投稿日: 2025年5月9日作成者: jarxiv

要約

機械学習ベースの代理モデルは、シミュレーション駆動型の科学的ワークフローを加速するための強力なツールとして登場しました。
ただし、それらの広範な採用は、物理ベースのシミュレーションに合わせた大規模で多様な、標準化されたデータセットの欠如によって妨げられています。
既存のイニシアチブは貴重な貢献を提供しますが、多くは特定の物理学ドメインに範囲に焦点を合わせたり、断片化されたツールに依存したり、一般化を制限する過度に単純化されたデータモデルに準拠しています。
これらの制限に対処するために、物理シミュレーションのデータセットを表現および共有するための柔軟で拡張可能なフレームワークであるPlaid（物理学習AI Datamodel）を紹介します。
Plaidは、シミュレーションデータを説明するための統一された標準を定義し、幅広い物理的ユースケース（gitlab.com/drti/plaid）にわたって複雑なデータセットを作成、読み取り、操作するライブラリが添付されています。
構造力学と計算流体のダイナミクスをカバーし、代表的な学習方法を使用してベースラインベンチマークを提供する、Plaid標準の下で6つの慎重に作成されたデータセットをリリースします。
ベンチマークツールは、顔を抱きしめて利用できるようになり、コミュニティによる直接参加と継続的な評価努力への貢献（Huggingface.co/PlaidCompetition）が可能になります。

要約(オリジナル)

Machine learning-based surrogate models have emerged as a powerful tool to accelerate simulation-driven scientific workflows. However, their widespread adoption is hindered by the lack of large-scale, diverse, and standardized datasets tailored to physics-based simulations. While existing initiatives provide valuable contributions, many are limited in scope-focusing on specific physics domains, relying on fragmented tooling, or adhering to overly simplistic datamodels that restrict generalization. To address these limitations, we introduce PLAID (Physics-Learning AI Datamodel), a flexible and extensible framework for representing and sharing datasets of physics simulations. PLAID defines a unified standard for describing simulation data and is accompanied by a library for creating, reading, and manipulating complex datasets across a wide range of physical use cases (gitlab.com/drti/plaid). We release six carefully crafted datasets under the PLAID standard, covering structural mechanics and computational fluid dynamics, and provide baseline benchmarks using representative learning methods. Benchmarking tools are made available on Hugging Face, enabling direct participation by the community and contribution to ongoing evaluation efforts (huggingface.co/PLAIDcompetitions).

arxiv情報

著者	Fabien Casenave,Xavier Roynard,Brian Staber,William Piat,Michele Alessandro Bucci,Nissrine Akkari,Abbas Kabalan,Xuan Minh Vuong Nguyen,Luca Saverio,Raphaël Carpintero Perez,Anthony Kalaydjian,Samy Fouché,Thierry Gonon,Ghassan Najjar,Emmanuel Menier,Matthieu Nastorg,Giovanni Catalani,Christian Rey
発行日	2025-05-08 12:58:22+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

GFlowNets for Active Learning Based Resource Allocation in Next Generation Wireless Networks

投稿日: 2025年5月9日作成者: jarxiv

要約

この作業では、通信、センシング、コンピューティングなど、さまざまな統合機能を備えたワイヤレスシステムの無線リソース割り当ての問題を検討します。
これらの不均一な要件に同時に対応できる適切なリソース管理手法を設計し、問題の高次元的で個別の性質を適切に拡張します。
リソースの割り当てパターンが順番に描画され、環境で評価され、環境の代理モデルを繰り返し更新するために使用される新しいアクティブな学習フレームワークを提案します。
私たちの方法は、生成フローネットワーク（Gflownet）を活用して好ましいソリューションをサンプリングします。そのようなモデルは、トレーニング報酬に比例して構成オブジェクトを生成するためにトレーニングされているため、モードの適切なカバレッジを提供します。
そのため、Gflownetは、サロゲートモデルを更新し、適切なソリューションを迅速に発見する多様で高度なリターンリソース管理デザインを生成します。
シミュレーション結果は、メソッドがベンチマークに対して20％のパフォーマンスの向上を達成するために無線リソースを割り当てることができ、買収ラウンドの数の半分未満を必要とすることを示すシミュレーション結果を提供します。

要約(オリジナル)

In this work, we consider the radio resource allocation problem in a wireless system with various integrated functionalities, such as communication, sensing and computing. We design suitable resource management techniques that can simultaneously cater to those heterogeneous requirements, and scale appropriately with the high-dimensional and discrete nature of the problem. We propose a novel active learning framework where resource allocation patterns are drawn sequentially, evaluated in the environment, and then used to iteratively update a surrogate model of the environment. Our method leverages a generative flow network (GFlowNet) to sample favorable solutions, as such models are trained to generate compositional objects proportionally to their training reward, hence providing an appropriate coverage of its modes. As such, GFlowNet generates diverse and high return resource management designs that update the surrogate model and swiftly discover suitable solutions. We provide simulation results showing that our method can allocate radio resources achieving 20% performance gains against benchmarks, while requiring less than half of the number of acquisition rounds.

arxiv情報

著者	Charbel Bou Chaaya,Mehdi Bennis
発行日	2025-05-08 13:16:40+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

A Probabilistic Approach to Learning the Degree of Equivariance in Steerable CNNs

投稿日: 2025年5月9日作成者: jarxiv

要約

操縦可能な畳み込みニューラルネットワーク（SCNNS）は、重みの等分散制約を通じて幾何学的対称性をモデル化することにより、タスクのパフォーマンスを向上させます。
しかし、未知または変化する対称性は、重量が過剰に制約され、パフォーマンスの低下につながる可能性があります。
これに対処するために、このペーパーでは、SCNNSの等量の程度を学ぶための確率的方法を紹介します。
フーリエ係数を使用して、変換グループ上の尤度分布として等量の程度をパラメーター化し、レイヤーごとのモデル化された共有等量のオプションを提供します。
これらの尤度分布は、ネットワーク全体で解釈可能な程度の等量を確保するために正規化されます。
利点には、SCNNSの柔軟なフレームワークを通じて、多くのタイプの等量ネットワークへの適用可能性と、追加の層を必要とせずに、コンパクトグループのあらゆるサブグループに関する等寛容を学習する能力が含まれます。
私たちの実験は、根本的な等量の程度を表す学習された尤度分布を備えた、混合対称性のあるデータセットでの競争力のあるパフォーマンスを明らかにしています。

要約(オリジナル)

Steerable convolutional neural networks (SCNNs) enhance task performance by modelling geometric symmetries through equivariance constraints on weights. Yet, unknown or varying symmetries can lead to overconstrained weights and decreased performance. To address this, this paper introduces a probabilistic method to learn the degree of equivariance in SCNNs. We parameterise the degree of equivariance as a likelihood distribution over the transformation group using Fourier coefficients, offering the option to model layer-wise and shared equivariance. These likelihood distributions are regularised to ensure an interpretable degree of equivariance across the network. Advantages include the applicability to many types of equivariant networks through the flexible framework of SCNNs and the ability to learn equivariance with respect to any subgroup of any compact group without requiring additional layers. Our experiments reveal competitive performance on datasets with mixed symmetries, with learnt likelihood distributions that are representative of the underlying degree of equivariance.

arxiv情報

著者	Lars Veefkind,Gabriele Cesa
発行日	2025-05-08 13:17:01+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Latte: Transfering LLMs` Latent-level Knowledge for Few-shot Tabular Learning

投稿日: 2025年5月9日作成者: jarxiv

要約

機械学習モデルが限られた量のラベル付きデータでトレーニングされている少数の球形学習は、実際の課題に対処するための費用対効果の高いアプローチを提供します。
大規模な言語モデル（LLMS）の出現は、少数のショットの表形式学習のために事前に訓練された知識を活用することに関心を集めました。
有望な結果にもかかわらず、既存のアプローチは、望ましくないレイテンシを導入するテスト時間の知識抽出に依存しているか、信頼できない機能エンジニアリングにつながるテキストレベルの知識を導入します。
これらの制限を克服するために、より一般化されたダウンストリームモデルを最適化するためにLLMS内の潜在的な事前知識を転送するトレーニング時間の知識抽出フレームワークであるLatteを提案します。
Latteは、一般的な知識誘導下流の表形式学習を可能にし、さまざまな機能値にわたる情報の加重融合を促進し、限られたラベル付きデータに過剰適合のリスクを減らします。
さらに、Latteは既存の教師なしの事前トレーニングパラダイムと互換性があり、利用可能な非標識サンプルを効果的に利用して、非常に小さなラベル付きデータセットによって課されるパフォーマンスの制限を克服します。
さまざまな少ないショットの表形式学習ベンチマークでの広範な実験は、ラテの優れた性能を示しており、このドメインで最先端のアプローチとして確立されています

要約(オリジナル)

Few-shot tabular learning, in which machine learning models are trained with a limited amount of labeled data, provides a cost-effective approach to addressing real-world challenges. The advent of Large Language Models (LLMs) has sparked interest in leveraging their pre-trained knowledge for few-shot tabular learning. Despite promising results, existing approaches either rely on test-time knowledge extraction, which introduces undesirable latency, or text-level knowledge, which leads to unreliable feature engineering. To overcome these limitations, we propose Latte, a training-time knowledge extraction framework that transfers the latent prior knowledge within LLMs to optimize a more generalized downstream model. Latte enables general knowledge-guided downstream tabular learning, facilitating the weighted fusion of information across different feature values while reducing the risk of overfitting to limited labeled data. Furthermore, Latte is compatible with existing unsupervised pre-training paradigms and effectively utilizes available unlabeled samples to overcome the performance limitations imposed by an extremely small labeled dataset. Extensive experiments on various few-shot tabular learning benchmarks demonstrate the superior performance of Latte, establishing it as a state-of-the-art approach in this domain

arxiv情報

著者	Ruxue Shi,Hengrui Gu,Hangting Ye,Yiwei Dai,Xu Shen,Xin Wang
発行日	2025-05-08 13:32:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Enhancing Treatment Effect Estimation via Active Learning: A Counterfactual Covering Perspective

投稿日: 2025年5月9日作成者: jarxiv

要約

近年、治療効果の推定のための多数の複雑なアルゴリズムが開発されていますが、治療後の効果、たとえば高価な腫瘍イメージングまたは治療効果を評価するために必要な生検手順をラベル付けするコストが高いため、トレーニングセットが不十分にラベル付けされる場合、その有効性は限られたままです。
したがって、制約されたラベル予算を順守しながら、より高品質のラベル付きデータを積極的に組み込むことが不可欠になります。
データ効率の高い治療効果の推定を有効にするために、アクティブな学習コンテキスト内の厳密な理論分析を通じて問題を形式化します。ここで、派生した主要な測定 – \ textit {factual}および\ textit {counteractualカバーラジアス}がリスク上限を決定します。
バウンドを減らすために、理想的でバランスの取れたデータ分布の下で優れた貪欲な半径削減アルゴリズムを提案します。
より現実的なデータ分布に一般化するために、さらにFCCMを提案します。これにより、最適化目標を\ textit {factual}および\ textit {counteractualカバレッジの最大化}に変換して、データ収集中の効果的な半径削減を確保します。
さらに、FCCMを他のベースラインに対してベンチマークすることは、完全な合成データセットと半合成データセットの両方でその優位性を示しています。

要約(オリジナル)

Although numerous complex algorithms for treatment effect estimation have been developed in recent years, their effectiveness remains limited when handling insufficiently labeled training sets due to the high cost of labeling the effect after treatment, e.g., expensive tumor imaging or biopsy procedures needed to evaluate treatment effects. Therefore, it becomes essential to actively incorporate more high-quality labeled data, all while adhering to a constrained labeling budget. To enable data-efficient treatment effect estimation, we formalize the problem through rigorous theoretical analysis within the active learning context, where the derived key measures — \textit{factual} and \textit{counterfactual covering radius} determine the risk upper bound. To reduce the bound, we propose a greedy radius reduction algorithm, which excels under an idealized, balanced data distribution. To generalize to more realistic data distributions, we further propose FCCM, which transforms the optimization objective into the \textit{Factual} and \textit{Counterfactual Coverage Maximization} to ensure effective radius reduction during data acquisition. Furthermore, benchmarking FCCM against other baselines demonstrates its superiority across both fully synthetic and semi-synthetic datasets.

arxiv情報

著者	Hechuan Wen,Tong Chen,Mingming Gong,Li Kheng Chai,Shazia Sadiq,Hongzhi Yin
発行日	2025-05-08 13:42:00+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Comparing Hyper-optimized Machine Learning Models for Predicting Efficiency Degradation in Organic Solar Cells

投稿日: 2025年5月9日作成者: jarxiv

要約

この作業では、多層構造ITO/PEDOT：PSS/P3HT：PCBM/ALを備えたポリマー有機太陽電池（OSC）の電力変換効率（PCE）によって苦しむ時間的分解を表すために、最適な機械学習（ML）モデルのセットを提示します。
その目的のために、996エントリを備えたデータベースを生成しました。これには、180日以上の製造プロセスと環境条件の両方に関する最大7つの変数が含まれています。
次に、単純なコマンドラインインターフェイスによってデータベースに対して順次実行される自動化されたMLプロトコルの集合体をまとめるソフトウェアフレームワークに依存しました。
これにより、最適なモデルが得られるように、網羅的なベンチマークを介してMLモデルの種子を過度に最適化およびランダム化することができます。
達成された精度は、係数測定（R2）の値に広く0.90を超えていますが、ルート平均二乗誤差（RMSE）、2乗誤差（SSE）、および平均絶対誤差（MAE）> 1％のPCEであるPCE。
さらに、データベースで見られなかったOSCの動作をスクリーニングできる検証済みのモデルに貢献します。
その場合、R2〜0.96-0.97およびRMSE〜1％であるため、予測する提案の信頼性が確認されています。
比較目的のために、非線形平均二乗（LMS）に基づく古典的なベイジアン回帰フィッティングも提示されます。これは、単一OSCの単変量ケースに対してのみ十分に実行されます。
したがって、MLモデルによって示された機能の幅を上回ることができません。
最後に、MLフレームワークによって提供される標準化された結果のおかげで、データセットの変数とOSCの最適なパフォーマンスと安定性に対するその意味との間の依存関係を研究します。
再現性は、GitHubで公開されているデータセットを使用して、標準化されたレポートによって完全に保証されます。

要約(オリジナル)

This work presents a set of optimal machine learning (ML) models to represent the temporal degradation suffered by the power conversion efficiency (PCE) of polymeric organic solar cells (OSCs) with a multilayer structure ITO/PEDOT:PSS/P3HT:PCBM/Al. To that aim, we generated a database with 996 entries, which includes up to 7 variables regarding both the manufacturing process and environmental conditions for more than 180 days. Then, we relied on a software framework that brings together a conglomeration of automated ML protocols that execute sequentially against our database by simply command-line interface. This easily permits hyper-optimizing and randomizing seeds of the ML models through exhaustive benchmarking so that optimal models are obtained. The accuracy achieved reaches values of the coefficient determination (R2) widely exceeding 0.90, whereas the root mean squared error (RMSE), sum of squared error (SSE), and mean absolute error (MAE)>1% of the target value, the PCE. Additionally, we contribute with validated models able to screen the behavior of OSCs never seen in the database. In that case, R2~0.96-0.97 and RMSE~1%, thus confirming the reliability of the proposal to predict. For comparative purposes, classical Bayesian regression fitting based on non-linear mean squares (LMS) are also presented, which only perform sufficiently for univariate cases of single OSCs. Hence they fail to outperform the breadth of the capabilities shown by the ML models. Finally, thanks to the standardized results offered by the ML framework, we study the dependencies between the variables of the dataset and their implications for the optimal performance and stability of the OSCs. Reproducibility is ensured by a standardized report altogether with the dataset, which are publicly available at Github.

arxiv情報

著者	David Valiente,Fernando Rodríguez-Mas,Juan V. Alegre-Requena,David Dalmau,María Flores,Juan C. Ferrer
発行日	2025-05-08 13:47:26+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Linear combinations of latents in generative models: subspaces and beyond

投稿日: 2025年5月9日作成者: jarxiv

要約

生成モデルからのサンプリングは、データ統合や増強などのアプリケーションにとって重要なツールになりました。
拡散、フローマッチング、および連続的な正規化フローは、さまざまなモダリティにわたって有効性を示し、生成の潜在変数に依存しています。
生成プロセスをより制御する必要がある実験設計または創造的なアプリケーションの場合、潜在変数を直接操作することが一般的になりました。
ただし、このような操作を実行するための既存のアプローチ（例：補間や低次元表現の形成など）は、特別な場合にのみうまく機能するか、ネットワークまたはデータモダリティ固有です。
生成モデルの仮定に準拠する潜在変数の線形組み合わせを形成する汎用方法として、潜在変数（LOL）の線形結合を提案します。
LOLは実装が簡単で、線形の組み合わせを形成するというより広いタスクに自然に対処しているためです。
潜在空間の部分空間の構築により、LOLは高次元オブジェクトの表現力豊かな低次元表現の作成を劇的に簡素化します。

要約(オリジナル)

Sampling from generative models has become a crucial tool for applications like data synthesis and augmentation. Diffusion, Flow Matching and Continuous Normalizing Flows have shown effectiveness across various modalities, and rely on latent variables for generation. For experimental design or creative applications that require more control over the generation process, it has become common to manipulate the latent variable directly. However, existing approaches for performing such manipulations (e.g. interpolation or forming low-dimensional representations) only work well in special cases or are network or data-modality specific. We propose Linear combinations of Latent variables (LOL) as a general-purpose method to form linear combinations of latent variables that adhere to the assumptions of the generative model. As LOL is easy to implement and naturally addresses the broader task of forming any linear combinations, e.g. the construction of subspaces of the latent space, LOL dramatically simplifies the creation of expressive low-dimensional representations of high-dimensional objects.

arxiv情報

著者	Erik Bodin,Alexandru Stere,Dragos D. Margineantu,Carl Henrik Ek,Henry Moss
発行日	2025-05-08 14:04:05+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント