Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?

要約

毒性は、初期段階の医薬品開発不全の主な原因のままです。
分子設計と特性の予測の進歩にもかかわらず、分子毒性修復のタスク – 毒性の低下を伴う構造的に有効な分子的代替を生成する – はまだ体系的に定義またはベンチマークされていません。
このギャップを埋めるために、分子毒性修復に焦点を当てた汎用マルチモーダル大手言語モデル（MLLMS）の最初のベンチマークタスクであるTomingolを導入します。
多様なメカニズムと粒度にまたがる11の主要なタスクと560の代表的な毒性分子をカバーする標準化されたデータセットを構築します。
専門家の毒物学的知識から通知された、メカニズムに対応したタスクに適した機能を備えた迅速な注釈パイプラインを設計します。
並行して、毒性エンドポイントの予測、合成アクセシビリティ、薬物の毒性、および構造的類似性を、修復の成功のためにハイスループット評価チェーンに統合する自動評価フレームワークであるToxievalを提案します。
30近くの主流の汎用MLLMSを体系的に評価し、複数のアブレーション研究を設計して、評価基準、候補の多様性、失敗の帰属などの重要な要因を分析します。
実験結果は、現在のMLLMがこのタスクで依然として重要な課題に直面しているが、毒性の理解、意味的制約の順守、構造認識分子編集における有望な能力を実証し始めることを示しています。

要約(オリジナル)

Toxicity remains a leading cause of early-stage drug development failure. Despite advances in molecular design and property prediction, the task of molecular toxicity repair – generating structurally valid molecular alternatives with reduced toxicity – has not yet been systematically defined or benchmarked. To fill this gap, we introduce ToxiMol, the first benchmark task for general-purpose Multimodal Large Language Models (MLLMs) focused on molecular toxicity repair. We construct a standardized dataset covering 11 primary tasks and 560 representative toxic molecules spanning diverse mechanisms and granularities. We design a prompt annotation pipeline with mechanism-aware and task-adaptive capabilities, informed by expert toxicological knowledge. In parallel, we propose an automated evaluation framework, ToxiEval, which integrates toxicity endpoint prediction, synthetic accessibility, drug-likeness, and structural similarity into a high-throughput evaluation chain for repair success. We systematically assess nearly 30 mainstream general-purpose MLLMs and design multiple ablation studies to analyze key factors such as evaluation criteria, candidate diversity, and failure attribution. Experimental results show that although current MLLMs still face significant challenges on this task, they begin to demonstrate promising capabilities in toxicity understanding, semantic constraint adherence, and structure-aware molecule editing.

arxiv情報

著者	Fei Lin,Ziyang Gong,Cong Wang,Yonglin Tian,Tengchao Zhang,Xue Yang,Gen Luo,Fei-Yue Wang
発行日	2025-06-18 14:00:37+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー