A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews

要約

自然言語処理の効率は、機械学習モデル、特にニューラルネットワークベースのソリューションの出現により劇的に向上しました。
ただし、特に特定のドメインを考慮する場合、一部のタスクは依然として困難です。
このペーパーでは、パイプラインに統合された機械学習手法を使用して顧客レビューから洞察を抽出できるクラウドベースのシステムを紹介します。
トピックモデリングの場合、複合モデルは、自然言語処理、ベクトル埋め込みベースのキーワード抽出、およびクラスタリング用に設計されたトランスフォーマーベースのニューラルネットワークを使用します。
私たちのモデルの要素は統合され、効率的な情報抽出、抽出された情報のトピックモデリング、ユーザーニーズの要件をより適切に満たすためにさらに開発されました。
さらに、私たちのシステムは、このタスクの既存のトピックモデリングおよびキーワード抽出ソリューションよりも優れた結果を達成できます。
私たちのアプローチは検証され、ベンチマーク用に公開されているデータセットを使用して他の最先端の方法と比較されます。

要約(オリジナル)

The efficiency of natural language processing has improved dramatically with the advent of machine learning models, particularly neural network-based solutions. However, some tasks are still challenging, especially when considering specific domains. In this paper, we present a cloud-based system that can extract insights from customer reviews using machine learning methods integrated into a pipeline. For topic modeling, our composite model uses transformer-based neural networks designed for natural language processing, vector embedding-based keyword extraction, and clustering. The elements of our model have been integrated and further developed to meet better the requirements of efficient information extraction, topic modeling of the extracted information, and user needs. Furthermore, our system can achieve better results than this task’s existing topic modeling and keyword extraction solutions. Our approach is validated and compared with other state-of-the-art methods using publicly available datasets for benchmarking.

arxiv情報

著者	Robert Lakatos,Gergo Bogacsovics,Balazs Harangi,Istvan Lakatos,Attila Tiba,Janos Toth,Marianna Szabo,Andras Hajdu
発行日	2023-06-13 14:07:52+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー