Lingua Manga: A Generic Large Language Model Centric System for Data Curation

要約

データキュレーションは広範囲にわたる領域であり、多くの重要だが時間のかかるデータ処理タスクが含まれます。
ただし、このようなタスクは多様であるため、汎用のデータキュレーションシステムを開発することが困難になります。
この問題に対処するために、事前にトレーニングされた大規模な言語モデルを利用する、ユーザーフレンドリーで多用途なシステムである Lingua Manga を紹介します。
Lingua Manga は、柔軟かつ迅速な開発を促進しながら、高いパフォーマンスとラベル効率を達成するための自動最適化を提供します。
明確な目的とさまざまな技術熟練度レベルのユーザーを備えた 3 つのサンプルアプリケーションを通じて、Lingua Manga が熟練したプログラマーとローコードユーザー、さらにはノーコードユーザーの両方がデータキュレーションの課題に対処するのを効果的に支援できることを示します。

要約(オリジナル)

Data curation is a wide-ranging area which contains many critical but time-consuming data processing tasks. However, the diversity of such tasks makes it challenging to develop a general-purpose data curation system. To address this issue, we present Lingua Manga, a user-friendly and versatile system that utilizes pre-trained large language models. Lingua Manga offers automatic optimization for achieving high performance and label efficiency while facilitating flexible and rapid development. Through three example applications with distinct objectives and users of varying levels of technical proficiency, we demonstrate that Lingua Manga can effectively assist both skilled programmers and low-code or even no-code users in addressing data curation challenges.

arxiv情報

著者	Zui Chen,Lei Cao,Sam Madden
発行日	2023-06-20 17:30:02+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Lingua Manga: A Generic Large Language Model Centric System for Data Curation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー