Lingua Manga : A Generic Large Language Model Centric System for Data Curation
Affiliation
University of ArizonaIssue Date
2023-08-01Keywords
General Earth and Planetary SciencesWater Science and Technology
Geography, Planning and Development
Metadata
Show full item recordCitation
Zui Chen, Lei Cao, and Sam Madden. Lingua Manga : A Generic Large Language Model Centric System for Data Curation. PVLDB, 16(12): 40744077, 2023. doi:10.14778/3611540.3611624Rights
Copyright is held by the owner/author(s). This work is licensed under the Creative Commons BY-NC-ND4.0 International License.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Data curation is a wide-ranging area which contains many critical but time-consuming data processing tasks. However, the diversity of such tasks makes it challenging to develop a general-purpose data curation system. To address this issue, we present Lingua Manga , a user-friendly and versatile system that utilizes pre-trained large language models. Lingua Manga offers automatic optimization for achieving high performance and label efficiency while facilitating flexible and rapid development. Through three example applications with distinct objectives and users of varying levels of technical proficiency, we demonstrate that Lingua Manga can effectively assist both skilled programmers and low-code or even no-code users in addressing data curation challenges.Note
Open access article.ISSN
2150-8097Version
Final published versionae974a485f413a2113503eed53cd6c53
10.14778/3611540.3611624
Scopus Count
Collections
Except where otherwise noted, this item's license is described as Copyright is held by the owner/author(s). This work is licensed under the Creative Commons BY-NC-ND4.0 International License.