dc.contributor.author | Vega Ruiz, Alfonso de la | |
dc.contributor.author | García Saiz, Diego | |
dc.contributor.author | Zorrilla Pantaleón, Marta E. | |
dc.contributor.author | Sánchez Barreiro, Pablo | |
dc.contributor.other | Universidad de Cantabria | es_ES |
dc.date.accessioned | 2025-01-31T10:21:38Z | |
dc.date.available | 2025-01-31T10:21:38Z | |
dc.date.issued | 2020-10 | |
dc.identifier.issn | 2590-1184 | |
dc.identifier.issn | 2665-9182 | |
dc.identifier.other | TIN2017-86520-C3-3-R | es_ES |
dc.identifier.uri | https://hdl.handle.net/10902/35271 | |
dc.description.abstract | Input data of a data mining algorithm must conform to a very specific tabular format. Data scientists arrange data into that format by creating long and complex scripts, where different low-level operations are performed, and which can be a time-consuming and error-prone process. To alleviate this situation, we present Lavoisier, a declarative language for data selection and formatting in a data mining context. Using Lavoisier, script size for data preparation can be reduced by ⁓40% on average, and by up to 80% in some cases. Additionally, accidental complexity present in state-of-the-art technologies is considerably
mitigated. | es_ES |
dc.description.sponsorship | This work has been funded by the Spanish Government under grant TIN2017-86520-C3-3-R. | es_ES |
dc.format.extent | 61 p. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Elsevier | es_ES |
dc.rights | © 2020. This manuscript version is made available under the CC-BY-NC-ND 4.0 license | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.source | Journal of Computer Languages, 2020, 60, 100987 | es_ES |
dc.subject.other | Data selection | es_ES |
dc.subject.other | Data formatting | es_ES |
dc.subject.other | Domain-specific languages | es_ES |
dc.subject.other | Data mining | es_ES |
dc.title | Lavoisier: a DSL for increasing the level of abstraction of data selection and formatting in data mining | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.relation.publisherVersion | https://doi.org/10.1016/j.cola.2020.100987 | es_ES |
dc.rights.accessRights | openAccess | es_ES |
dc.identifier.DOI | 10.1016/j.cola.2020.100987 | |
dc.type.version | acceptedVersion | es_ES |