Croissant: A Metadata Format for ML-Ready Datasets
Year: 2024
Type: preprint
Abstract: Data is a critical resource for Machine Learning (ML), yet working with data remains a key friction point. This paper introduces Croissant, a metadata format for datasets that simplifies how data is u...
Source: arXiv (Cornell University)
Authors: Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, Pieter Gijsbers, Joan Giner-Miguelez +16 more
Related to: 10
Subfield: Information Systems and Management
Field: Decision Sciences
Domain: Social Sciences
Sustainable Development Goal: Industry, innovation and infrastructure
Open Access status: green