Rule — Version Control — DVC: Project uses Data Version Control
Like code, data should be version controlled. However, Git cannot version control data directly, as it is not designed to handle large and/or binary files. Tracking large files directly in Git bloats the repository’s history, which must be downloaded every time the project is cloned. To properly version control data in ML projects, mllint recommends using Data Version Control (DVC)....
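
To make the point about repository bloat concrete, here is a minimal sketch that scans a project directory for files large enough to be poor candidates for direct Git tracking. The function name `find_large_files` and the 10 MB threshold are assumptions made for illustration; this is not how mllint itself implements the rule.

```python
import os
from pathlib import Path

# Hypothetical cut-off: files above this size are treated as "large".
# The 10 MB value is an assumption for illustration, not an mllint or DVC setting.
SIZE_THRESHOLD_BYTES = 10 * 1024 * 1024


def find_large_files(root, threshold=SIZE_THRESHOLD_BYTES):
    """Return sorted paths (relative to root) of files larger than threshold bytes."""
    root = Path(root)
    large = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune Git and DVC internals in place so os.walk does not descend into them.
        dirnames[:] = [d for d in dirnames if d not in {".git", ".dvc"}]
        for name in filenames:
            path = Path(dirpath) / name
            if path.is_file() and path.stat().st_size > threshold:
                large.append(path.relative_to(root).as_posix())
    return sorted(large)


if __name__ == "__main__":
    # List candidate files in the current directory, one per line.
    for candidate in find_large_files("."):
        print(candidate)
```

Files reported by a scan like this would typically be handed to DVC with `dvc add <path>`, which leaves a small `.dvc` pointer file for Git to track in place of the data itself.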