Data Version Control (software)

DVC
Original author(s): Dmitry Petrov
Developer(s): Iterative.ai
Initial release: May 4, 2017
Stable release: 2.30.0 / October 10, 2022
Repository: https://github.com/iterative/dvc
Written in: Python
Type: Machine Learning CLI
License: Apache License 2.0
Website: dvc.org

DVC is a free and open-source, platform-agnostic version control system for data, machine learning models, and experiments.[1] It is designed to make ML models shareable and experiments reproducible,[2] and to track versions of models, data, and pipelines.[3][4][5] DVC works on top of Git repositories[6] and cloud storage.[7]

The first (beta) version, DVC 0.6, was launched in May 2017.[8] In May 2020, Iterative.ai publicly released DVC 1.0.[9]

  1. Hewage, Nipuni; Meedeniya, Dulani (February 2022). "Machine Learning Operations: A Survey on MLOps Tool Support". ResearchGate. arXiv:2202.10169.
  2. Barrak, Amine; Eghan, Ellis E.; Adams, Bram (March 2021). "On the Co-evolution of ML Pipelines and Source Code - Empirical Study of DVC Projects". IEEE Xplore. Archived from the original on 2022-10-05. Retrieved 2022-10-05.
  3. Ivancic, Kristijan. "Data Version Control With Python and DVC". Real Python. Archived from the original on 2022-10-05. Retrieved 2022-10-05.
  4. Wiggers, Kyle. "MLOps startup Iterative.ai nabs $20M". VentureBeat. Archived from the original on 2022-10-05. Retrieved 2022-10-05.
  5. "MLOps Company Iterative Achieves Significant Customer and Company Growth in 2021". Business Wire. Archived from the original on 2022-10-05. Retrieved 2022-10-05.
  6. Hall, Susan (4 February 2021). "Iterative.ai: Git-Based Machine Learning Tools for ML Engineers". The New Stack. Archived from the original on 2022-10-05. Retrieved 2022-10-05.
  7. "What is DVC?". MLOps Guide. Archived from the original on 2022-10-05. Retrieved 2022-10-05.
  8. Petrov, Dmitry. "DVC 3 Years and 1.0 Pre-release". Iterative.ai. Archived from the original on 2022-10-05. Retrieved 2022-10-05.
  9. Anadiotis, George. "Streamlining data science with open source: Data version control and continuous machine learning". ZDNET. Archived from the original on 2022-10-05. Retrieved 2022-10-05.