Recently I was tasked with building a system that would automatically deploy pipelines using Kubeflow and Pachyderm. This system needed data versioning so that results would remain reproducible in the future. Data versioning, or data version control (DVC), can be summed up in these diagrams. DVC has many features. […]