← All projects

DVC

Manage data the way code is managed with a Git-like model

Dev Toolsversion-controlmachine-learningdata-sciencemlopsgitopen-sourcedata-management
DVC screenshot

About

DVC (Data Version Control) is an open-source version control system designed for data science and machine learning projects. It brings Git-like workflows to managing datasets, models, and experiments. It offers both a lightweight Git extension for individual data scientists and an enterprise-grade solution for large-scale AI and data engineering teams.

Problem

Data science teams lack proper version control for datasets, models, and experiments the way software teams have for code.

For

data scientists and AI/ML engineering teams

How it works

DVC extends Git with commands and workflows that track and version large data files, models, and experiments without storing them directly in Git repositories.

Business model

freemium

Status

launched

Similar projects