← All projects

lakeFS

The Control Plane for AI-Ready Data

Data & Analyticsdata-version-controlobject-storagemlopsdata-engineeringopen-sourceai-infrastructuredata-governance
lakeFS screenshot

About

lakeFS is an open-source data version control platform that brings Git-like branching, versioning, and repository semantics to object storage systems. It helps AI and data engineering teams manage data lifecycle, provenance, and access across distributed infrastructure. Teams can test pipeline changes in isolation, ensure reproducibility of training runs, and maintain compliance and governance across data workloads.

Problem

Data teams lack version control, reproducibility, and governance tooling for large-scale object storage and data lakes.

For

AI and data engineering teams at enterprises

How it works

lakeFS wraps existing object storage (S3-compatible) with Git-like branching and versioning, enabling isolated testing, rollback, and full data lineage without copying data.

Business model

open-source

Status

launched

Company

lakeFS

Similar projects