Pachyderm logo




By Pachyderm

Certified enterprise ready

Pachyderm provides the data layer that lets machine learning teams productionize and scale their ML projects.

Software version


Runs on

OpenShift 4.6+

Delivery method


With Pachyderm’s industry-leading data versioning, pipelines, and lineage, teams gain data-driven automation, petabyte scalability, and end-to-end reproducibility. Data science and AI/ML teams use Pachyderm to get their machine learning projects to market faster, lower data processing and storage costs, and more easily meet regulatory compliance requirements.

Automatic data versioning and data-driven pipelines

Automate and unify your MLOps tool chain

Automatic parallel and incremental processing

Rapidly process the largest unstructured and structured data sets

End-to-end reproducibility and immutable data lineage

Iterate quickly while still meeting audit and data governance requirements

Pricing summary

Plans starting at



Auth & Access Controls

Detailed Stats

JupyterHub Integration

Prometheus Metrics

Support & Solutions Architecture

Additional resources

Want more product information? Explore detailed information about using this product and where to find additional help.