pebblebed ventures

Remember: In case of emergency, panic first, THEN follow protocol.

Oxen

Git for ML datasets
Oxen logo

Founders

Oxen is a lightning-fast data version control system for structured and unstructured machine learning datasets that aims to make versioning datasets as easy as versioning code. The platform is built to track and store changes for everything from a single CSV to data repositories with millions of unstructured images, videos, audio, or text files.

Oxen's interface mirrors Git but is optimized for ML workflows. Performance benchmarks show Oxen is 40x faster than Git-LFS and 6.5x faster than simple S3 copy operations.

The platform enables collaboration across ML engineering, data science, product, and legal teams to share, review, and edit data together.

© 2025 Pebblebed · San Francisco, CA