GitHub - sematic-ai/sematic: An open-source ML pipeline development platform

The open-source Continuous Machine Learning Platform

Build ML pipelines with only Python, run on your laptop, or in the cloud.

Sematic is an open-source ML development platform. It lets ML Engineers and Data Scientists write arbitrarily complex end-to-end pipelines with simple Python and execute them on their local machine, in a cloud VM, or on a Kubernetes cluster to leverage cloud resources.

Sematic is based on learnings gathered at top self-driving car companies. It enables chaining data processing jobs (e.g. Apache Spark) with model training (e.g. PyTorch, Tensorflow), or any other arbitrary Python business logic into type-safe, traceable, reproducible end-to-end pipelines that can be monitored and visualized in a modern web dashboard.

Read our documentation and join our Discord channel.

Why Sematic

Easy onboarding – no deployment or infrastructure needed to get started, simply install Sematic locally and start exploring.
Local-to-cloud parity – run the same code on your local laptop and on your Kubernetes cluster.
End-to-end traceability – all pipeline artifacts are persisted, tracked, and visualizable in a web dashboard.
Access heterogeneous compute – customize required resources for each pipeline step to optimize your performance and cloud footprint (CPUs, memory, GPUs, Spark cluster, etc.)
Reproducibility – rerun your pipelines from the UI with guaranteed reproducibility of results

Getting Started

To get started locally, simply install Sematic in your Python environment:

$ pip install sematic

Start the local web dashboard:

$ sematic start

Run an example pipeline:

$ sematic run examples/mnist/pytorch

Create a new boilerplate project:

$ sematic new my_new_project

Or from an existing example:

Name		Name	Last commit message	Last commit date
Latest commit History 1,074 Commits
.circleci		.circleci
bazel		bazel
developer-docs		developer-docs
docker		docker
docs		docs
helm		helm
requirements		requirements
sematic		sematic
tools		tools
.bazelignore		.bazelignore
.bazelrc		.bazelrc
.flake8		.flake8
.gitbook.yaml		.gitbook.yaml
.gitignore		.gitignore
BUILD		BUILD
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
README.rst		README.rst
WORKSPACE		WORKSPACE
mypy.ini		mypy.ini
pyproject.toml		pyproject.toml
stub.py.tpl		stub.py.tpl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The open-source Continuous Machine Learning Platform

Build ML pipelines with only Python, run on your laptop, or in the cloud.

Why Sematic

Getting Started

License

sematic-ai/sematic

Folders and files

Latest commit

History

Repository files navigation

The open-source Continuous Machine Learning Platform

Build ML pipelines with only Python, run on your laptop, or in the cloud.

Why Sematic

Getting Started