With Aqueduct, you build model training pipelines in minutes rather than weeks or months. Aqueduct comes prepackaged with connectors to most common data systems, so you can pull data into your pipelines, clean and featurize it, and train models easily.
Your pipelines can be run on any cloud infrastructure you use — Kubernetes, Spark, Airflow, etc. — and you can validate the quality of your models on every run with metrics & checks.
Define your ML tasks in vanilla Python — no more YAML configs, Dockerfiles, or DSLs to worry about.
Aqueduct workflows can run on any cloud infrastructure you already use — choose from Kubernetes, Spark, Airflow, Lambda, or Databricks.
Integrate powerful LLMs into your pipelines with a single API call — all without any operational overhead.
Regardless of where your code is running, Aqueduct captures and validates the code and data at every stage, so you know what ran and when it ran.
Every function run has error logs and stack traces, so you can pinpoint errors quickly.
Aqueduct is fully open-source, so you can be sure your code and data is always where it's supposed to be.
Machine Learning Engineer
Aqueduct gives me a comprehensive view of the data flow in my ML pipelines. Today, this context is scattered across a notebook and a couple Miro boards, but these pipelines change so fast that it's hard to keep track of them. To see all of my pipelines end-to-end and to see everything light up green is going to give me the confidence that I need to know everything's working and how well it's working.
Director of Data Science, Sparks & Honey
Aqueduct makes it easy to add a couple decorators to your codebase and automatically capture metrics, track them over time, and enforce constraints on those measurements over time. I don't have to think about where or how I track these things because Aqueduct does it for me.
Lead Engineer, Replate
Our previous infrastructure was built by data scientists and engineers with little knowledge of each others' best practices. It worked but wasn't ideal for us. Aqueduct streamlines production data science by providing a simple Pythonic API that makes it easy to get models into production. We can focus on delivering better models rather than maintaining cloud infrastructure.
© 2023 Aqueduct, Inc. All rights reserved.