It’s that time of year again: A bunch of us at Aqueduct are big fans of the NCAA tournament, and we couldn’t resist the opportunity to see if we could do a little bit of machine learning to predict how the games would go.
We were excited to discover that there’s a Kaggle competition for this with a long history! This made our lives much easier.
Over the last couple weeks, our team has been looking through past submissions and identified one of the better models in recent years. We’ve built an Aqueduct workflow that uses this model and the Kaggle data to make predictions for all the possible matchups in the tournament:
We’re really excited about this, so we wanted to share. We’ve built a bracket that show’s the model’s predicted likelihood for each matchup. This will update every day as the tournament progresses, so you can see who’s most likely to win the game on each day. Check it out here!
If you want to refine our pipeline, build your own model, or get a sense for how Aqueduct works, you can check out what we’ve built:
In the coming weeks, we’ll share more about the work that’s gone into this pipeline, but for now, happy tinkering! If you’re interested in learning more, join our Slack community and say hello!
*In case you were wondering, GitHub Codespaces is a tool that allows you to create a development environment with your GitHub repository directly in the cloud. Every GitHub account comes with 120 hours per-month of free Codespace usage.
We'd love to hear from you! Star us on GitHub, join our community, or start a discussion.
© 2023 Aqueduct, Inc. All rights reserved.