I asked data scientists what challenges they were facing in 2020 and the resounding answer was how difficult it was to deploy models to production. I thought writing 3-4 blog posts would solve it.

I& #39;m 8 posts in and still so much to say. Here& #39;s a breakdown of each post https://abs.twimg.com/emoji/v2/... draggable="false" alt="👇" title="Down pointing backhand index" aria-label="Emoji: Down pointing backhand index">
Part 4 - But when you need predictions in real time, you need online inference. There are many gotchas in online inference: you need to query data from multiple sources in real time, you& #39;ll need A/B testing, you need rollout strategies... https://mlinproduction.com/the-challenges-of-online-inference-deployment-series-04/">https://mlinproduction.com/the-chall...
Part 5 - If after learning about those challenges you decide you still need online inference, bless your heart. There are a lot of posts on Flask APIs, but that& #39;s the easiest part. You need versioning, autoscaling, and the ability to A/B test models. https://mlinproduction.com/online-inference-for-ml-deployment-deployment-series-05/">https://mlinproduction.com/online-in...
Part 8 - Just because a model passes its unit tests, doesn& #39;t mean it will move the product metrics. The only way to establish causality is through online validation. Like any other feature, models need to be A/B tested https://mlinproduction.com/ab-test-ml-models-deployment-series-08/">https://mlinproduction.com/ab-test-m...
I& #39;m planning on writing a few more posts. Next one will be on rollout strategies (dark mode vs canary vs blue green). But if there& #39;s something you think I missed, shout it out!
You can follow @MLinProduction.
Tip: mention @twtextapp on a Twitter thread with the keyword “unroll” to get a link to it.

Latest Threads Unrolled: