Some resources that I’ve found really helpful to understand machine learning in production.
1. Engineering starts with infrastructure. @vtuulos gave a great overview of the relationship between data science and infrastructure at Netflix https://youtu.be/XV5VGddmP24 ">https://youtu.be/XV5VGddmP...
1. Engineering starts with infrastructure. @vtuulos gave a great overview of the relationship between data science and infrastructure at Netflix https://youtu.be/XV5VGddmP24 ">https://youtu.be/XV5VGddmP...
2. What and how to monitor ML systems in the wild. @josh_wills gave an excellent deep-dive on DevOps meets Data Science based on his experience at Google, Cloudera, and Slack https://www.infoq.com/presentations/instrumentation-observability-monitoring-ml/">https://www.infoq.com/presentat...
3. Deploying ML is easy. Deploying it reliably is hard. Daniel Papasian and @tmu analyzed post mortems of 96 ML systems outages at Google and found that most outages are even ML and more related to the distributed character of the pipeline https://www.youtube.com/watch?v=hBMHohkRgAA&ab_channel=USENIX">https://www.youtube.com/watch...
4. I can’t believe I’m sharing a VC post, but @martin_casado and @BornsteinMatt gave an interesting perspective on the economics of AI, how cloud services are reducing the margin, scaling problem due to edge cases, and the diminishing value of added data https://a16z.com/2020/02/16/the-new-business-of-ai-and-how-its-different-from-traditional-software/">https://a16z.com/2020/02/1...