As a researcher, I used to think that beating state of the art is the most important thing in machine learning.
Now, with a bit more experience, I know that I was wrong.
This is a thread about a few other critical points that are often not considered quite enough!
Now, with a bit more experience, I know that I was wrong.
This is a thread about a few other critical points that are often not considered quite enough!

As the saying goes: garbage in, garbage out. Data is more important than the model itself!





Models are trained to be used by other developers and services.



In software engineering, the faster you can push changes to production, the faster you can move. This is true machine learning for as well.



Testing is not a one and done thing. Performance should be monitored constantly. Occasionally, the data coming from production can slowly drift from the training data.

This is just the tip of the iceberg. There is so much beyond training a model!
What are the key questions you ask when deploying models to production?
What are the key questions you ask when deploying models to production?