Today is #TechnicalTuesday
!
Let's talk about practical technologies that you can use today.
In this thread I will tell you about *AutoML*

Let's talk about practical technologies that you can use today.
In this thread I will tell you about *AutoML*


AutoML stands for *Automated Machine Learning*.
It encompasses a bunch of technologies and paradigms to gradually automate the process of creating machine learning solutions.
AutoML is about raising the abstraction level in ML and reducing the grunt work.
It encompasses a bunch of technologies and paradigms to gradually automate the process of creating machine learning solutions.


Getting a machine learning solution to work takes a few steps:
-

-

-

-

-


Most current AutoML frameworks today focus on
, i.e., helping you select among the plethora of machine learning models which is the best for your problem.
This problem is often framed in terms of:
-
model selection
-
hyperparameter optimization

This problem is often framed in terms of:
-

-


The "manual" way of doing this is to actually try each algorithm a bunch of times in your data and collect some statistics.

- How many neurons?
- How much dropout?
- Which activation function?
- Which regularization factor?
- ...
If you combine both problems, then you realize there are literally thousands (and potentially infinite) different algorithms you can try on your data.
If you were to do this yourself, the simplest solution is something like this:
If you were to do this yourself, the simplest solution is something like this:



Let's see a couple of examples



You can basically replace standard scikit-learn code with a generic Auto-Sklearn classifier and suddenly you are evaluating thousands of models:


Instead of manually designing a neural network, you can use Auto-Keras predefined "meta-models" and it will take care of finding the best architecture:
Yeah, I know
!
And AutoML is much more than model selection and hyperparameter search. It can also include automating:
- data preprocessing
- feature engineering
- feature selection
- dataset augmentation
- model distillation
- and more...

And AutoML is much more than model selection and hyperparameter search. It can also include automating:
- data preprocessing
- feature engineering
- feature selection
- dataset augmentation
- model distillation
- and more...



There are a lot of challenges to make AutoML production-ready. Data cleaning is a major bottleneck still, far from automated. And we need to understand how these methods exacerbate data bias.
Finally, if you are feeling adventurous, you can try @auto_goal, an experimental AutoML framework that goes beyond "standard" AutoML.
Check it out in https://autogoal.github.io !

As usual, if you like this topic, reply in this thread or @ me at any time. Feel free to
like and
retweet if you think someone else could benefit from knowing this stuff.
Read this thread online at < https://apiad.net/tweetstorms/technicaltuesday-automl>



Stay curious 
-
< https://www.automl.org/automl/ >
-
< https://www.automl.org/book/ >
-
<>
-
< https://github.com/windmaple/awesome-AutoML>

-

-

-

-
