Monthly Archives: May 2017

Automate your Machine Learning in Python – TPOT and Genetic Algorithms

Automatic Machine Learning (AML) is an approach that automates the repetitive steps in your Machine Learning (ML) problems, saving you time to focus on the parts where your expertise has higher value. Better still, it is not just a vague idea: there are practical packages that build on standard Python ML packages such as scikit-learn.

Anyone familiar with Machine Learning will most probably recall the term grid search in this context, and they will be entirely right to do so. AML is in fact an extension of grid search as applied in scikit-learn; however, instead of iterating over a predefined set of values and their combinations, it searches for good solutions across methods, features, transformations and parameter values. AML “grid search” therefore does not have to be an exhaustive search over the space of possible configurations. One great application of AML is a package called TPOT, which uses, among other things, genetic algorithms to mix the individual parameters within a configuration and arrive at a well-performing setting.
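
To make the idea of "mixing the individual parameters within a configuration" more concrete, here is a minimal sketch of a genetic search over pipeline configurations. It is not TPOT's implementation: the search space, the names (SEARCH_SPACE, toy_score, genetic_search) and the scoring function are all hypothetical, and toy_score merely stands in for a cross-validated model score so that the example runs with the Python standard library alone.

```python
import random

# Hypothetical search space: each key is one "gene" of a pipeline configuration.
SEARCH_SPACE = {
    "scaler": ["none", "standard", "minmax"],
    "model": ["logistic_regression", "random_forest", "knn"],
    "n_features": [5, 10, 20, 40],
    "regularization": [0.01, 0.1, 1.0, 10.0],
}


def toy_score(config):
    """Stand-in for a cross-validated score; NOT a real model evaluation."""
    score = {"none": 0.0, "standard": 0.2, "minmax": 0.1}[config["scaler"]]
    score += {"logistic_regression": 0.1, "random_forest": 0.3, "knn": 0.05}[config["model"]]
    score += 0.2 if config["n_features"] == 20 else 0.0
    score += 0.1 if config["regularization"] == 1.0 else 0.0
    return score


def random_config(rng):
    """Draw one random configuration from the search space."""
    return {key: rng.choice(values) for key, values in SEARCH_SPACE.items()}


def crossover(parent_a, parent_b, rng):
    """Build a child by taking each gene from one of the two parents."""
    return {key: rng.choice([parent_a[key], parent_b[key]]) for key in SEARCH_SPACE}


def mutate(config, rng, rate=0.2):
    """With probability `rate`, replace a gene with a random value."""
    return {
        key: rng.choice(SEARCH_SPACE[key]) if rng.random() < rate else value
        for key, value in config.items()
    }


def genetic_search(generations=10, population_size=12, seed=0):
    """Evolve a population of configurations and return the best one found."""
    rng = random.Random(seed)
    population = [random_config(rng) for _ in range(population_size)]
    for _ in range(generations):
        ranked = sorted(population, key=toy_score, reverse=True)
        parents = ranked[: population_size // 2]  # keep the fitter half as parents
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(population_size - 1)
        ]
        population = [ranked[0]] + children  # elitism: the best config survives
    return max(population, key=toy_score)


if __name__ == "__main__":
    best = genetic_search()
    print(best, toy_score(best))
```

An exhaustive grid over this toy space would score all 3 × 3 × 4 × 4 = 144 combinations, whereas the genetic search scores only the configurations that appear in its populations. Because the best configuration is always carried over to the next generation, the result can never be worse than the best member of the initial random population, but it is not guaranteed to be the global optimum.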

In this post I will briefly present some basics of AML and then dive into applications using the TPOT package, including its genetic algorithm approach to optimizing solutions.

Basic concepts

The basic concept is very simple: once we receive our raw data, we start with the standard ML pipeline.

We are looking for new colleagues! Behavioral data scientist & Client delivery analyst to extend our team

For our client, an international start-up company (South Africa, Great Britain, Switzerland…), we are currently looking for (1) a behavioral data scientist and (2) a client delivery analyst.

Table Mountain, Cape Town (SA)

The client you would be working for is a company that provides big corporations with employee behavioral analytics. Our team is responsible for building and maintaining their analytical platform, as well as for supporting the internal team of behavioral scientists in developing measurements.

The positions we are offering are demanding, but they come with unique advantages. Firstly, we don’t mind when or where you work as long as you deliver what you are supposed to. Secondly, you will have a huge opportunity to grow in data science and related fields, supported by our experienced team. And thirdly, you will be in direct contact with an international start-up environment.