Sorry, I've got a headache...why dirty data are no laughing matter
Posted on 7th November 2017
AI is all the rage. The idea that machines can learn and become like us (heaven forbid they become like some of us) is nothing new. Data Scientists are continually beavering away to "teach" machines to recognise speech and images. In doing so, they are helping transform our world. However, like quite a few jobs, while the output is glamorous, the hard-graft involved in creating it can be quite arduous. How many times do you have to show a picture of a car before the AI can recognise all cars, from Lamborghinis to Ladas?
Their lives are not made any easier by the problems caused by dirty data. While these have been recognised for a long time, a recent worldwide survey of nearly 17,000 people by data science community site Kaggle, shows that when it comes to the key question "What do Data Scientists do at work?" the most common answer was, as shown in the graph below (which has been taken from the survey) "dirty data."
As reported by theverge.com, Kaggle CEO Anthony Goldbloom related the joke that "80 per cent of data science is cleaning the data and 20 per cent is complaining about cleaning the data." We sympathise, and not just with Data Scientists and their daily grind with their digital domestos and dustbans, but also because, as you'll see from the graphic, the second most common problem/thing they do at work is try to find more data scientists to help them share the load. As recruiters, this comes as no surprise to us.
Do take the time to click on the link above to this survey. It is full of a lot of interesting information, and you can click on the various drop-downs and see country specific information for your own part of the world, including salary rates and much else besides - of which I'll have more to say in another, brief blog.
Michael Phair, Be-IT
Posted in News, Recruitment News
.. Back to Blog