Review of Data Analysis with Open Source Tools by Philipp K Janert
Whether analyzing data is part of your everyday job or one of your projects needs specific data analysis, the same questions arise: What tools are the most suited for the task? Which techniques are the most adapted? What are the numbers really saying? Are they meaningful?

If those questions are familiar to you, Data Analysis with Open Source Tools by Philipp K Janert might be of great interest. This book is perfect for hands-on readers, wanting to achieve specific goals without getting entangled in formal definitions by getting right to the point. Here is an analogy: a Common Kite is depicted on the front cover of the book and as the bird waiting for its prey, you’ll be able to analyse the situation, understand it and take a clear decision after this read.
The book starts by giving graphical representations of several types of data sets. This is particularly useful to truly understand what we are working with, notably for visual readers. It gives important landmarks for further manipulations with the data. Indeed, within each sections (18 of them), there is a subsection called Workshop where the reader gets his hands dirty on examples. The main tools used are Python and its libraries (NumPy, matplotlib, scipy, etc), R and gnuplot. On the negative side, although several practical examples are given in each section, no files are provided so readers can follow exactly the same example on their side.
More advanced techniques are given later in the book going from clustering to data mining, but the author always find a way to prune away non essential concepts and focusing on what the reader really needs to know.
In short, I recommend this book to anyone wanting to give a little more kick to the tremendous amount of data surrounding them. Personal advice and little tricks make the read a most enjoyable one.
For more information : http://oreilly.com/catalog/9780596802363/
Share this
You May Also Like
These Related Stories

The iPad and the world of publishing
Following two very relevant questions from Frédérick in my last article concerning my impressions of the Ipad , I realized that there was material to write a new article to really answer his questions, but in a broader way.

Infopresse360 Day on Web Analytics: Summary of Jacques Warren's conference
Jacques Warren was commissioned to introduce Web analytics (WA) to the people in the room. He asks very simple questions, but how fundamental.
.png)
Why the performance indicators you use will become unreliable
The challenge of the moment, with the apocalypse of cookies , is to fully understand the repercussions of such a transformation in the digital world. Today, we are going to take a closer look at its consequences on the measurement of data, what we call analytics . Get ready, you will have to question several performance indicators that you observe on a daily basis to measure your effectiveness! We will therefore focus mainly on the repercussions of the cookie apocalypse on primary cookies. Unlike third-party cookies, which tend to disappear, primary cookies will remain relevant, but will experience major changes: the lifetime of primary cookies will be limited to 7 days (or even 24 hours in some cases) on the Safari browser; Cookies on Firefox will be deleted daily unless there has been user activity within 45 days. CHANGES IN YOUR DAILY LIFE What does this limited lifespan imply? In fact, this means that it will be difficult to recognize a user from one visit to another, which will ha …
Repérer et valoriser ce qui compte vraiment.
Recevez nos analyses et conseils pour rester à l’avant-garde du numérique.