March 2015

Mar 17, 2015

Analyzing Twitter data with R (part 1: connecting to Twitter API )



All "Smart" Businesses are looking to understand the social media trends , to analyze the massive amount of public data available online and to make insightful decisions based on these analysis.


Source :http://www.slashgear.com/twitter-data-grants-introduced-to-offer-select-institutes-data-trove-05315867/


For any statistician or future data scientist freshly graduating out of the university, it is very important to be able to have certain skills with the statistical modeling and mathematical knowledge.

In this series of posts, I will detail the necessary steps for that you will need to access Twitter, import data, clean it and analyze it and have a conclusion based on the data you have extracted.

It will be a simple step-by-step tutorial if you'd like to call it that way.

This series of posts is destined to students, currently taking data science classes, and to anyone interested in R language and social media in particular .

I've been asked to make the same thing with Python also. I will try to make time and share some Python practices for those using it for data analysis. But I encourage you to learn R !


Mar 2, 2015

R & SQL: Simple Data Science with R and SQL



Hello World!

The topic of today's article is databases.

As Data Scientists and Statisticians work with data everyday, they wont actually use that 50 lines text file data-sets provided by teachers in the statistical analysis courses in a real-world applications.

Statisticians work with massive amounts of data, whether this data is stored in flat-files or in databases, the size of the analyzed data will definitely be more then a couple of hundred records.
Thus the need for a way to extract data from large tables stored in databases in a simple, intuitive way.


Whats is SQL ?


SQL means Structured Query Language. For me, it has always been the "Simple Query Language".

I've always used the term "Simple" to describe the simplicity of learning and using basic sql functions.