Posts

Showing posts from March, 2015

Analyzing Twitter data with R (part 1: connecting to Twitter API )

Image
All "Smart" Businesses are looking to understand the social media trends , to analyze the massive amount of public data available online and to make insightful decisions based on these analysis. Source :http://www.slashgear.com/twitter-data-grants-introduced-to-offer-select-institutes-data-trove-05315867/ For any statistician or future data scientist freshly graduating out of the university, it is very important to be able to have certain skills with the statistical modeling and mathematical knowledge. In this series of posts, I will detail the necessary steps for that you will need to access Twitter, import data, clean it and analyze it and have a conclusion based on the data you have extracted. It will be a simple step-by-step tutorial if you'd like to call it that way. This series of posts is destined to students, currently taking data science classes, and to anyone interested in R language and social media in particular . I've been asked to make ...

R & SQL: Simple Data Science with R and SQL

Hello World! The topic of today's article is databases. As Data Scientists and Statisticians work with data everyday, they wont actually use that 50 lines text file data-sets provided by teachers in the statistical analysis courses in a real-world applications. Statisticians work with massive amounts of data, whether this data is stored in flat-files or in databases, the size of the analyzed data will definitely be more then a couple of hundred records. Thus the need for a way to extract data from large tables stored in databases in a simple, intuitive way. Whats is SQL ? SQL means  Structured Query Language . For me, it has always been the "Simple Query Language". I've always used the term  "Simple"  to describe the simplicity of learning and using basic sql functions.