Scalable architecture for real-time Twitter sentiment analysis

This post describes the design and implementation of a scalable architecture to monitor and visualize sentiment for a Twitter hashtag in real time. The project streams live tweets matching a hashtag from Twitter, performs sentiment analysis on each tweet, and calculates the rolling mean of the sentiment scores. This rolling mean is continuously sent to connected browser clients and displayed in a sparkline graph.
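To make the rolling-mean step concrete, here is a minimal Python sketch. It is not taken from the project itself: the `RollingMean` class, the window size of 3, and the score range of -1.0 to 1.0 are all assumptions chosen for illustration.

```python
from collections import deque


class RollingMean:
    """Mean of the most recent `window` sentiment scores (hypothetical helper)."""

    def __init__(self, window: int) -> None:
        if window <= 0:
            raise ValueError("window must be positive")
        # A bounded deque drops the oldest score automatically when full.
        self._scores = deque(maxlen=window)
        self._total = 0.0

    def add(self, score: float) -> float:
        """Record one score and return the updated rolling mean."""
        if len(self._scores) == self._scores.maxlen:
            # The oldest score is about to be evicted, so remove it from the sum.
            self._total -= self._scores[0]
        self._scores.append(score)
        self._total += score
        return self._total / len(self._scores)


# Assumed example: per-tweet scores in [-1.0, 1.0] and a window of 3 tweets.
rolling = RollingMean(window=3)
means = [rolling.add(score) for score in [1.0, -1.0, 0.5, 0.5]]
```

Keeping a running total makes each update constant-time, which matters when every incoming tweet triggers a new value for the connected clients.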

Top 15 Commands for Data Manipulation in R

R is one of the most widely used programming languages for data and statistical analysis. At eMumba we use R heavily to make sense of data, to find patterns, and for general exploratory data analysis. Generally, the results of these analyses are fed into machine learning models to solve various classification and regression problems. In this post, I want to share the top 15 commands that come in very handy while doing data exploration in R.

Log Analysis Part 2

Apache Storm

Continuing on the log analysis journey, in this post I explore Apache Storm. Apache Storm is a framework for real-time, distributed, fault-tolerant computation. Storm gives you a set of abstractions to help build systems that can analyze a large volume of streaming data in real time. Here is an excellent talk on Storm by its creator, Nathan Marz, for anyone who wants to dive into the details.

Setting up Google reCAPTCHA in a ReactJS app

Google's reCAPTCHA is an industry standard when it comes to fighting bots. Integrating it in a regular web app has almost become a no-brainer, thanks to plugins available on almost every platform to do the job. With this expectation, I started integrating it in my latest app built on ReactJS but encountered many roadblocks. Without going into details, I'm sharing here the fastest recipe for integrating it with your ReactJS app.