Data visualisation course

We’ve launched a data visualisation course with JNTU & CIHL (at IIIT-H).
It covers large scale data analysis, building visualisations, and interactive designs.

The 4-week online course starts on May 26th (this Saturday). Course content and lessons will be available online. You’ll need about 2 hours every day to complete the assignments. These will be graded. Mentors are available 10am – 5pm for online help. Classroom sessions are on Saturdays 10am – 5pm at JNTU Hyderabad.

On completion, you’ll get a Certificate in Data Visualisation from CIHL and Gramener (assuming you’ve completed all the assignments.)

This course is for programmers. Ideally, you’d know HTML and some programming language – e.g. Python. (We’ll launch a non-programmer course soon.)

The course costs Rs. 9,000 for students and Rs. 12,000 for others.

You can contact or call 8008930678 / 9490422170 to register for the course.

Common birthdays


This visualisation shows the popularity of birthdays in the US between 1973 – 1999. The darkness of the colour shows the rank of how popular that birthday is. Dark colours are more popular (i.e. better ranked) birthdays.

  • Most people are born in August & September (and therefore were conceived around November & December, during the holidays, perhaps?)
  • However, very few people are actually born during holidays – New year, Independence day, Halloween, Thanksgiving and Christmas. (People don’t like to spoil their holidays?)
  • Few people are born on the 1st of April. (You don’t want your kid born on Fool’s Day)
  • Few people are born on the 13th of any month. (Unlucky?)
  • Plenty are born on Valentine’s Day and St Patrick’s day

We tried to see what this looked like in India.

Based on school registration data for ~700,000 students born between 1992 – 1995, here’s what it looks like. (Click for a larger version.)


This shows a number of bizarre patterns:

  • Almost everyone’s born between May and June – just before the school opens.
  • Almost no one is born in August – after school opens.
  • An unusual number of people have round-numbered days as birthdays – 5th, 10th, 15th, 20th, 25, and 30th. (This round-numbered pattern was also seen when we analysed utility fraud).
  • January 1st is fairly popular. Other than that, none of the holidays seem to have an effect.

In fact, these results are so striking that we are tempted to believe that the popularly accepted proof for a person’s age – their Class 10 certificate – generally bears a convenient fiction created for the purposes of school admission several years ago.

Data science news

Big Data Is An Issue Of Corporate Survival

“It is imperative from the business standpoint that you need to get ahead of this new wave of interacting with customers. You need to know who that customer is, what they represent to the business now, what they should represent to the business and how to move them along the trajectory to be that great customer they should be.”

Annika Jiminez, senior director for analytics solutions at Greenplum, said big data is happening in nearly every sector of business and government, from health care where it is used in medical records and treatment pathways to car manufacturers using it to capture data on how vehicles are used and transmitting it to a data center.

‘Big Data’ Could Remake Science — And Government

The research firm Gartner predicted in December 2011 that 85 percent of Fortune 500 firms will be unprepared to leverage big data for a competitive advantage by 2015.
Big-data analytics also has the potential to improve government efficiency, panelists at the TechAmerica event said.

The Centers for Medicare and Medicaid Services, for example, could pull data from insurance reports and hospital forms and anonymized data from electronic medical records to get a much better understanding of which medications and procedures are most effective, said Caron Kogan, a strategic planning director at Lockheed Martin Corp.

Visualization Broadens Business Intelligence’s Appeal

Some 400 IT and business unit managers responding to a survey found advanced analytics, which Dresner Advisory Services founder Howard Dresner defines as “extensive use of color, size, shape, 3D, texture, motion, etc. to convey meaning,” more compelling than Big Data, the cloud, social media analytics and other trendy business intelligence technologies.

On a rising scale of importance, from one to five, respondents gave advanced visualization a 3.8. Dashboards, respondents’ top priority, rated only slightly higher at 4.15

Predictive Analytics Goes Deep, Catches Pass From Tech Giant IBM

Beyond the simple data analysis of standard business intelligence (BI) software, predictive analytics solutions give midsize IT the ability to not just crunch numbers but get a glimpse of what the future may hold–this is an invaluable asset in the quickly changing tech market. Adoption of predictive software services has been slow in the world of IT, but it is now getting noticed both at the enterprise investor level and on the gridiron.

Where Big Data Shows Huge ROI

Big data projects can far surpass the hype by nurturing context and connections, according to an analysis of numerous case studies by Nucleus Research.
Examples of those returns included: a 942 percent ROI for a manufacturer that was able to scour large, disparate data sets from vendors for purchasing and cost information; 1,822 percent ROI from reduced labor costs by a resort that integrated shift scheduling processes with data from the National Weather Service; and an 863 percent ROI by a metropolitan police force that was able to combine various crime databases alongside predictive analytics and its department assets.

How visualisation uncovers the big picture of ‘Big Data’

According to Gartner, Big Data is “…the volume, variety and velocity of structured and unstructured data pouring through networks into processors and storage devices, along with the conversion of such data into business advice for enterprises.” A recent report from the Center for Economics and Business Research (CEBR) 1, suggests that improved use of this Big Data could add £216 billion to the UK economy and create 58,000 jobs. Data visualisation can be a key tool in helping users explore and communicate data through graphic representations – enabling collaborating, inferring connections and drawing conclusions that benefit business’ bottom line.