Given that the 2021 season of IPL is already underway, I thought of revisiting this dataset that I have been doing analysis on. It’s quite refreshing to see how we look at the same data very differently each time we see it and the hidden nuggets of insights that we encounter. I have written about IPL and the data analysis in the past here, here and here!

This time I am looking at the big picture and trying to answer the question “How are runs scored across overs in the 20 overs that is played in the game”. …


There is no dearth of subject matter experts when it comes to the topic of leadership. Newbies want a “leadership role” as soon as they join an organization — not that it is bad. Disruptive thinking and fresh blood do help companies go from ‘good’ to ‘great’ — but maybe all that’s said about getting to be a leader may not always be everyone’s cup of tea. I sincerely admire decent, hardworking and even stellar people who say “I don’t think I’m cut to be that leader, I’m happy where I am.” …


In this set of fairly basic analysis on the IPL datasets, I look at whether 2020 was a particularly interesting year when viewed from different angles. Notwithstanding the fact that due to COVID-19, the 2020 IPL was played outside of India (This is only the second time in IPLs twelve year history that these games were played outside India. The last one was in South Africa in 2009), we would generally expect that with a series of illnesses, controversies etc., the performance of the 2020 IPL edition could be different.

The purpose of this analysis, for me, was to enhance…


I’ve missed most of this year’s IPL tournament. I had meant to blog my analysis as the tournament was progressing, but I guess life came in the way. I just got hold of this year’s IPL match data from cricsheet.org, and ever since I have been itching to analyze it. Here is a short post about partnerships and a quick result summary. As usual, I used R and ggplot to work through the data, the most difficult part being extracting information from the YAML data format which the website provided the data in. …


I got interested in text mining when I was a newbie to R. Text is one of the most common form of unstructured data out there and the knowledge to deal with analysis of such unstructured data is a good tool to possess in your ‘data science’ arsenal. This website/ book from David Robinson and Julia Silge on text mining got me learning about the techniques and their applications. The website gives useful examples and case studies of applying functions such as ‘tf-idf’, ‘n-grams’ and ‘topic modeling’ that can be used to extract meaning from a corpus of text. …


The Indian Premier League is a popular cricket tournament which was founded and is run by the Indian cricket governing body BCCI. It is touted to be the most attended cricket league in the world with its brand value estimated to be close to USD 6 bn in 2019 (In contrast, the 27 year old English Premier League’s valuation is estimated at $6.7 bn).

There have been twelve seasons of IPL so far from 2008 to 2019. The 2020 games have been postponed indefinitely as on the date of writing this article on account of COVID-19. Typically, these games run…

Sathvik Nishanth

Learner for life! Data Enthusiast.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store