Indian Premier League Data Collection
Curated collection of datasets on the IPL for in-depth analysis of one of cricket's most popular tournaments.
Many of us have watched the movie Moneyball. Its central idea is that a great team can be built through careful scouting and trust in player statistics. That kind of analysis, however, depends on a good dataset that exposes each player's strengths and weaknesses.

The Indian Premier League is among the most famous cricket leagues, with players coming from around the world. What makes the tournament more competitive this year is the auction, which resulted in all players changing squads, and the addition of two new teams to the league. With the IPL held every year, and as a true cricket fan, I believed there was a need for datasets covering both match information and data for each delivery. So I scraped data from multiple sources to create the datasets below. Here is an overview of the datasets that I created in 2022 and published on Kaggle.
Kaggle Dataset | Description | Upvotes | Views | Downloads |
---|---|---|---|---|
IPL 2008 to 2022 All Match Dataset | IPL 2008-2022 Ball By Ball and Match Info Data | 97 | 42k | 10k |
IPL 2022 Match Dataset | IPL 2022 Ball By Ball and Match Info Data | 91 | 38k | 8k |
IPL 2022 Player Statistics | Data for all players in Tata IPL 2022, with all-time IPL and T20 stats | 43 | 16k | 3k |
The code for this project is available at sahilvora10/IPL_Data_Extraction. If you have any questions or inquiries, please feel free to contact me at sahilvora2021@gmail.com
IPL 2008 to 2022 All Match Dataset
This widely used dataset combines all the available data for each match played from the opening season in 2008 through 2022. It also has statistics and data about each ball delivered during those years.
- The dataset was among the most used datasets in the Sports category for 2022.
- The data was scraped from Cricsheet.
- Around 226k rows of data were fetched.
- Around 37 features are available in the dataset.
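As a rough illustration of what delivery-level data makes possible, the sketch below totals runs and deliveries per batter. The column names (`match_id`, `batter`, `batsman_run`, `total_run`, and so on) and the sample rows are assumptions made up for this example and may not match the published CSV, so check the actual header before reusing it. It also counts every delivery as a ball faced, which is a simplification (wides, for example, are normally excluded).

```python
import csv
import io
from collections import defaultdict

# Tiny inline sample standing in for the ball-by-ball CSV.
# Column names and values are hypothetical, not taken from the real file.
SAMPLE_CSV = """match_id,innings,over,ball,batter,bowler,batsman_run,total_run
1,1,0,1,A Sharma,B Kumar,4,4
1,1,0,2,A Sharma,B Kumar,0,1
1,1,0,3,C Patel,B Kumar,6,6
1,1,0,4,A Sharma,B Kumar,1,1
"""


def batter_summary(rows):
    """Aggregate runs, deliveries, and strike rate per batter."""
    totals = defaultdict(lambda: {"runs": 0, "balls": 0})
    for row in rows:
        stats = totals[row["batter"]]
        stats["runs"] += int(row["batsman_run"])
        stats["balls"] += 1  # simplification: every delivery counts as a ball faced
    return {
        name: {**s, "strike_rate": round(100 * s["runs"] / s["balls"], 2)}
        for name, s in totals.items()
    }


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
summary = batter_summary(rows)

# Print batters from highest to lowest run total.
for name, stats in sorted(summary.items(), key=lambda kv: -kv[1]["runs"]):
    print(name, stats)
```

To run the same aggregation on the real file, replace the `io.StringIO(SAMPLE_CSV)` source with an opened file handle and adjust the column names to match.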
IPL 2022 Match Dataset
This dataset was open-sourced and updated daily while the tournament was active. It also has statistics and data about each ball delivered during the 2022 season.
- The dataset was used by many contributors to predict the best players while the tournament was live!
- The data was scraped from Cricsheet.
- Around 17.9k rows of data were fetched.
- Around 37 features are available in the dataset.
IPL 2022 Player Statistics
With the squads finalized for the new IPL season, this concise dataset provides statistics for all the players in 2022. The data can be analyzed to build your dream team, which can also help anyone playing fantasy leagues and tournaments.
The dataset is a single CSV file listing all players. It contains each player's all-time batting, bowling, and fielding figures in the IPL, along with their T20 stats outside the IPL, whether international or domestic.
- The dataset was among the most used datasets under the daily category in 2022.
- The data was scraped from NDTV Sports.
- Contains data for all 237 players who played in IPL 2022.
- Around 80 features are available in the dataset.
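One simple way to start on a dream team with a player-level table like this is to rank players on a single statistic. The sketch below does that with the standard library. The column names (`player`, `role`, `ipl_runs`, `ipl_batting_avg`, `ipl_wickets`) and all the values are placeholders invented for this example, not fields or figures from the actual CSV.

```python
import csv
import io

# Inline placeholder data standing in for the player statistics CSV.
# Column names and numbers are hypothetical.
SAMPLE_CSV = """player,role,ipl_runs,ipl_batting_avg,ipl_wickets
Player A,Batter,4000,38.5,0
Player B,Bowler,150,9.2,120
Player C,All-rounder,1800,27.4,65
Player D,Batter,2500,31.0,2
"""


def top_players(rows, column, n=2):
    """Return the names of the n players with the highest value in `column`."""
    ranked = sorted(rows, key=lambda r: float(r[column]), reverse=True)
    return [r["player"] for r in ranked[:n]]


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

best_batters = top_players(rows, "ipl_batting_avg")
best_bowlers = top_players(rows, "ipl_wickets")
print("Top by batting average:", best_batters)
print("Top by wickets:", best_bowlers)
```

Ranking on one column is only a starting point. A real selection would likely combine several of the roughly 80 features and respect constraints such as team roles.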