Skip to content

Commit 006b2f7

Browse files
committed
ipl extra analysis updated
1 parent 0bd783c commit 006b2f7

6 files changed

Lines changed: 138230 additions & 0 deletions

File tree

Lines changed: 219 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,219 @@
1+
{
2+
"cells": [
3+
{
4+
"cell_type": "markdown",
5+
"metadata": {},
6+
"source": [
7+
"# IPL Dataset Analysis\n",
8+
"\n",
9+
"## Problem Statement\n",
10+
"We want to know as to what happens during an IPL match which raises several questions in our mind with our limited knowledge about the game called cricket on which it is based. This analysis is done to know as which factors led one of the team to win and how does it matter.\n",
11+
"\n",
12+
"## About the Dataset :\n",
13+
"The Indian Premier League (IPL) is a professional T20 cricket league in India contested during April-May of every year by teams representing Indian cities. It is the most-attended cricket league in the world and ranks sixth among all the sports leagues. It has teams with players from around the world and is very competitive and entertaining with a lot of close matches between teams.\n",
14+
"\n",
15+
"The IPL and other cricket related datasets are available at [cricsheet.org](https://cricsheet.org). Feel free to visit the website and explore the data by yourself as exploring new sources of data is one of the interesting activities a data scientist gets to do.\n",
16+
"\n",
17+
"## About the dataset:\n",
18+
"Snapshot of the data you will be working on:<br>\n",
19+
"<br>\n",
20+
"The dataset 136522 data points and 23 features<br>\n",
21+
"\n",
22+
"|Features|Description|\n",
23+
"|-----|-----|\n",
24+
"|match_code|Code pertaining to individual match|\n",
25+
"|date|Date of the match played|\n",
26+
"|city|City where the match was played|\n",
27+
"|venue|Stadium in that city where the match was played|\n",
28+
"|team1|team1|\n",
29+
"|team2|team2|\n",
30+
"|toss_winner|Who won the toss out of two teams|\n",
31+
"|toss_decision|toss decision taken by toss winner|\n",
32+
"|winner|Winner of that match between two teams|\n",
33+
"|win_type|How did the team won(by wickets or runs etc.)|\n",
34+
"|win_margin|difference with which the team won| \n",
35+
"|inning|inning type(1st or 2nd)|\n",
36+
"|delivery|ball delivery|\n",
37+
"|batting_team|current team on batting|\n",
38+
"|batsman|current batsman on strike|\n",
39+
"|non_striker|batsman on non-strike|\n",
40+
"|bowler|Current bowler|\n",
41+
"|runs|runs scored|\n",
42+
"|extras|extra run scored|\n",
43+
"|total|total run scored on that delivery including runs and extras|\n",
44+
"|extras_type|extra run scored by wides or no ball or legby|\n",
45+
"|player_out|player that got out|\n",
46+
"|wicket_kind|How did the player got out|\n",
47+
"|wicket_fielders|Fielder who caught out the player by catch|\n",
48+
"\n"
49+
]
50+
},
51+
{
52+
"cell_type": "markdown",
53+
"metadata": {},
54+
"source": [
55+
"## Load the data"
56+
]
57+
},
58+
{
59+
"cell_type": "code",
60+
"execution_count": 1,
61+
"metadata": {},
62+
"outputs": [],
63+
"source": [
64+
"# Read the data using pandas module.\n",
65+
"import pandas as pd\n",
66+
"import numpy as np\n",
67+
"\n"
68+
]
69+
},
70+
{
71+
"cell_type": "markdown",
72+
"metadata": {},
73+
"source": [
74+
"## There are matches being played all around the world. Find the list of unique cities where matches are being played throughout the world."
75+
]
76+
},
77+
{
78+
"cell_type": "code",
79+
"execution_count": null,
80+
"metadata": {},
81+
"outputs": [],
82+
"source": []
83+
},
84+
{
85+
"cell_type": "markdown",
86+
"metadata": {},
87+
"source": [
88+
"## Find columns containing null values if any."
89+
]
90+
},
91+
{
92+
"cell_type": "code",
93+
"execution_count": null,
94+
"metadata": {},
95+
"outputs": [],
96+
"source": []
97+
},
98+
{
99+
"cell_type": "markdown",
100+
"metadata": {},
101+
"source": [
102+
"## Matches are played throughout the world in different countries but they may or may not have multiple venues(stadiums where matches are played). Find the top 5 venues where the most matches are played.\n"
103+
]
104+
},
105+
{
106+
"cell_type": "code",
107+
"execution_count": null,
108+
"metadata": {},
109+
"outputs": [],
110+
"source": []
111+
},
112+
{
113+
"cell_type": "markdown",
114+
"metadata": {},
115+
"source": [
116+
"## Find out how the runs were scored that is the runs count frequency table( number of singles, doubles, boundaries, sixes etc were scored)."
117+
]
118+
},
119+
{
120+
"cell_type": "code",
121+
"execution_count": null,
122+
"metadata": {},
123+
"outputs": [],
124+
"source": []
125+
},
126+
{
127+
"cell_type": "markdown",
128+
"metadata": {},
129+
"source": [
130+
"## IPL seasons are held every year now let's look at our data and extract how many seasons and which year were they played?\n"
131+
]
132+
},
133+
{
134+
"cell_type": "code",
135+
"execution_count": null,
136+
"metadata": {},
137+
"outputs": [],
138+
"source": []
139+
},
140+
{
141+
"cell_type": "markdown",
142+
"metadata": {},
143+
"source": [
144+
"## Find out the total number of matches played in each season also find the total number of runs scored in each season.\n"
145+
]
146+
},
147+
{
148+
"cell_type": "code",
149+
"execution_count": null,
150+
"metadata": {},
151+
"outputs": [],
152+
"source": []
153+
},
154+
{
155+
"cell_type": "markdown",
156+
"metadata": {},
157+
"source": [
158+
"## There are teams which are high performing and low performing. Let's look at the aspect of performance of an individual team. Filter the data and aggregate the runs scored by each team. Display top 10 results which are having runs scored over 200."
159+
]
160+
},
161+
{
162+
"cell_type": "code",
163+
"execution_count": null,
164+
"metadata": {},
165+
"outputs": [],
166+
"source": []
167+
},
168+
{
169+
"cell_type": "markdown",
170+
"metadata": {},
171+
"source": [
172+
"## Chasing a 200+ target is difficulty in T-20 format. What are the chances that a team scoring runs above 200 in their 1st inning is chased by the opposition in 2nd inning.\n"
173+
]
174+
},
175+
{
176+
"cell_type": "code",
177+
"execution_count": null,
178+
"metadata": {},
179+
"outputs": [],
180+
"source": []
181+
},
182+
{
183+
"cell_type": "markdown",
184+
"metadata": {},
185+
"source": [
186+
"## Every season has that one team which is outperforming others and is in great form. Which team has the highest win counts in their respective seasons ?\n",
187+
"\n"
188+
]
189+
},
190+
{
191+
"cell_type": "code",
192+
"execution_count": null,
193+
"metadata": {},
194+
"outputs": [],
195+
"source": []
196+
}
197+
],
198+
"metadata": {
199+
"kernelspec": {
200+
"display_name": "Python 3",
201+
"language": "python",
202+
"name": "python3"
203+
},
204+
"language_info": {
205+
"codemirror_mode": {
206+
"name": "ipython",
207+
"version": 3
208+
},
209+
"file_extension": ".py",
210+
"mimetype": "text/x-python",
211+
"name": "python",
212+
"nbconvert_exporter": "python",
213+
"pygments_lexer": "ipython3",
214+
"version": "3.6.5"
215+
}
216+
},
217+
"nbformat": 4,
218+
"nbformat_minor": 2
219+
}

0 commit comments

Comments
 (0)