Minsifye/Stack-Overflow-Survey-Data-Analysis

Data Analysis on Stack Overflow Developer Survey 2017 Dataset

I followed the CRISP-DM process for this analysis.

  1. Business Understanding - I started the analysis with the posed questions in mind.
  2. Data Understanding - To better understand the data, I went through the dataset and noted how each part could be used in the analysis. For example: which columns help answer a particular question?
  3. Prepare Data - At various points I had to wrangle and transform the data to get the results. Keeping DRY principles in mind, I also wrote a function to draw a plotly bar chart, since that code was repeated often.
  4. Model Data - This analysis does not involve a modeling step. I might add one in future work.
  5. Results - I use visualizations such as bar charts and pie charts to convey my findings, and I added a result statement after every visualization to make the thought process easy to follow.
  6. Deploy - I am not deploying this code anywhere right now. For now, it is available only as a Jupyter notebook.
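The reusable bar chart function mentioned in step 3 could look roughly like the sketch below. This is not the notebook's actual code: the names `build_bar_figure` and `show_figure` are hypothetical, and the sample answers are made up for illustration. The figure is built as a plain dict, which `plotly.io.show` accepts, so the chart description can be created and inspected even where plotly is not installed.

```python
from collections import Counter


def build_bar_figure(labels, values, title, x_title="", y_title=""):
    """Return a bar chart description as a plain dict that plotly can render.

    Hypothetical helper: one place to change styling for every bar chart.
    """
    if len(labels) != len(values):
        raise ValueError("labels and values must have the same length")
    return {
        "data": [{"type": "bar", "x": list(labels), "y": list(values)}],
        "layout": {
            "title": {"text": title},
            "xaxis": {"title": {"text": x_title}},
            "yaxis": {"title": {"text": y_title}},
        },
    }


def show_figure(fig_dict):
    """Render the figure. plotly is imported lazily, only when drawing."""
    import plotly.io as pio

    pio.show(fig_dict)


# Made-up answers for illustration only; these are not survey results.
answers = ["Bootcamp", "Degree", "Bootcamp", "Online course"]
counts = Counter(answers)

fig = build_bar_figure(
    labels=list(counts.keys()),
    values=list(counts.values()),
    title="Example answer counts",
    x_title="Answer",
    y_title="Respondents",
)
# In a notebook with plotly installed: show_figure(fig)
```

Keeping the figure construction separate from rendering means each chart in the notebook only needs to supply its labels, values, and titles.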

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

Install plotly first. Beyond that, no libraries are needed other than those included in the Anaconda distribution of Python. The code should run without issues on any Python 3.* version.
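Before opening the notebook, a quick check like the one below can confirm the environment is ready. It is a minimal sketch: the function name `missing_packages` is hypothetical, and listing `pandas` is an assumption based on the Anaconda requirement rather than something stated here.

```python
import importlib.util
import sys


def missing_packages(names):
    """Return the top-level package names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# plotly is the one extra install named above; pandas is an assumption.
required = ["plotly", "pandas"]

python_ok = sys.version_info >= (3,)
missing = missing_packages(required)

if not python_ok:
    print("Python 3 is required")
elif missing:
    print("Missing packages: " + ", ".join(missing))
else:
    print("Environment looks ready")
```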

Project Motivation

For this project, I was interested in using Stack Overflow data from 2017 to better understand:

  1. How did other developers suggest breaking into the field (what education to pursue)?
  2. What factors about an individual contributed to salary?
  3. What was the state of bootcamps for assisting individuals with breaking into developer roles?
  4. How were bootcamps assisting with increasing diversity in tech careers?
  5. Which EmploymentStatus group has the highest average career satisfaction?
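Question 5 comes down to a group-wise average. The sketch below shows that computation using only the standard library. It is not taken from the notebook: the function name `mean_by_group` is hypothetical, the `CareerSatisfaction` column name and the `NA` missing-value marker are assumptions, and the rows are made up for illustration.

```python
from collections import defaultdict
from statistics import mean


def mean_by_group(rows, group_key, value_key):
    """Average value_key per distinct group_key, skipping missing answers."""
    groups = defaultdict(list)
    for row in rows:
        group = row.get(group_key)
        raw = row.get(value_key)
        if group in (None, "") or raw in (None, "", "NA"):
            continue  # respondent left one of the two fields blank
        groups[group].append(float(raw))
    return {group: mean(values) for group, values in groups.items()}


# Made-up rows for illustration only; these are not survey results.
rows = [
    {"EmploymentStatus": "Employed full-time", "CareerSatisfaction": "8"},
    {"EmploymentStatus": "Employed full-time", "CareerSatisfaction": "6"},
    {"EmploymentStatus": "Employed part-time", "CareerSatisfaction": "9"},
    {"EmploymentStatus": "Employed part-time", "CareerSatisfaction": "NA"},
]

averages = mean_by_group(rows, "EmploymentStatus", "CareerSatisfaction")
best_group = max(averages, key=averages.get)
print(best_group, averages[best_group])
```

With the data loaded into a pandas DataFrame `df`, the same averages would come from `df.groupby("EmploymentStatus")["CareerSatisfaction"].mean()`, which skips missing values by default.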

File Descriptions

There is one notebook here showcasing the work related to the above questions. The notebook is exploratory: it searches through the data relevant to the questions named in its title. Markdown cells walk through the thought process for individual steps.
Data files: Download

Results

The main findings of this analysis are described in the Medium post available here.

Licensing, Authors, and Acknowledgements

Credit goes to Stack Overflow for the data. You can find the licensing for the data and other descriptive information at the Kaggle link available here. Otherwise, feel free to use the code here as you would like!
