Resume Classification and Ranking System

Overview

This project processes a batch of resumes in PDF format, classifies each one into a predefined job category, and ranks the resumes by relevance to a specified job role.

Features

Extracts text from resumes in PDF format.
Cleans and preprocesses text using regex.
Tokenizes and sequences text for model input.
Uses a deep learning model (CNN + LSTM) to classify resumes.
Ranks resumes based on their softmax probability score for a given job role.
Normalizes scores and sorts resumes in descending order of relevance.
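
The cleaning and sequencing steps listed above can be sketched in plain Python. This is a minimal illustration, not the notebook's actual code: the specific regex patterns, the helper names (clean_resume_text, build_vocab, to_padded_sequence), the vocabulary size, the post-padding convention, and the reserved indices 0 (padding) and 1 (unknown word) are all assumptions made for the example. The project itself ships a saved tokenizer (tokenizer.pkl) that would take the place of build_vocab.

```python
import re
from collections import Counter


def clean_resume_text(text):
    """Strip common noise with regular expressions and lowercase the result."""
    text = re.sub(r"https?://\S+|www\.\S+", " ", text)  # URLs
    text = re.sub(r"\S+@\S+", " ", text)                # email addresses
    text = re.sub(r"[^a-zA-Z\s]", " ", text)            # digits, punctuation, symbols
    text = re.sub(r"\s+", " ", text)                    # collapse runs of whitespace
    return text.strip().lower()


def build_vocab(texts, num_words=5000):
    """Map the most frequent words to integer ids (0 = padding, 1 = unknown)."""
    counts = Counter(word for t in texts for word in t.split())
    most_common = counts.most_common(num_words - 2)
    return {word: idx + 2 for idx, (word, _) in enumerate(most_common)}


def to_padded_sequence(text, vocab, max_len=200):
    """Convert cleaned text to a fixed-length list of ids, padding with zeros."""
    ids = [vocab.get(word, 1) for word in text.split()][:max_len]
    return ids + [0] * (max_len - len(ids))
```

A fixed length is needed because the classifier expects every input sequence to have the same shape; longer resumes are truncated and shorter ones are padded.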

Dependencies

Install the required Python libraries:

pip install numpy pandas tensorflow scikit-learn PyPDF2

Dataset

The model is trained on UpdatedResumeDataSet.csv, which contains resume text and the corresponding job category for each resume.

Model Architecture

Embedding Layer: Converts words into dense vectors.
Conv1D Layer: Captures local dependencies in text.
MaxPooling1D Layer: Downsamples the convolved sequence, reducing its length.
LSTM Layer: Captures long-range dependencies across the sequence.
Dropout Layer: Prevents overfitting.
Dense Layer with Softmax Activation: Outputs probability distribution across job categories.
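
The layer stack above could be assembled in Keras roughly as follows. This is a sketch under stated assumptions: the layer order comes from the list above, but every hyperparameter (vocabulary size, sequence length, embedding width, filter count, kernel size, LSTM units, dropout rate, number of classes), the optimizer, and the loss are placeholder values chosen for illustration, and build_model is a hypothetical name. The trained model in deeprank_model.h5 may use different settings.

```python
from tensorflow import keras
from tensorflow.keras import layers


def build_model(vocab_size=5000, max_len=200, num_classes=25):
    """Embedding -> Conv1D -> MaxPooling1D -> LSTM -> Dropout -> Dense(softmax)."""
    model = keras.Sequential([
        keras.Input(shape=(max_len,)),            # padded sequences of word ids
        layers.Embedding(vocab_size, 64),         # word ids -> dense vectors
        layers.Conv1D(64, 5, activation="relu"),  # local n-gram features
        layers.MaxPooling1D(pool_size=2),         # halve the sequence length
        layers.LSTM(64),                          # long-range dependencies
        layers.Dropout(0.5),                      # regularization
        layers.Dense(num_classes, activation="softmax"),  # class probabilities
    ])
    # Assumed training setup: integer-encoded labels with a sparse loss.
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model
```

Because the final layer is a softmax, each row of the model's output sums to 1, and the entry for the requested job role can be read directly as that resume's relevance score.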

Usage

Place resumes in the Resumes folder.
Load the pre-trained model weights (deeprank_model.h5); the repository also ships tokenizer.pkl and label_encoder.pkl alongside them.
Run the script to classify and rank resumes for a given job role.
Example:

pdf_folder = "Resumes"
job_role = "Data Science"
ranked_resumes = process_resumes(pdf_folder, job_role)
print(ranked_resumes)

Output

The script returns a sorted DataFrame containing:

Resume file name
Job role probability score
Normalized score (0-1 scale)
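
The normalization and sorting behind these columns might look like the sketch below. The source does not say which normalization is used, so min-max scaling is an assumption here, as are the function name rank_resumes and the field names. The example uses plain dictionaries to stay dependency-free; the same rows could be passed to pandas.DataFrame to obtain a table like the one described above.

```python
def rank_resumes(scores):
    """Rank resumes by score for one job role.

    `scores` maps a resume file name to the softmax probability the model
    assigned to the requested job role. Returns one row per resume, sorted
    from most to least relevant.
    """
    if not scores:
        return []
    lowest, highest = min(scores.values()), max(scores.values())
    span = highest - lowest
    rows = []
    for name, prob in scores.items():
        # Min-max scaling (assumed); if all scores are equal, treat them as 1.0.
        normalized = (prob - lowest) / span if span > 0 else 1.0
        rows.append({"resume": name, "score": prob, "normalized_score": normalized})
    return sorted(rows, key=lambda row: row["score"], reverse=True)
```

With min-max scaling the top resume always gets 1.0 and the bottom one 0.0, so the normalized score expresses relevance relative to the other resumes in the batch rather than absolute confidence.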

License

This project is licensed under the MIT License.

Author

Nilesh Ranjan Pal
