This project involves analyzing the faculty data of NUML's Department of Computer Science (CS) and Department of Software Engineering (SE). The analysis focuses on comparing faculty members across various aspects such as:
- The number of faculty members in each department.
- Designation-wise comparison (e.g., Associate Professors, Assistant Professors, and Lecturers).
- PhD qualification comparison.
- Comprehensive list of all faculty members with their designations, department, and contact information.
The workflow of this project includes the following steps:
-
Data Collection:
- Scraped faculty data from the NUML website for both CS and SE departments.
- Organized data into a structured format.
-
Data Transformation:
- Cleaned and formatted the data (e.g., handling missing fields, standardizing designations).
- Categorized faculty members by their designations and departments.
-
Data Analysis:
- Performed comparisons between the CS and SE departments on:
- Total number of faculty members.
- Designation distribution (Lecturers, Assistant Professors, Associate Professors).
- Number of PhD-qualified faculty members.
- Performed comparisons between the CS and SE departments on:
-
Visualization:
- Created visualizations to illustrate:
- Faculty distribution in CS and SE departments.
- Designation-wise comparison.
- PhD qualification comparison.
- List of all faculty members with their designations, departments, and contact details, color-coded as follows:
- Red: Lecturer
- Yellow: Associate Professor
- Orange: Assistant Professor
- Created visualizations to illustrate:
-
ETL Process:
-
Analysis Visualizations:
To reproduce this project, follow these steps:
- Clone the repository.
- Install the required Python libraries.
- Execute the data scraping and transformation script.
- View the analysis and visualizations.
- Python for data scraping, transformation, and analysis.
- Tableau for creating visualizations.
This project is licensed under the MIT License - see the LICENSE file for details.