Skip to content

Data-Wrangling-and-Visualisation/JobHack

Repository files navigation

JobHack - Job Market Data Wrangling & Analysis

GitHub
GitHub last commit
GitHub repo size

JobHack is a powerful toolkit for scraping, analyzing, and visualizing job market data from platforms like LinkedIn, Headhunter, and Indeed. Designed for job seekers, recruiters, and data enthusiasts.

🔥 Key Features

🕷️ Web Scrapers

  • LinkedIn Scraper (scrapy spider working fully)
  • Headhunter Integration (2,000+ jobs collected)
  • Coming Soon: Indeed & Glassdoor support

🧠 AI-Powered Augmentation

  • LLaMA 3.2 (1B) via Ollama (for future...)
  • Extracts:
    • Salary ranges
    • Technical requirements
    • Education levels
    • Company details

📊 Analysis Tools

  • Salary distribution visualizations
  • Skills demand heatmaps
  • Remote work trends analysis

🛠️ Tech Stack

# Core Technologies
- Scrapy, BeautifulSoup  # Web scraping
- Pandas, NumPy         # Data wrangling
- Matplotlib, Plotly    # Visualization
- Ollama (LLaMA 3.2)    # NLP augmentation
- Jupyter Notebooks     # Analysis

🚀 Quick Start

Clone the repo:

git clone https://github.com/Data-Wrangling-and-Visualisation/JobHack.git
cd JobHack

Run the webservice:

cd my_flask_app
python app.py

Then you can open your browser and check different visuals

Alt text Alt text Alt text

Alt text

Alt text

Alt text