Google Jobs Scraper

About the project

The script scrapes every job listing from all pages (first through last available page) of https://careers.google.com/jobs and returns the result as a JSON string. You end up with a JSON file containing all the scraped data.
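
The page-by-page collection described above can be sketched roughly as follows. This is not the code in scrape_google.py: `scrape_all_pages`, `fetch_page`, and the fake pages are hypothetical names used only for illustration, the "stop at the first empty page" rule is an assumption, and so is storing `total` as an integer count.

```python
import json


def scrape_all_pages(fetch_page, first_page=1):
    """Collect jobs page by page until a page comes back empty.

    fetch_page is a hypothetical callable: page number -> list of job dicts.
    """
    jobs = []
    page = first_page
    while True:
        page_jobs = fetch_page(page)
        if not page_jobs:  # assumed stop condition: an empty page ends the run
            break
        jobs.extend(page_jobs)
        page += 1
    # Assumption: "total" is the number of jobs collected.
    return json.dumps({"total": len(jobs), "jobs": jobs}, indent=2)


# Stand-in data so the sketch runs without touching the network.
FAKE_PAGES = {
    1: [{"job_id": "id1", "title": "title1"}],
    2: [{"job_id": "id2", "title": "title2"}],
}


def fake_fetch_page(page):
    """Return the fake jobs for a page, or an empty list past the last page."""
    return FAKE_PAGES.get(page, [])


result = scrape_all_pages(fake_fetch_page)
print(result)
```

Passing the page fetcher in as a function keeps the loop testable on its own; a real fetcher would wrap whatever browser or HTTP calls the script makes.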

How to Run the Program

  1. Download and install Python 3
  2. Install the requirements
     pip install requests beautifulsoup4 selenium
  3. Download the latest release of ChromeDriver for your OS
  4. Extract chromedriver and move it into the same directory as the scrape_google.py file
  5. Finally, run scrape_google.py
     python scrape_google.py
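
Before running the script, it can help to confirm that the setup steps above took effect. The following is a minimal sketch, not part of the project: `missing_modules` and `find_chromedriver` are hypothetical helper names, and the `chromedriver.exe` filename on Windows is an assumption. Note that the beautifulsoup4 package is imported as `bs4`.

```python
import importlib.util
from pathlib import Path

# Import names for: pip install requests beautifulsoup4 selenium
REQUIRED_MODULES = ("requests", "bs4", "selenium")


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def find_chromedriver(script_dir="."):
    """Return the chromedriver path inside script_dir, or None if absent."""
    for name in ("chromedriver", "chromedriver.exe"):  # .exe assumed on Windows
        candidate = Path(script_dir) / name
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing packages:", ", ".join(missing))
    if find_chromedriver() is None:
        print("chromedriver not found next to scrape_google.py")
```

Run it from the directory that contains scrape_google.py; no output means both checks passed.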

Note: Windows users, please check this video

Structure of JSON output

{
  "total": "total_count",
  "jobs": [
    {
      "job_id": "id1",
      "title": "title1",
      "location": "location1", 
      "intro": "introduction1", 
      "resps": "responsibilities1",
      "quals": "qualifications1"
    },
    {
      "job_id": "id2",
      "title": "title2",
      "location": "location2", 
      "intro": "introduction2", 
      "resps": "responsibilities2",
      "quals": "qualifications2"
    },
    ...
  ]
}
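
To work with the output, something along these lines could parse the JSON and check that each job carries the fields shown above. It is a sketch under assumptions: `load_jobs` is a hypothetical helper, the sample is made-up data in the documented shape, and the type of `total` is a guess.

```python
import json

# Field names taken from the structure shown above.
JOB_FIELDS = ("job_id", "title", "location", "intro", "resps", "quals")


def load_jobs(raw):
    """Parse the scraper's JSON string and return its list of job dicts."""
    data = json.loads(raw)
    jobs = data["jobs"]
    for job in jobs:
        missing = [field for field in JOB_FIELDS if field not in job]
        if missing:
            raise ValueError(f"job {job.get('job_id')!r} is missing {missing}")
    return jobs


# Made-up sample in the documented shape; real data comes from the scraper.
sample = json.dumps({
    "total": 1,  # assumption: the real type of "total" is not documented
    "jobs": [{
        "job_id": "id1",
        "title": "title1",
        "location": "location1",
        "intro": "introduction1",
        "resps": "responsibilities1",
        "quals": "qualifications1",
    }],
})

for job in load_jobs(sample):
    print(job["job_id"], job["title"], job["location"])
```

For a real run, read the contents of the generated JSON file and pass that string to `load_jobs` instead of `sample`.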