Skip to content

Latest commit

 

History

History
23 lines (14 loc) · 646 Bytes

README.md

File metadata and controls

23 lines (14 loc) · 646 Bytes

Download data from Sakhr's Arabic Magazines Archive

A python script to download magazine scans from Sakhr's archive : http://archivebeta.sakhrit.com/DefaultArchive.aspx.

Usage

To use this script, first edit the start and end values in the file to your convenience then run the following command :

python ./sakhr-archive-scraper.py

TODO

  • Save html in the folder with the images!
  • Record where the script stopped to resume.
  • Extract magazine name and data from html.
  • More configuration variables, like retry numbers, etc.
  • Add a sleep call between iterations?