The program aimed to extract data on the top 1000 schools based on UTBK scores in 2022 using Scrapy. The targeted website was a static one, and the data was not loaded using JavaScript. Therefore, Scrapy was an appropriate choice due to its efficiency and speed in handling static data on websites. The program extracted relevant data such as the schools' names, locations, UTBK scores, and other relevant information. The extracted data was useful for analyzing and evaluating the schools' academic performance and ranking. The data was saved in CSV format for further processing and analysis.
URL: https://top-1000-sekolah.ltmpt.ac.id
Scrapy Documentation https://scrapy.org/
pip install scrapy
pip3 install scrapy
scrapy runspider scraper.py
scrapy runspider scraper.py -o data.csv
scrapy runspider scraper.py -o data.json