evyatark/news-flask

news-flask

Python Flask web scraping news sites

This Flask application exposes a few services used by my Quarkus NewsStand application.

It is written in Python and scrapes news websites. It currently targets the Haaretz website as a first proof of concept (POC).

All services are stateless.

In addition, this code base contains some Python utility scripts that are not directly connected to these services.

The exposed endpoints:

Hello - used only to check manually that the server is up.

start-page - Processes the start page (currently hard-coded to start from "www.haaretz.co.il"). Returns a list of relative URLs as JSON.

scrape-single-page - Processes a single article page. Returns a JSON representation of ArticleDetails.

(ArticleDetails contains only the article's metadata, not its full content.)
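
The core of the two scraping endpoints described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code in this repository: the function names `extract_relative_urls` and `extract_article_details`, the fields of `ArticleDetails` (`url`, `title`, `description`), and the choice of `og:title` and `description` meta tags are all assumptions made for the example. The actual application may use a different HTML parser and different metadata fields, and wraps this kind of logic in Flask routes.

```python
import json
from dataclasses import dataclass, asdict
from html.parser import HTMLParser


@dataclass
class ArticleDetails:
    # Hypothetical fields; the real ArticleDetails may hold other metadata.
    url: str
    title: str
    description: str


class _LinkCollector(HTMLParser):
    """Collects site-relative href values from <a> tags, without duplicates."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Keep only links such as "/news/1"; skip absolute and "//host" links.
        if href and href.startswith("/") and not href.startswith("//"):
            if href not in self.links:
                self.links.append(href)


class _MetaCollector(HTMLParser):
    """Collects the <title> text and the content of <meta> tags."""

    def __init__(self):
        super().__init__()
        self.meta = {}
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            key = attrs.get("property") or attrs.get("name")
            if key and attrs.get("content") is not None:
                self.meta[key] = attrs["content"]

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_relative_urls(html):
    """What a start-page style service might return: relative URLs."""
    parser = _LinkCollector()
    parser.feed(html)
    return parser.links


def extract_article_details(url, html):
    """What a scrape-single-page style service might return: metadata only."""
    parser = _MetaCollector()
    parser.feed(html)
    return ArticleDetails(
        url=url,
        title=parser.meta.get("og:title", parser.title.strip()),
        description=parser.meta.get("description", ""),
    )


# Small in-memory samples, so the sketch runs without any network access.
start_html = (
    '<a href="/news/1">A</a><a href="https://x.example/">B</a>'
    '<a href="/news/1">dup</a><a href="/opinion/2">C</a>'
)
article_html = (
    '<html><head><title> T </title>'
    '<meta property="og:title" content="Headline">'
    '<meta name="description" content="Summary"></head></html>'
)
print(json.dumps(extract_relative_urls(start_html)))
print(json.dumps(asdict(extract_article_details("/news/1", article_html))))
```

Both functions take the HTML as an argument and keep no state between calls, which matches the stateless design noted above. Fetching the page is left out on purpose so the example stays runnable offline.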
