Express JS

A simple web scraper that fetches all articles from The Guardian.

Sources

Dependencies

  • axios v0.27.2
  • cheerio v1.0.0-rc.11
  • cors v2.8.5
  • express v4.18.1

More details

  • The app is configured to scrape the data from https://www.theguardian.com/uk
  • The app runs on http://localhost:8000
  • You can see the resulting JSON at http://localhost:8000/results
  • You can see the resulting HTML by opening the index.html file in a browser
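
The setup described above could look roughly like the following sketch. It is not the project's actual code: the CSS selector, the { title, url } response shape, the toArticle helper and the --serve flag are all assumptions made for illustration. Only the source URL, the port, the /results route and the four dependencies come from this README.

```javascript
// Sketch only: selector, response shape and the --serve flag are assumptions.
const SOURCE_URL = "https://www.theguardian.com/uk"; // from this README
const PORT = 8000; // from this README
// Hypothetical selector; the real page markup may differ and changes over time.
const ARTICLE_SELECTOR = "a[data-link-name='article']";

// Pure helper (hypothetical): turn one scraped link into a { title, url } record.
// Collapses whitespace in the title and resolves relative links against SOURCE_URL.
function toArticle(rawTitle, href) {
  const title = rawTitle.replace(/\s+/g, " ").trim();
  const url = new URL(href, SOURCE_URL).href;
  return { title, url };
}

function startServer() {
  // Dependencies are loaded here so the helper above works without them installed.
  const axios = require("axios");
  const cheerio = require("cheerio");
  const cors = require("cors");
  const express = require("express");

  const app = express();
  app.use(cors()); // lets index.html, opened from disk, call the API

  app.get("/results", async (req, res) => {
    try {
      // Download the page and parse it into a queryable document.
      const response = await axios.get(SOURCE_URL);
      const $ = cheerio.load(response.data);

      const articles = [];
      $(ARTICLE_SELECTOR).each((index, element) => {
        const href = $(element).attr("href");
        if (href) articles.push(toArticle($(element).text(), href));
      });

      res.json(articles);
    } catch (error) {
      // Express 4 does not catch rejected promises in handlers, so report them here.
      res.status(500).json({ error: error.message });
    }
  });

  app.listen(PORT, () => console.log(`Listening on http://localhost:${PORT}`));
}

// Only start the server when explicitly asked to (flag is part of this sketch).
if (process.argv.includes("--serve")) startServer();
```

Saved under a hypothetical name such as scraper.js, this sketch would start with node scraper.js --serve. The project itself is started with its own npm script, as described under Usage.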

Usage

  • Run npm run start in the ./simple-web-scraper folder