A simple web scraper that gets all articles from the guardian.
Sources
Dependencies
- axios v0.27.2
- cheerio v1.0.0-rc.11
- cors v2.8.5
- express v4.18.1
More details
- The app is configured to scrape the data from https://www.theguardian.com/uk
- The app runs on http://localhost:8000
- You can see the resulted json file on http://localhost:8000/results
- You can see the resulted html by opening the index.html file in the browser
Usage
- run npm run start in the ./simple-web-scraper folder