GeeksForGeeksScrapper Save

Scrapes g4g and creates PDF

Project README

GeeksForGeeksScrapper

Scrapes GeeksForGeeks and creates html & PDF for chosen category along with syntax highlighting for the code.

Screenshots

Example of articles of C category as html:

alt tag

Example of articles of C category as pdf:

alt tag

Installation

To use the scrapper, install the following:

$ sudo apt-get install wkhtmltopdf

Then create venv

$ virtualenv /path/to/g4g-env

Switch to venv

$ source /path/to/g4g-env/bin/activate

Now install BeautifulSoup as:

$ pip install beautifulsoup4

or via package manager as:

$ sudo apt-get install python-bs4

or for Python dependencies, you can just install via requirements.txt inside the virtual environment.

$ pip install -r requirements.txt

Run the G4G_Scrapper

$ python g4g.py

Choose the category you want to scrape from the menu and wait for the magic to happen :)

You can find the output as G4G_<category_name>.html and G4G_<category_name>.pdf in the same directory.

Disclaimer: This is strictly for educational purpose only. Author will not be liable for anything, what so ever; you do with this script.

Open Source Agenda is not affiliated with "GeeksForGeeksScrapper" Project. README Source: CuriousLearner/GeeksForGeeksScrapper
Stars
142
Open Issues
12
Last Commit
3 years ago
License
MIT

Open Source Agenda Badge

Open Source Agenda Rating