This topic explains how to implement web scraping and python web development. Web scraping topics such as scrapy, beautiful soup, and others will be covered. A case study based on a Malaysian website.
Don't forget to hit the :star: if you like this repo.
The information on this Github is part of the materials for the subject High Performance Data Processing (SECP3133). This folder contains general big data information as well as big data case studies using Malaysian datasets. This case study was created by a Bachelor of Computer Science (Data Engineering), Universiti Teknologi Malaysia student.
Team | Library | Website | GitHub |
---|---|---|---|
Group 10 | Beautiful soup | StudyMalaysia.com | |
High Five | Beautiful soup | EduSpiral Consultant Services | |
QwQ | Beautiful soup | States and federal territories of Malaysia | |
SDS | Scrapy | Book Depository | |
BigMac | Scrapy | CompAsia.com | |
SIX | Scrapy | bukukita.com | |
AdMiPeQa | Selenium | Lazada | |
SamVerse | Selenium | Malaysia General Election (GE-15) | |
Group 9 | Selenium | Lazada Shopee | |
No Name | Requests | Puma: sneakers | |
Quad | Lxml | Jobstreet.com |
Please create an Issue for any improvements, suggestions or errors in the content.
You can also contact me using Linkedin for any other queries or feedback.