An Introduction to web scraping using Python
Manoj Pandey (~manojpandey)
Web scraping is a technique for gathering data from web pages. You could revisit your favorite website every time it updates to check for new information. Or you could write a web scraper to do it for you!
Besides looking at how websites are put together, we will also discuss the ethics of scraping. What is legal? How can you be a friendly scraper, so that the administrator of the website you are scraping won’t try to shut you down?
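To give a flavour of what a "friendly" scraper might look like, here is a minimal sketch using only the Python standard library. It is not taken from the talk itself: the user agent string, the sample robots.txt rules, the sample HTML, and the helper names `LinkCollector`, `allowed_links`, and `crawl_delay` are all assumptions made up for illustration. The idea is to extract links from a page, keep only those the site's robots.txt permits, and read the delay the site asks for between requests.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser

# Hypothetical values for illustration only; none of these come from the talk.
USER_AGENT = "friendly-scraper-demo"
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 2
"""
SAMPLE_HTML = """\
<html><body>
<a href="/news/1">First story</a>
<a href="/private/admin">Admin</a>
<a>No href here</a>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag that has one."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def load_rules(robots_txt):
    """Parse robots.txt text that has already been downloaded."""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return rules


def allowed_links(html, robots_txt, user_agent=USER_AGENT):
    """Return the links in `html` that robots.txt lets `user_agent` fetch."""
    rules = load_rules(robots_txt)
    collector = LinkCollector()
    collector.feed(html)
    return [link for link in collector.links if rules.can_fetch(user_agent, link)]


def crawl_delay(robots_txt, user_agent=USER_AGENT, default=1.0):
    """Seconds to wait between requests, falling back to `default`."""
    delay = load_rules(robots_txt).crawl_delay(user_agent)
    return default if delay is None else float(delay)


if __name__ == "__main__":
    print(allowed_links(SAMPLE_HTML, ROBOTS_TXT))
    print(crawl_delay(ROBOTS_TXT))
```

In a real scraper you would download the page and the robots.txt file over the network and sleep for the crawl delay between requests. The sketch works on in-memory strings so that it can be run without touching any website.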
Prerequisites:
- Interest in building something
- Basic Python programming knowledge
- Basic HTML knowledge
Manoj is currently a Computer Science sophomore studying in New Delhi, India. He is passionate about learning new things, mentoring others, and tinkering with the latest technology. He has an ardent interest in Machine Learning and Human-Computer Interaction, and is currently working as a researcher with Stanford's HCI research group.
Recently, he organised his college's first hackathon: [email protected]. Since his first semester, he has frequently given open talks at his college on competitive programming, Python programming, general web development, version control systems, and open source tools and libraries.
Besides code, he loves music and has a beautiful Spotify playlist. Feel free to ask for the link ;)