Skip to main content
CF

Scraping the Web for Fun and Profit

6h 33m 9s
English
Paid

This course will teach you how to use web scraping to create profitable projects. You will learn the basics of scraping using Python libraries (such as Beautiful Soup), methods to speed up scripts, bypassing blocks, scraping protected sites, and working with unofficial APIs. The course includes over 10 videos with real examples, source code, and materials that will be added in the future. The course is flexible and allows you to select lessons based on your needs.

About the Author: Jakob Greenfeld

Jakob Greenfeld thumbnail

Jakob Greenfeld is a German entrepreneur and educator focused on the practical craft of web scraping for solo founders and indie hackers — using scraped data as input to product ideas, market research, and growth experiments.

His CourseFlix listing carries Scraping the Web for Fun and Profit — covering the legal-and-ethical landscape, the Python tooling (BeautifulSoup, Playwright, Scrapy), proxy rotation, and the patterns for building scraping pipelines that survive site changes.

Material is paid and aimed at developer-founders building scraping pipelines for product or business purposes. For broader Python content, see CourseFlix's Python category page.

Watch Online 16 lessons

This is a demo lesson (10:00 remaining)

You can watch up to 10 minutes for free. Subscribe to unlock all 16 lessons in this course and access 10,000+ hours of premium content across all courses.

View Pricing
0:00
/
#1: Quick Introduction and Overview
All Course Lessons (16)
#Lesson TitleDurationAccess
1
Quick Introduction and Overview Demo
07:05
2
What I learned about selling data products
18:50
3
Bypassing Anti-Scraping Measures - Headers, Rotating Proxies, Scraping APIs, Javascript Rendering
24:52
4
Scraping RallyRd - Advanced Screen Scraping w- Selenium
26:17
5
Scraping RallyRd - Advanced Scraping of Data Behind a Login
13:43
6
Scraping Instagram Leads via Duck Duck Go
19:09
7
Scraping Shopify Sites, Reddit, Indeed, Upwork - Alternative Formats JSON, RSS
27:21
8
Scraping Messari - GraphQL Scraping and Data Flattening
14:17
9
Scraping YC Companies and Cryptocurrencies - Using Algolia
26:15
10
Scraping Udemy Courses - Leveraging Undocumented Internal APIs
35:48
11
Scraping SearchMySite.com - Post Requests and the Curl Convert Trick
18:00
12
Scraping all Pitchbook Profiles - Method 3 Sitemap Scraping
20:46
13
Scraping all Pitchbook Profiles - Method 2 Search Engine Scraping
21:45
14
Scraping all Pitchbook Profiles - Approach 1 Brute Force
30:03
15
Scraping Goodreads Part 2 - (try-except, iterating over pages)
48:40
16
Scraping GoodReads Quotes - (Requests and BeautifulSoup Basics)
40:18
Unlock unlimited learning

Get instant access to all 15 lessons in this course, plus thousands of other premium courses. One subscription, unlimited knowledge.

Learn more about subscription

Related courses

Frequently asked questions

What prerequisites should I have before enrolling in this course?
A basic understanding of Python is recommended before enrolling in this course, as it focuses on using Python libraries like Beautiful Soup for web scraping. Familiarity with HTML and web technologies will also be beneficial, as the course involves scraping data from various websites and working with APIs.
What projects will I build in this course?
Throughout the course, you will work on several projects that involve scraping data from different platforms. Examples include scraping Instagram leads, Shopify sites, Reddit, Indeed, and Upwork, as well as advanced screen scraping with Selenium on RallyRd. You will also learn to scrape Udemy courses using undocumented internal APIs and scrape GoodReads quotes using Beautiful Soup.
Who is the target audience for this course?
This course is designed for individuals interested in using web scraping to create profitable projects. It is suitable for those who want to learn how to collect data from various web sources, bypass anti-scraping measures, and work with unofficial APIs. Entrepreneurs, data enthusiasts, and developers looking to expand their skill set would benefit from this course.
How does this course compare in scope to other web scraping courses?
This course covers a wide range of web scraping techniques, including basic methods with Python libraries and advanced practices like bypassing anti-scraping measures, scraping data behind logins, and using GraphQL. Unlike some courses that focus on a single tool or method, this course provides varied examples, allowing learners to apply techniques across multiple platforms and scenarios.
What specific tools or platforms will I learn to use?
The course focuses on several key tools and platforms. You will learn to use Python with libraries such as Beautiful Soup for basic scraping tasks. Selenium is also covered for advanced screen scraping. Additionally, you will explore the use of GraphQL for data flattening and work with unofficial APIs for platforms like Udemy and SearchMySite.com.
What topics are not covered in this course?
The course does not cover data cleaning, transformation, or analysis in detail. While you will learn to collect data from various sources, the course focuses primarily on scraping techniques rather than subsequent data processing or integration into larger data systems. Additionally, the course does not delve into ethical considerations or legal aspects of web scraping.
How much time should I expect to commit to this course?
The course consists of over 10 video lessons, but the total runtime is not specified. Given the number of lessons, you should expect to spend several hours completing the course content. This includes watching videos, reviewing source code, and practicing the techniques through the provided examples and exercises. The course is flexible, allowing you to select lessons based on your specific needs.