Skip to main content
CF

Learn Web Scraping with NodeJs in 2020 - The Crash Course

7h 9m 10s
English
Free

Learn Web Scraping with NodeJs in 2020 - The Crash Course is a 50-lesson 7 hours 9 minutes self-paced course by Udemy. Master Web Scraping with NodeJs and tools such as Puppeteer (by Google), Request, Cheerio, and NightmareJs.

Course facts

Lessons
50
Duration
7 hours 9 minutes
Level
All levels
Language
English
Updated
Instructor
Udemy
Price
Free

Master Web Scraping with NodeJs and tools such as Puppeteer (by Google), Request, Cheerio, and NightmareJs. Delve into the world of Web Scraping and Data Mining with NodeJs. Learn contemporary scraping methods using Puppeteer and direct NodeJs requests.

Course Highlights

Embark on a journey to enhance your web scraping skills. This course provides you with modern techniques to scrape websites using NodeJs and its popular libraries.

What You Will Learn

  • Create data scrapers from scratch using NodeJs
  • Identify and select the right tools for scraping various websites
  • Leverage top scraping tools for NodeJs to your advantage
  • Automate user interactions with NodeJs
  • Develop scrapers using Puppeteer by Google
  • Build scrapers utilizing the native Request & Cheerio libraries
  • Explore scraping techniques with NightmareJs

Requirements

  • Proficiency in JavaScript with ES6 syntax
  • Familiarity with CSS and jQuery selectors

Who Should Enroll

  • Complete beginners interested in learning data scraping with NodeJs
  • Individuals with basic knowledge seeking to elevate their skills

Course Structure and Benefits

This comprehensive crash course is designed for ease of understanding, catering to both absolute beginners and those with prior knowledge in the field.

  • Hands-on experience by working alongside the instructor with full access to all files and code samples
  • In-depth insights into problem-solving and best practices for building efficient scrapers
  • Comprehensive coverage that prepares you to confidently create your own web scrapers by the course's end

Course Overview

This course is not a trivial tutorial found online with basic examples. Instead, it dives into everything you need to consider when initiating the development of a web scraper with NodeJs. It includes problem-solving strategies and knowledge necessary to empower you to build your own scraper confidently.

Engage with real-world examples and tackle authentic challenges while learning how to build scrapers using NodeJs.

Who teaches Learn Web Scraping with NodeJs in 2020 - The Crash Course? Udemy

Udemy thumbnail

Udemy is the largest open marketplace for online courses on the internet. Founded in 2010 by Eren Bali, Oktay Caglar, and Gagan Biyani and headquartered in San Francisco, the company went public on the Nasdaq in 2021 under the ticker UDMY. The platform hosts well over two hundred thousand courses across software development, IT and cloud, data science, design, business, marketing, and creative skills, taught by tens of thousands of independent instructors. Roughly seventy million learners use it worldwide, and the corporate arm — Udemy Business — supplies a curated subset of that catalog to enterprise customers.

Because Udemy is a marketplace rather than a single editorial publisher, the catalog is uneven by design. The strongest material lives in the long-form, project-based courses authored by working engineers — full-stack JavaScript, React, Node.js, Python data science, AWS, Docker and Kubernetes, mobile development with Flutter and React Native, and cloud certification preparation. The CourseFlix listing under this source is the slice of that catalog that has been mirrored here for offline-friendly viewing, organized by topic and updated as new releases land. Pricing on Udemy itself swings dramatically with the site's near-permanent sales, which is why the platform is best treated as a deep reference catalog: pick instructors with strong reviews and a track record of updating their material rather than buying on the headline price alone.

What lessons are included in Learn Web Scraping with NodeJs in 2020 - The Crash Course?

  • Space or K: play or pause
  • J: rewind 10 seconds
  • L: forward 10 seconds
  • Left Arrow: rewind 5 seconds
  • Right Arrow: forward 5 seconds
  • Up Arrow: volume up
  • Down Arrow: volume down
  • M: mute or unmute
  • F: toggle fullscreen
  • T: toggle theater mode
  • I: toggle mini player
  • 0 to 9: seek to 0 to 90 percent of the video
  • Shift plus N: next video
  • Shift plus P: previous video
0:00 0:00
#Lesson TitleDuration
1Introduction 02:12
2Tools & Project Setup 03:30
3Writing a Simple IMDB Scraper 10:32
4Why & When to Choose Scraping 03:57
5The biggest "problem" with scraping 02:38
6Running with Terminal / CMD & Basics on VSCode Debugger 06:39
7The Request library / method 05:13
8The Browser Automation method 06:11
901 - How to spoof / add custom request headers 06:52
1002 - Deal with GZIP Compression enabled websites 04:45
1103 - Selectors & More Parsing 19:07
1204 - Multiple Movies & Saving JSON to File 08:00
1305 - Exporting data to CSV 06:49
1406 - Easiest method of getting the proper selectors 05:37
1507 - Downloading Images Locally 10:22
1608 - Promisifying callback based functions 09:13
1709 - Exposing and working with Custom Functions in Puppeteer 09:26
18Using a Proxy Server 04:44
19Status Codes & Error Handling 10:05
20Manual Login with CSRF Protection & Cookie Parsing 18:36
2101 - Introduction and inspecting the structures 08:29
2202 - Building the scraper 20:25
2303 - Parsing latest posts of an ig user 12:10
2401 - Installation, introduction and testing 04:20
2502 - Writing a simple automated task 09:05
2603 - Generating PDF's, Emulating Phone Views, Getting Title & Url of pages 07:37
2704 - Using Puppeteer to login on Instagram 11:36
2805 - Making Websites Load Much Faster with a simple trick 08:24
2906 - HTTP Basic Authentication 03:47
3007 - Ignoring HTTPS / SSL Errors & Changing the ViewPort 03:41
3108 - Learning how to use Proxies 02:56
3201 - How to login on Twitter 10:23
3302 - Refactoring code and improving it 07:16
3403 - How to post Tweets with Puppeteer 11:34
3504 - Scraping Twitter User Details 13:57
3605 - Scraping More User Details 15:46
3706 - Scraping User Tweets 14:10
3807 - Scraping More Tweet Details ( Likes, Replies..etc ) 14:11
3908 - Handling The Infinite Loader of the Tweets 08:29
4001 - Creating the Module Structure 06:44
4102 - Parsing Product Details 11:29
4203 - More Details and Tricks on the Product Details page of Amazon 07:01
4301 - Installation, introduction and testing 06:29
4402 - Talking and testing more about NightmareJs 07:16
4503 - Scraping Ycombinator News with Vanilla Javascript 13:03
4604 - Scraping Ycombinator News - More details 10:17
4705 - Scraping Ycombinator News - Modulate & Pagination 11:41
4806 - Handling Downloads 06:07
4907 - Injecting Custom Javascript 05:24
50Congratulations! 00:55

What courses are similar to Learn Web Scraping with NodeJs in 2020 - The Crash Course?

More courses by Udemy

Frequently asked questions

What prerequisites do I need before enrolling in this course?
Before enrolling, you should have proficiency in JavaScript, specifically with ES6 syntax. Additionally, familiarity with CSS and jQuery selectors is required. These skills are essential as the course involves using these technologies when building web scrapers and selecting elements on web pages.
What projects will I build during the course?
The course includes several hands-on projects such as creating a simple IMDB scraper, scraping Twitter user details, and building a scraper for Instagram posts. You will also work on scraping product details from Amazon and news from Ycombinator using various tools like Puppeteer, Request, and Cheerio.
Who is the target audience for this course?
The course is designed for both complete beginners who are interested in learning data scraping with NodeJs and individuals who already have basic knowledge and are looking to elevate their skills in this field.
How does the depth and scope of this course compare to other web scraping courses?
This course offers a practical and modern approach to web scraping, focusing on tools such as Puppeteer, Request, and Cheerio. It provides 50 lessons that cover a range of topics from basic scraping techniques to more complex automation tasks, making it suitable for beginners as well as those with some prior experience.
Which specific tools and libraries are covered in the course?
The course covers several popular NodeJs libraries and tools for web scraping, including Puppeteer by Google, the Request library, Cheerio, and NightmareJs. These tools are used to automate user interactions and scrape data efficiently from various websites.
What topics are not covered in this course?
The course does not cover advanced data analysis or machine learning techniques that might be used to process or analyze the data obtained from web scraping. It focuses strictly on the methods and tools necessary for scraping and does not delve into post-scraping data processing.
What is the expected time commitment for completing the course?
The course comprises 50 lessons, although the total runtime is not specified. It is designed for a comprehensive learning experience, providing hands-on projects and in-depth insights. Students should expect to invest a significant amount of time to fully engage with the materials and exercises provided.