Data Scraping and Mining with Python

317 Learners


Skills Covered

  • Real-World Scraping
  • Project Implementation
  • Requests Library
  • Scraping Theory
  • Sync vs Async

Course Curriculum

Course Content

  • Data Scraping and Mining with Python

    • Lesson 01: Introduction

      15:34
      • 1.01 Why Data Scraping
        02:35
      • 1.02 Applications of Data Scraping
        07:09
      • 1.03 Introduction to the Instructor
        00:40
      • 1.04 Introduction to the Course, Scraping, and Tools
        01:28
      • 1.05 Projects Overview
        03:42
    • Lesson 02: Requests

      01:28:50
      • 2.01 Introduction to Python Requests
        03:57
      • 2.02 Hands-On with Requests
        08:28
      • 2.03 Extracting Quotes Manually
        10:05
      • 2.04 Quiz (Extracting Authors)
        00:40
      • 2.05 Solution (Extracting Authors)
        06:03
      • 2.06 Pagination
        09:46
      • 2.07 Quiz (Extracting Author and Quotes)
        00:58
      • 2.08 Solution 01 (Extracting Author and Quotes)
        06:27
      • 2.09 Solution 02 (Extracting Author and Quotes)
        05:44
      • 2.10 Ajax Requests
        06:36
      • 2.11 Ajax Requests for Cricket Information
        08:25
      • 2.12 Ajax Requests Pagination
        03:53
      • 2.13 Quiz (Extracting Top Stats from Cricket Information)
        01:22
      • 2.14 Solution 01 (Extracting Top Stats from Cricket Information)
        07:16
      • 2.15 Solution 02 (Extracting Top Stats from Cricket Information)
        09:10
    • Lesson 03: Beautiful Soup 4 (BS4)

      02:05:42
      • 3.01 Introduction to BS4
        03:02
      • 3.02 Quiz (Difference Between Requests and BS4)
        00:25
      • 3.03 Solution (Difference Between Requests and BS4)
        01:04
      • 3.04 Hands-On with BS4
        05:54
      • 3.05 Extracting Data from Tree
        08:50
      • 3.06 Extracting Quotes from the Website
        07:33
      • 3.07 Quiz (Extracting Author Names)
        00:38
      • 3.08 Solution (Extracting Author Names)
        05:28
      • 3.09 Attributes of Tags in BS4
        09:10
      • 3.10 Multi-Valued Attributes of Tags in BS4
        03:55
      • 3.11 Scraping Movie Names from IMDB
        19:31
      • 3.12 Quiz (Getting the Ratings, Year, Name of the Movie)
        00:55
      • 3.13 Solution 01 (Getting the Ratings, Year, Name of the Movie)
        07:01
      • 3.14 Solution 02 (Getting the Ratings, Year, Name of the Movie)
        07:08
      • 3.15 Scraping Time, Genre, and Release Date from IMDB 01
        05:08
      • 3.16 Scraping Time, Genre, and Release Date from IMDB 02
        08:25
      • 3.17 Combining Two Requests Data for IMDB
        04:35
      • 3.18 Movies Recommender System (Creating Movie URL)
        08:57
      • 3.19 Movies Recommender System (Creating Director URL)
        05:36
      • 3.20 Movies Recommender System using BS4 (Getting Top 4 Movies)
        08:01
      • 3.21 Movies Recommender System using BS4 (Merge All Requests Together)
        04:26
    • Lesson 04: CSS Selectors

      01:45:45
      • 4.01 Introduction to CSS Selectors
        02:49
      • 4.02 CSS Selectors Hands-On (Tags)
        05:17
      • 4.03 Quiz (Tags)
        01:08
      • 4.04 Solution (Tags)
        02:09
      • 4.05 CSS Selectors Hands-On (Descendants, ID, Class)
        07:04
      • 4.06 Quiz (Descendants)
        00:49
      • 4.07 Solution (Descendants)
        01:44
      • 4.08 Quiz (ID)
        00:44
      • 4.09 Solution (ID)
        01:40
      • 4.10 Quiz (Class)
        00:59
      • 4.11 Solution (Class)
        03:09
      • 4.12 CSS Selectors Hands-On (Nested Tags, ID Tags, Class Tags)
        04:32
      • 4.13 Quiz (Class with Tag)
        00:40
      • 4.14 Solution (Class with Tag)
        02:22
      • 4.15 CSS Selectors Hands-On (Comma Separator, Universal Selectors)
        06:31
      • 4.16 Quiz (Combining Two Selectors)
        00:46
      • 4.17 Solution (Combining Two Selectors)
        02:40
      • 4.18 CSS Selectors Hands-On (Sibling Notations and Direct Child)
        07:24
      • 4.19 Quiz (Adjacent Sibling)
        00:45
      • 4.20 Solution (Adjacent Sibling)
        02:31
      • 4.21 Quiz (General Sibling)
        00:57
      • 4.22 Solution (General Sibling)
        02:55
      • 4.23 CSS Selectors Hands-On (Child Selectors)
        07:19
      • 4.24 Quiz (First Child)
        00:40
      • 4.25 Solution (First Child)
        03:44
      • 4.26 Quiz (Only Child)
        00:40
      • 4.27 Solution (Only Child)
        02:53
      • 4.28 Quiz (Last Child)
        00:44
      • 4.29 Solution (Last Child)
        03:05
      • 4.30 CSS Selectors Hands-On (Negations, Attributes)
        06:36
      • 4.31 Quiz (Negation)
        00:41
      • 4.32 Solution (Negation)
        01:59
      • 4.33 CSS Selectors Hands-On (Attributes, Attribute Values)
        03:51
      • 4.34 Quiz (Attribute Values)
        00:40
      • 4.35 Solution (Attribute Values)
        03:21
      • 4.36 CSS Selectors Hands-On (Attributes Wild Cards Values)
        06:25
      • 4.37 Quiz (Attributes Wild Card)
        00:50
      • 4.38 Solution (Attributes Wild Card)
        02:42
    • Lesson 05: Scrapy

      05:01:57
      • 5.01 Introduction to Scrapy
        04:10
      • 5.02 Comparison of Scrapy and Requests
        03:40
      • 5.03 Scrapy at a Glance Documentation
        08:31
      • 5.04 Getting Started with Scrapy
        11:05
      • 5.05 Running Documentation Spider 1
        03:25
      • 5.06 Running Documentation Spider 2
        12:01
      • 5.07 Writing a Spider from Scratch
        07:24
      • 5.08 Understanding the Response (URL, Status)
        04:12
      • 5.09 Understanding the Response (Headers)
        07:09
      • 5.10 Understanding the Response (Values in Headers)
        06:51
      • 5.11 Understanding the Response (Body)
        06:04
      • 5.12 Understanding the Response (Request)
        04:41
      • 5.13 Understanding the Response (Meta)
        08:29
      • 5.14 Understanding the Response (Flags, Certificate, ip_address, Copy)
        05:17
      • 5.15 Understanding the Response (replace, urljoin, follow, follow_all)
        08:07
      • 5.16 Response CSS and Scrapy Shell
        09:24
      • 5.17 Extracting Quotes
        05:47
      • 5.18 Understanding Nested Selectors
        10:02
      • 5.19 Extracting the Author and Quotes
        10:05
      • 5.20 Checking for Next Page
        07:37
      • 5.21 Checking for Next Page in Spider
        05:36
      • 5.22 Checking for Next Page URL
        08:16
      • 5.23 Scraping Quotes from Next Pages
        11:07
      • 5.24 Exporting Extracted Data
        03:24
      • 5.25 Quiz (Get the Tags)
        00:58
      • 5.26 Solution (Get the Tags)
        07:20
      • 5.27 Next Website
        01:27
      • 5.28 CSS Selectors for Movie Names and URLs
        12:29
      • 5.29 Combined CSS Selectors for Movie Names and URLs
        09:41
      • 5.30 Send Request to the Film Information Page
        08:16
      • 5.31 Merge Data from Two Callbacks
        10:27
      • 5.32 Extracting Movie Duration and Genres
        11:12
      • 5.33 Exporting the Extracted Data
        08:27
      • 5.34 Quiz (Extracting the Year)
        00:58
      • 5.35 Solution (Extracting the Year)
        14:21
      • 5.36 Getting Director Name and URL
        07:15
      • 5.37 Getting Top Four Movies of Directors
        05:12
      • 5.38 Extracting Data
        16:25
      • 5.39 Extracting Data Anomaly (CSS Selector)
        07:14
      • 5.40 Extracting Data Anomaly (dont_filter Flag)
        07:51
    • Lesson 06: Scrapy Project

      01:36:55
      • 6.01 Hugo Boss Website for Scraping
        02:30
      • 6.02 Understanding Site Structure
        07:11
      • 6.03 Writing CSS Selectors for Listings
        07:43
      • 6.04 Listings in Scrapy Shell
        04:21
      • 6.05 Sending Request to Listings URLs
        07:23
      • 6.06 Writing CSS for Getting the Product from the Listings
        11:03
      • 6.07 Extracting Products URL from the Listings
        05:03
      • 6.08 Sending Requests to Products of the Listings
        16:56
      • 6.09 Writing CSS for Getting the Product Information
        07:46
      • 6.10 Getting the Bigger Images of the Product
        13:57
      • 6.11 Adding Pagination to Spider and Running It
        09:41
      • 6.12 Output of the Spider
        03:21
    • Lesson 07: Selenium

      01:11:13
      • 7.01 Introduction to Selenium
        02:12
      • 7.02 Getting Started with Selenium
        03:36
      • 7.03 Configuring the Webdriver
        03:41
      • 7.04 Extracting Quotes
        10:16
      • 7.05 Extracting Quotes and Author Names
        07:17
      • 7.06 Quiz (Extracting Quotes)
        00:41
      • 7.07 Solution (Extracting Quotes)
        07:22
      • 7.08 Clicking on Button
        05:01
      • 7.09 Pagination and Extracting Data
        08:07
      • 7.10 Exception Handling for Unavailable Elements
        05:41
      • 7.11 Navigating the Website for Login
        09:37
      • 7.12 Quiz (Log In and Extract Quote)
        00:44
      • 7.13 Solution (Log In and Extract Quote)
        06:58
    • Lesson 08: Project Selenium

      21:41
      • 8.01 Overview of Project
        01:28
      • 8.02 Closing the Cookie Button
        03:22
      • 8.03 Setting the Language for Translation
        05:39
      • 8.04 Sending the Text for Translation
        03:46
      • 8.05 Downloading the Translation
        03:55
      • 8.06 Reading Data from File for Translation
        03:31

Why Join this Program

  • Develop skills for real career growth: a cutting-edge curriculum designed with guidance from industry and academia to develop job-ready skills.
  • Learn from experts active in their field, not out-of-touch trainers: leading practitioners who bring current best practices and case studies to sessions that fit into your work schedule.
  • Learn by working on real-world problems: capstone projects involving real-world data sets, with virtual labs for hands-on learning.
  • Structured guidance ensuring learning never stops: 24x7 learning support from mentors and a community of like-minded peers to resolve any conceptual doubts.