scrapeByRegex

Overview

Scrape anything you want using Regex.

Video Example

Arguments

  • url: The URL from you want to scrape

  • regex: Regex that will select a part of the HTML of the URL

  • removeTag: Set to True if you want to remove the HTML tags

Expected Output

  • String: The HTML tag with its content or just the inner content.

Text example

scrapeByRegex("https://unlimitedsheets.com/", "<h1.*<\/h1>", true)
// Unlimited Sheets

scrapeByRegex("https://unlimitedsheets.com/", "<h1.*<\/h1>", false)
// <h1 class="site-title">Unlimited Sheets</h1>

Use cases

  • Scrape any site directly from Google Sheets using RegEx

Last updated