
Scrape webpage data using AI

Get started with Serper in Cargo

Overview


When enrichment providers or LinkedIn don't have the information you need on hard-to-enrich company segments (e.g. brick-and-mortar businesses), public data can often be a goldmine, especially depending on the region you're targeting.

We paired up with JetHR to show you how to use Cargo to find and use publicly available information about their target customers in the European market. By leveraging web scraping and AI, they used this workflow to fill in the gaps in their CRM.



Note

All integrations mentioned in this guide require an associated connector to be set up in your Cargo workspace. Some integrations are eligible for use with Cargo credits. See the documentation for instructions on setting up connectors and using Cargo credits.



Get started with web scraping and AI in Cargo



Step 1

Set up the input variables for the workflow

The workflow's inputs are set up in the variables node at the beginning of the workflow. This node defines the parameters that are passed as inputs to the rest of the nodes.


To power this workflow, the following variables are needed:

  • website: The website of the company you're targeting
  • hubspotCompanyID: The ID of the company in your CRM
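In Cargo these inputs are configured in the variables node itself. As a rough illustration of what the two variables carry, here is a minimal Python sketch. The `WorkflowInputs` class and both helper functions are hypothetical names invented for this example, and the normalization rule (lowercase hostname without `www.`) is an assumption, not something Cargo requires.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class WorkflowInputs:
    """The two inputs named in this guide; the class itself is hypothetical."""
    website: str
    hubspotCompanyID: str


def normalize_website(raw: str) -> str:
    """Reduce a URL or bare domain to a lowercase hostname without 'www.'."""
    candidate = raw.strip()
    # urlparse only fills in the host when a scheme is present.
    if "://" not in candidate:
        candidate = "https://" + candidate
    host = (urlparse(candidate).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def make_inputs(website: str, hubspot_company_id) -> WorkflowInputs:
    """Validate and normalize the inputs before they reach later steps."""
    host = normalize_website(website)
    company_id = str(hubspot_company_id).strip()
    if not host:
        raise ValueError("website must contain a hostname")
    if not company_id:
        raise ValueError("hubspotCompanyID must not be empty")
    return WorkflowInputs(website=host, hubspotCompanyID=company_id)
```

Normalizing the website once at the start means every later step sees the same value, whichever form the CRM stored it in.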


Step 2

Perform a Google search with the Serper node

Use the Serper node to simulate a Google search. This step helps you gather initial links for public information about the company segments you are targeting. Make sure to refine your search terms to get the most relevant results.
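The Serper node handles the request for you. To make the step concrete, the sketch below shows roughly what such a search involves outside Cargo. The endpoint URL, the `X-API-KEY` header, and the `organic` / `link` keys in the response reflect our understanding of Serper's public API and should be checked against its documentation. The function names are hypothetical.

```python
import json
import urllib.request

SERPER_URL = "https://google.serper.dev/search"  # assumed endpoint


def build_query(website: str, extra_terms: list[str]) -> str:
    """Combine the company website with refining terms into one query string."""
    terms = [website.strip()] + [t.strip() for t in extra_terms if t.strip()]
    return " ".join(terms)


def extract_links(response: dict, limit: int = 5) -> list[str]:
    """Collect unique result URLs, assuming results sit under an 'organic' key."""
    links: list[str] = []
    for result in response.get("organic", []):
        if len(links) >= limit:
            break
        link = result.get("link")
        if link and link not in links:
            links.append(link)
    return links


def search(query: str, api_key: str) -> dict:
    """POST the query to the search API and return the decoded JSON body."""
    request = urllib.request.Request(
        SERPER_URL,
        data=json.dumps({"q": query}).encode("utf-8"),
        headers={"X-API-KEY": api_key, "Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as reply:
        return json.load(reply)
```

Refining the search terms then amounts to changing `extra_terms`, for example adding a city name or the word "contact" to the company website.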



Step 3

Scrape web pages for relevant information

Once you have the search results, use a web scraping node to parse the retrieved links. Capture the HTML content from these pages, focusing on company descriptions and contact details.
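Cargo's scraping node returns the page content directly. As an illustration of the underlying idea, the sketch below fetches a page and reduces its HTML to visible text using only Python's standard library. The class and function names, the browser-like `User-Agent` value, and the 8,000-character cap are assumptions made for this example.

```python
import urllib.request
from html.parser import HTMLParser


class VisibleTextParser(HTMLParser):
    """Collect text nodes, skipping the contents of script and style tags."""

    SKIPPED = {"script", "style", "noscript"}

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0
        self.chunks: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = " ".join(data.split())  # collapse runs of whitespace
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def html_to_text(html: str, max_chars: int = 8000) -> str:
    """Return the page's visible text, truncated to max_chars."""
    parser = VisibleTextParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)[:max_chars]


def fetch_html(url: str) -> str:
    """Download a page and decode it with the charset the server declares."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=30) as reply:
        charset = reply.headers.get_content_charset() or "utf-8"
        return reply.read().decode(charset, errors="replace")
```

Stripping scripts and styles before the AI step keeps the input short and focused on the text a visitor would actually read, such as company descriptions and contact details.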



Step 4

Parse HTML content with AI

Next, use an OpenAI node to parse the HTML content and answer specific questions. This step extracts structured information from the unstructured web data. Be sure to configure the AI node to handle the text effectively, pulling out key details such as company names, domains, and relevant contact information.
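In Cargo, the prompt is configured in the OpenAI node. The sketch below illustrates one way to phrase such an extraction and to read the answer back as structured data. It deliberately does not call any specific model API: `ask_model` is a hypothetical callable that takes a prompt and returns the model's text reply, and the field names in `FIELDS` are assumptions chosen to match the details mentioned above.

```python
import json
from typing import Callable

# Hypothetical field names; adjust to the questions you want answered.
FIELDS = ("company_name", "domain", "contact_email", "contact_phone")


def build_prompt(page_text: str) -> str:
    """Ask for a JSON object with a fixed set of keys."""
    keys = ", ".join(FIELDS)
    return (
        "Extract the following fields from the web page text below. "
        f"Reply with a single JSON object with exactly these keys: {keys}. "
        "Use null for anything the page does not state.\n\n"
        f"Page text:\n{page_text}"
    )


def parse_reply(reply: str) -> dict:
    """Parse the outermost JSON object, ignoring any text around it."""
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end < start:
        raise ValueError("model reply contains no JSON object")
    data = json.loads(reply[start:end + 1])
    # Keep only the expected keys; missing ones become None.
    return {field: data.get(field) for field in FIELDS}


def extract_company_info(page_text: str, ask_model: Callable[[str], str]) -> dict:
    """Run one extraction: build the prompt, query the model, parse the reply."""
    return parse_reply(ask_model(build_prompt(page_text)))
```

Asking for a fixed set of keys and filling missing ones with `None` gives every page the same record shape, which simplifies the mapping in the next step.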



Step 5

Create arrays for CRM mapping and further enrichment

Finally, use a variables node to create arrays from the parsed information. These arrays can then be mapped to your CRM or used for further enrichment processes. This step ensures that the data is organized and ready for integration into your existing systems.
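As a rough picture of what "creating arrays" means here, the sketch below merges the per-page records from the previous step into one de-duplicated list per field, keyed by the CRM company ID. The function name and the record shape are assumptions for illustration. In Cargo this is done in the variables node, not in Python.

```python
def build_arrays(hubspot_company_id: str, records: list[dict]) -> dict:
    """Turn per-page records into one de-duplicated array per field."""
    arrays: dict[str, list] = {}
    for record in records:
        for field, value in record.items():
            bucket = arrays.setdefault(field, [])
            # Skip empty answers and values already collected.
            if value is not None and value not in bucket:
                bucket.append(value)
    return {**arrays, "hubspotCompanyID": hubspot_company_id}
```

For example, two pages that list the same email address contribute it once, so the resulting array can be mapped to a CRM property without further cleanup.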



By following these steps, you can gather public information efficiently and use it to enrich your company data, making your outreach and engagement efforts more effective.

Pietro
Growth Lead@JetHR
Category: Web Scraping
Documentation: docs.getcargo.io