Datasets & Web Scraper IDE

Datasets

  • Request a Custom Dataset

    Introduction Bright Data's 'Request a Custom Dataset' feature offers a streamlined and tailored approach to data acquisition. This guide outlines the steps to request and obtain a custom dataset suited to specific needs.   Step-by...

  • New Marketplace Dataset API Feature: Request, Fetch, and Initiate Data Collections

    We are thrilled to introduce a new feature in our Marketplace Dataset API, which is designed to enrich your data acquisition process. This update allows for a more granular and streamlined way to request and manage your data collec...

  • Filtering a Dataset: How to Work with Each Function

    When working with a large dataset, filtering can be a powerful tool to help you focus on the most relevant information for your analysis. In this documentation, we'll walk you through how to use the following functions to filter a ...

Getting started with Web Scraper IDE

See all 19 articles

Quick Tour of Web scraper IDE

  • How to start using the data collection tool

    There are two ways to use the data collection tool : - Develop a self-managed collector on your own - Request a custom dataset        

  • Develop a self-managed collector

    To develop a custom collector using our Integrated development environment (IDE), you will need to insert an URL and start interacting with the development environment using Javascript language. Step 1 - Start from scratch You can ...

  • I requested that you build a new Data Collector. How can I confirm that someone is working on it?

    You'll receive an email that the developer is working on your new Data Collector, and you will be notified when your collector is ready. Status of the request can also be found on your dashboard : 

  • Data Collector Dashboard

    Any collector you create using a template or a custom collector will appear on your Data Collector dashboard. Dashboard overview : - Free trial : As part of the 7-day free trial, you’re entitled to 1,000-page loads- Update availab...

  • Dashboard - Collector action menu

    The collector action menu allows performing different actions with the collector. Initiate by API - start a data collection without having to enter the control panel Initiate manually - Bright Data's control panel makes it easy t...

  • Dashboard - Properties

    Maintainer of the collector :  Self-serve: collector is maintained by you Full-service: collector is maintained by Bright Data Developers Type of the collector : Search : The collector input is a keyword (i.e., iPhone) PDP : ...

See all 14 articles

Data Type & Delivery

See all 10 articles

Web scraper IDE - Coding Environment and Tutorials

  • IDE Best Practices

    Best practices Missing dead_page condition. When using navigate command, a 'dead_page' condition should be added to check if the page is not found. This will prevent automatic retries. While we automatically handle this when the r...

  • IDE Complete examples

    Complete examples Introduction The collect and parse commands have been removed. The data will be returned from parser code as an object or array, and it will be automatically saved to the output: // Interaction code // Old code n...

  • IDE Page

    A : See more examples - Examples of template code that our collector engineers built. B : Add another step (stage) - It is useful to add stages when you want to collect data from multiple pages. For example, in case you want to co...

  • Coding environment - IDE Interaction code

    These are all of the codes that you can do with the IDE input - Global object available to the interaction code. Provided by trigger input or next_stage() calls navigate(input.url); navigate - Navigate the browser session to a UR...

  • Coding environment - IDE Parser code

    These are all of the codes that you can do with the IDE input - Global variable available to the parser code let url = input.url; $ - An instance of cheerioFind more information on the cheerio website. $('#example').text() $('$ex...

  • Finding element selectors

    In order to target an element (to click it or pull text out), you need to specify the element with a CSS selector. A CSS selector can match one or more items on the page. Most commands in the interaction code will require a selecto...

See all 8 articles

Troubleshooting