
At the beginning of the automobile era, Michelin, a tire company, created a travel guide that included a restaurant guide. Through the years, Michelin stars have become very prestigious due to their high standards and very strict anonymous testers. Gaining just one can change a chef's life; losing one, however, can change it as well. Inspired by this Reddit post, my initial intention was to collect restaurant data from the official Michelin Guide (in CSV file format) so that anyone can map Michelin Guide restaurants from all around the world on Google My Maps (see an example).

There is more than one way you can tell Octoparse to open a webpage in the built-in browser. Let's say we'd like to scrape a webpage on eBay.

The search bar is good for searching relevant scraping templates, or for loading a webpage for a new task when you input a specific webpage URL.
- Copy and paste the target page URL into the search bar, then click Start.
- A new task will be generated automatically.

1.2 Open a Webpage Using the Side Navigation Menu
- Click on the + New button on the sidebar menu, then select Advanced Mode.
- Paste the URL into the Website box and Save to start.
- A "Go to Web Page" action will be generated automatically in the workflow.

1.3 Open a Webpage by Adding a Step in the Workflow
A "Go to Web Page" step can always be added to the workflow directly.

You can edit the list of URLs in the URL field of the Setting section when needed.

2.2 Open Multiple Webpages on the Side Navigation Menu
- Choose how you'd like to input the URLs. You can input the URLs manually, import the URLs from a file such as an XLS file, import the URLs from another task, or batch generate a list of URLs. Check Batch URL input for more detailed instructions. If you choose to enter the URLs manually, make sure you enter one URL per line; you can also copy the list of URLs directly from an Excel sheet.
- Copy and paste the URLs into the Website box and Save to start.

2.3 Open Multiple Webpages by Adding a Step in the Workflow
- If you need to add a list of URLs in the workflow, hover over where you'd like to add the steps and click the "+" icon.
- Add a "Loop" item from the drop-down menu.
- When a Loop Item is added, double-click it to input the URLs.
- Under the "Loop Item", select the loop mode as List of URLs and click the button to input the URLs.
- Save the settings and a "Loop Item" with "Go to Web Page" nested inside will be generated.

Every website is different and no two networks are the same. This is why you'd always want to make use of the settings for the "Go to Web Page" step to make sure any special situation is accommodated properly.
- Timeout: Adjust "Timeout" if the web page takes more time to load than usual.
- URL: Change the page URL here if you need to open a different webpage.
- Before page render: Options for what can be done before the page loads. You can set up a wait time to slow down the process whenever needed, or use cookies to open the webpage (such as when log-in is required).
- After loading page: Options for what can be done after the webpage is loaded. The most frequently used option is adding a page scroll-down, which tells Octoparse to automatically scroll down the page to load more content as soon as the page is loaded. First, tick the box for scroll down and choose how you'd like to scroll the page, i.e. "to the bottom of the page" or "for one screen". Then set how many times you'd like to scroll down the page, and "Wait time" for how long to wait between scrolls.
- Retry: Use "Retry" to reload the page based on a set of pre-defined conditions, for example, if the current page does or does not contain a designated page element.
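To make the scroll and retry options of the "Go to Web Page" step more concrete, here is a minimal Python sketch of the behaviour those options describe. It is not Octoparse code and does not call any Octoparse API: `ScrollSettings`, `load_with_retry`, and `scroll_page` are hypothetical names chosen for illustration, and the page loader and scroll action are passed in as plain functions so the sketch runs without a browser.

```python
import time
from dataclasses import dataclass
from typing import Callable


@dataclass
class ScrollSettings:
    """Hypothetical stand-in for the scroll options; not an Octoparse API."""
    mode: str = "bottom"        # "bottom" = to the bottom of the page, "screen" = one screen
    repeats: int = 3            # how many times to scroll down the page
    wait_seconds: float = 0.0   # how long to wait between scrolls


def load_with_retry(load_page: Callable[[], str],
                    needs_retry: Callable[[str], bool],
                    max_retries: int = 3) -> str:
    """Reload the page while the retry condition holds, at most max_retries times."""
    content = load_page()
    for _ in range(max_retries):
        if not needs_retry(content):
            break
        content = load_page()
    return content


def scroll_page(scroll_once: Callable[[str], None], settings: ScrollSettings) -> int:
    """Scroll `repeats` times, pausing `wait_seconds` between scrolls."""
    done = 0
    for i in range(settings.repeats):
        scroll_once(settings.mode)
        done += 1
        if i < settings.repeats - 1:
            time.sleep(settings.wait_seconds)
    return done
```

A retry condition such as "the page does not contain a designated element" would be expressed here as a function like `lambda html: "id='results'" not in html`, where the element name is an assumption made for the example.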
