Now, you can add additional “select” commands under the page selection to also extract the lawyer’s address, phone number and more.In the textbox under it, enter the following regex code: mailto:(.*) Now select the email_url extraction and tick the “Use Regex” box. To do this, expand your email selection by clicking on the icon next to it.įirst, remove the “extract email” command since this is just extracting the text inside the button.We’ll set up ParseHub to clean up the address before it extracts it. You will notice that the email being pulled starts with “mailto:”. Here you can make your first selection for data to extract from this page.įirst, click on the “Email Attorney” button to select it. ParseHub will now open a new tab and render the profile page for the first name on the list. Then click on the Create New Template button. Click on “No” and next to Create New Template enter the name profile_template (or something relevant). A pop-up will appear asking you if this a “next page” command. Now, click on the PLUS(+) icon next to the lawyer selection and choose the "Click" command. Next, remove the URL extraction under your lawyer selection, since we are not interested in pulling the profile URL in this case.On the left sidebar, rename your selection to lawyer.Click on the second one on the list to select them all. The rest of the names on the list will be highlighted in yellow.It will be highlighted in green to indicate that it has been selected. Start by clicking on the first name on the list. ParseHub will now render the page inside the app. Then enter the URL of the page you will want to scrape.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |