Learn to navigate from a listing page to detail pages and extract structured movie information.
Open Websites
Find Links
Follow Links
Extract Data
Find Links
to identify specific URLs and Follow Links
to visit them.Extract Data
schema for information found on detail pages.Open Websites
and Extract Data
. We will add Find Links
and Follow Links
manually.Open Websites
BlockOpen Websites
block.https://www.rottentomatoes.com/browse/movies_in_theaters
Screenshot: Open Websites block configured with Rotten Tomatoes URL
Find Links
BlockFind Links
.+
icon on the canvas at the point where you want to add the block (i.e., after the Open Websites
block). The Find Links
block will be added and connected.Find Links
block to configure it.
Find links to individual movie pages in the main list
. You might add constraints like Links must contain '/m/'
if you observe a pattern in Rotten Tomatoes movie URLs.Screenshot: Find Links block configured to find movie page links
Follow Links
BlockFollow Links
.Find Links
block. The Follow Links
block will be added and connected.Follow Links
block to configure it.
Follow each link
. This will make the subsequent Extract Data
block run for every movie page found.Screenshot: Follow Links block configured to 'Follow each link'
Extract Data
BlockFollow Links
block navigates to.
Extract Data
block (it should be after Follow Links
).
A single item
, as each movie page contains details for one movie.
NAME | EXAMPLE VALUE OR A LONGER DESCRIPTION |
---|---|
movie_title | The main title of the movie. |
tomatometer | The Tomatometer score (percentage), e.g., 95%. Extract as text or just the number. |
audience_score | The Audience Score (percentage), e.g., 88%. Extract as text or just the number. |
synopsis | A brief summary or plot description of the movie. |
rating | The MPAA rating (e.g., PG-13, R). |
genre | Primary genre(s) of the movie, e.g., Action, Comedy. |
director | Name(s) of the director(s). |
release_date_theaters | The theatrical release date, e.g., Oct 20, 2023. |
You are on a movie detail page. Extract the specified information. If a field is not present, leave it empty.
Screenshot: Extract Data block configured for extracting movie details