Reputation: 187
When I crawl an app page on Google Play Store, I could only crawl the first few pages of an app review.
(The next page of app review is showed up when I click the arrow button using the web browser)
How can I crawl the others of reviews?
Upvotes: 4
Views: 11364
Reputation: 1
It's too late though but you can use BeautifulSoup library for python to crawl all the reviews. All reviews are inside div having class 'single-review'.
Create a soup object of the html of the webpage, select all the instances of 'single-review' class.
soup.select('.single-review')
will do it for you. Extract whatever you want from a review. For reference you might want to checkout the BeautifulSoup documentation
You don't need to check for right arrow or left arrow. Above method will give you all the review currently on the webpage.
Upvotes: 0
Reputation: 5220
I checked out Telegram's Google Play page and it seems all the reviews are loaded together with the rest of the page.
Reviews visible without clicking arrow
After clicking the arrow, the reviews section increases in size and shows full-length reviews
Then, you can look and older and "less helpful" reviews by clicking the right arrow.
However, all these reviews have been loaded from the start:
From there, you'd have to parse the HTML using your web crawler to get the reviews.
EDIT
Okay, after clicking the arrow a few times, Google Play starts to load some older reviews by pinging http://play.google.com/store/getreviews
with a token of sorts.
This is a snippet of the response I got:
<div class="single-review"> <a href="/store/people/details?id=113340797493751556969">...
It seems to contain all the loaded reviews, so all you'll have to do is to parse that, or try to figure out how the token works so you can fetch from getreviews
yourself.
Upvotes: 4