Reputation: 147
I have to get some URLs of a website, but there not existing as typical <a href="">
links.
I have already found out with phantomJS to crawl some JS contents and ajax results etc.
but I can't find out how to get a list of links to a page while I am crawling this page...
Here is the structure of the content:
I have already tried out to catch it with php's file_get_content() and the normal phantomJS() request but nothing helped me :(
could you please give me a hint?
Upvotes: 0
Views: 267
Reputation: 1103
look first in your inline tags for JS redirects. If that fails, look in your header for anything inside of tags like this:
<script async="" src="https://SOME-URL"></script>
If you copy those URLs into your browser you can get the plain text of the JS files. You can then search them for JavaScript redirect behaviors as well.
Upvotes: 1