getting the hyperlink from website

Question

I am using Jsoup. I do a get document= connect.get(); and get the html page.

now I write that to a text(string).

I have users who populate these pages. I know each user name . These pages have the username. I am able to do a string.contains("username") to check if the user is present or not.

Now my issue is: I have users with there names in

Tables

ordered lists

unordered lists

in Body

But in all these cases they have in format as:Example

2012 academic record

some are in table and all..

In the example I know the student name = john. how can I get all the urls?

==

Sreenath S · Accepted Answer

How about this:

    Document doc = Jsoup.connect(url).get();
    Elements links = doc.select("a[href]");

    for (Element link : links) {
        if(link.attr("abs:href").contains(studentName) || link.text().contains(studentName)){
            studentLinkList.add(link.attr("abs:href"));
        }
    }

getting the hyperlink from website

Answers (2)

Related Questions