extracting Text non recursively with Jsoup

Question

this is the code I'm trying to run :

String html = "ZOLA (1)";

Document doc = Jsoup.parse(html); //connect  to the page
Element element = doc.getAllElements().first(); //recive the names elements

System.out.println(element.text()); //prints "ZOLA (1)"
System.out.println(element.ownText()); // prints nothing

my goal is to extract only "ZOLA", without the text of the children node, but ownText prints nothing... how should I do it?

Arvind Kumar Avinash · Accepted Answer

The problem is that doc.getAllElements().first() returns


 
 
  ZOLA (1)

while you expect

ZOLA (1)

The following should work for you:

String html = "ZOLA (1)";

Document doc = Jsoup.parse(html);
Elements links = doc.getElementsByTag("a");
System.out.println(links.get(0));
System.out.println(links.get(0).ownText());

Output:

ZOLA (1)
ZOLA

extracting Text non recursively with Jsoup

Answers (2)

Related Questions