bajro
bajro

Reputation: 1250

Scala get html content from web page

I am trying to get a whole html from a website in scala and then parse or get out certain information out of it. The standard html library doesnt work for me as if I am trying to print the html content it doesnt print the whole html? Any solutions how to get a full html content from a web page?

Upvotes: 2

Views: 3153

Answers (2)

Som Bhattacharyya
Som Bhattacharyya

Reputation: 4112

Well you could use the excellent scala-scraper library here : Its basically a wrapper for the JSoup Java library
You could write code that reads like this :(taken from GitHub)

object NewsApp extends App {
  val browser = JsoupBrowser()
  val doc = browser.get("http://observador.pt")

  println()
  println("=== OBSERVADOR ===")

  doc >> extractor(".logo img", attr("src")) |> println
  doc >> extractorAt[String]("example-extractor") |> println

  println("==================")
  println()

  doc >> ".small-news-list h4 > a" foreach println
}

Upvotes: 4

sheff_master
sheff_master

Reputation: 19

How are you getting html page in Scala ? I think you can look at spray-client http://spray.io/documentation/1.2.3/spray-can/http-client/

Upvotes: 0

Related Questions