Johnny Maelstrom
Johnny Maelstrom

Reputation: 48441

How do I retrieve a URL from a web site using Java?

I want to use HTTP GET and POST commands to retrieve URLs from a website and parse the HTML. How do I do this?

Upvotes: 18

Views: 19797

Answers (2)

Rob Hruska
Rob Hruska

Reputation: 120286

You can use HttpURLConnection in combination with URL.

URL url = new URL("http://example.com");
HttpURLConnection connection = (HttpURLConnection)url.openConnection();
connection.setRequestMethod("GET");
connection.connect();

InputStream stream = connection.getInputStream();
// read the contents using an InputStreamReader

Upvotes: 21

kgiannakakis
kgiannakakis

Reputation: 104168

The easiest way to do a GET is to use the built in java.net.URL. However, as mentioned, httpclient is the proper way to go, as it will allow you among others to handle redirects.

For parsing the html, you can use html parser.

Upvotes: 3

Related Questions