Reputation: 48441
I want to use HTTP GET and POST requests to retrieve URLs from a website and parse the HTML. How do I do this?
Upvotes: 18
Views: 19797
Reputation: 120286
You can use HttpURLConnection in combination with URL.
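import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;

URL url = new URL("http://example.com");
HttpURLConnection connection = (HttpURLConnection) url.openConnection();
connection.setRequestMethod("GET");
connection.connect();
// getInputStream() throws an IOException on 4xx/5xx responses; check getResponseCode() first if that matters
InputStream stream = connection.getInputStream();
// read the contents using an InputStreamReader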
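To fill in the reading step and cover the POST half of the question, here is a minimal sketch built on the same HttpURLConnection calls. The class name HttpFetch and the helpers readAll, get and post are made up for illustration, and the code assumes the server answers in UTF-8 and that the POST body is already form-encoded.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of any library.
public class HttpFetch {

    // Reads the whole stream as text. Assumes UTF-8; a real client would
    // look at the Content-Type header instead.
    public static String readAll(InputStream in) throws IOException {
        StringBuilder sb = new StringBuilder();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            char[] buf = new char[4096];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
        }
        return sb.toString();
    }

    // Sends a GET request and returns the response body.
    public static String get(String address) throws IOException {
        HttpURLConnection connection =
                (HttpURLConnection) new URL(address).openConnection();
        connection.setRequestMethod("GET");
        try {
            return readAll(connection.getInputStream());
        } finally {
            connection.disconnect();
        }
    }

    // Sends a POST request with an already form-encoded body,
    // e.g. "q=java&page=1", and returns the response body.
    public static String post(String address, String formBody) throws IOException {
        HttpURLConnection connection =
                (HttpURLConnection) new URL(address).openConnection();
        connection.setRequestMethod("POST");
        connection.setDoOutput(true); // required before writing a request body
        connection.setRequestProperty("Content-Type",
                "application/x-www-form-urlencoded");
        try (OutputStream out = connection.getOutputStream()) {
            out.write(formBody.getBytes(StandardCharsets.UTF_8));
        }
        try {
            return readAll(connection.getInputStream());
        } finally {
            connection.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        // Only touches the network when a URL is passed on the command line.
        if (args.length > 0) {
            System.out.println(get(args[0]));
        } else {
            System.out.println("usage: java HttpFetch <url>");
        }
    }
}
```

Calling HttpFetch.get("http://example.com") should return the page markup as a String, which you can then hand to whatever parser you pick.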
Upvotes: 21
Reputation: 104168
The easiest way to do a GET is to use the built-in java.net.URL. However, as mentioned, Apache HttpClient is the more robust choice, since it gives you control over redirect handling, among other things.
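For the plain java.net.URL route, a rough sketch could look like the following. SimpleGet and fetch are names I made up, and the code assumes the content is UTF-8. URL.openStream() is shorthand for openConnection().getInputStream(), so it only does GET and gives you no handle for setting headers or a request body.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of any library.
public class SimpleGet {

    // Opens the URL and returns everything it serves as a String.
    // Assumes UTF-8 content.
    public static String fetch(URL url) throws IOException {
        try (InputStream in = url.openStream()) {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[4096];
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
            }
            return new String(out.toByteArray(), StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws IOException {
        // Only touches the network when a URL is passed on the command line.
        if (args.length > 0) {
            System.out.println(fetch(new URL(args[0])));
        } else {
            System.out.println("usage: java SimpleGet <url>");
        }
    }
}
```

Because fetch takes a URL, it works for file: URLs as well as http: ones, which is handy for trying out the parsing step against a saved page.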
For parsing the HTML, you can use an HTML parser.
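If you would rather not pull in a library just to try this out, the JDK ships a lenient HTML parser in the Swing packages. This is a swapped-in alternative, not the parser recommended above, and it targets fairly old HTML, so treat it as a sketch only. LinkExtractor and extractLinks are hypothetical names; the example collects the href of every anchor tag.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper class, not part of any library.
public class LinkExtractor {

    // Returns the href value of every <a> tag, in document order.
    public static List<String> extractLinks(String html) throws IOException {
        final List<String> links = new ArrayList<>();
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                if (tag == HTML.Tag.A) {
                    Object href = attrs.getAttribute(HTML.Attribute.HREF);
                    if (href != null) {
                        links.add(href.toString());
                    }
                }
            }
        };
        // Third argument true: ignore any charset declared inside the document,
        // since the input is already a decoded String.
        new ParserDelegator().parse(new StringReader(html), callback, true);
        return links;
    }

    public static void main(String[] args) throws IOException {
        String html = "<html><body><a href=\"http://example.com/a\">A</a>"
                + "<p>text</p><a href=\"/b\">B</a></body></html>";
        for (String link : extractLinks(html)) {
            System.out.println(link);
        }
    }
}
```

For real-world pages a dedicated parsing library will probably cope better with modern markup, but the callback pattern is the same idea: fetch the page as a String, then walk the tags and pick out the attributes you need.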
Upvotes: 3