Reputation: 205
I want to extract the link details like Facebook and Google. But the problem is that the detials (like the manin text and images) are not sure. Sometimes the main text is in p tag sometimes it is extracted from meta tag. So any thought on which places to search?
http://forums.asp.net/p/1847212/5162336.aspx/1?extract+certain+part+of+text+like+google+and+bing
I pasted this link in facebook and the detail text it extracted was in p tag that was below another p tag which also had a text inside it. So how to decide this?
Upvotes: 0
Views: 165
Reputation: 3858
If the website in question is Open Graph protocol enabled, then you could parse the meta-data defined by that standard. Since it's the same system used by Facebook (and I believe Google+), you can probably expect a decent % of sites to support it. I wouldn't 100% rely on it, but a nice tool to have.
Upvotes: 1