GREP data within multiple tags from cURL html

Question

Getting rather desperate to understand how to get the data I want from a curl command.

I need a hand with generating a grep command to get the following html:

 timetable </t itle>< <h3>study table</h3> <p>< strong>biology <div> <table
width='100%' cellpadding='5' cellspacing='0'><tr><th colspan="3">Level 44 Building 1 <tr> 
<td >monday</td> <td >1:30 – 2:30</td> <td >< a>Room number 22</a></td> <td > </td>
</tr> <tr><th colspan="2">body> </html>
</code></pre>

<p>I would like the output look like:</p>

<pre><code>timetable
study table
Biology
Level 44 Building 1
Monday
1:30 - 2:30 
Room Number 22
</code></pre>

<p>Currently I only know how to do a single <code>grep</code> such as :</p>

<pre><code>grep 'href='
</code></pre>

Chris Seymour · Accepted Answer

If you have GNU grep:

$ grep -Po '(?<=>) ?\K[^<&>]{2,}(?=<)' file
timetable 
study table
biology 
Level 44 Building 1 
monday
1:30 – 2:30
Room number 22

Disclaimer: You should really use a proper parser for this.

GREP data within multiple tags from cURL html

Answers (2)

Related Questions