Removing HTML tags from a string of text

Question

For a bit of a practice assignment, my professor challenged the lecture to write up some code that removes HTML tags from a string of text. He mentioned a specific command that we would learn later on that would do this for us, but he wants us to do so manually.

Here's what I have so far:

#include
#include
using namespace std;

int main() {
  string name = " smelly  butts  smell";
  cout << name << endl;

  int a = 0, b = 0;

  for (int a = b; a < name.length(); a++) {
      if (name[a] == '<') {
          for (int b = a; b < name.length(); b++) {
              if (name[b] == '>') {
                  name.erase(a, (b + 1));
                  break;
              }
          }
      }
  }

  cout << name << endl;

  system("pause");
  return 0;
}

I feel like I'm close, but I'm not getting the correct output.

DoomzDay · Accepted Answer

for (int b = a; b < name.length(); b++) {
    if (name[b] == '>') {
        name.erase(a, (b + 1));
        break;
    }
}

In this part of code your are erasing a part of length (b), while you should erase a part of length (b - a)

Try this one:

for (int b = a; b < name.length(); b++) {
    if (name[b] == '>') {
        name.erase(a, (b - a + 1));
        break;
    }
}

It should works as you want.

Removing HTML tags from a string of text

Answers (2)

Related Questions