Get value from parsed HTML using Regex

Question

For a project to make communication on a website clearer, I have to pull the messages out of the page using a regex. (Why? Because the message is commented out. With the normal document.getElement... methods I can't reach the message, but with the regex mentioned below I can.)

I am trying to get a value using this expression:

\s*(.|\n)*?

How I use this expression:

var pulledmessage = /\s*(.|\n)*?/.exec(htmlDoc);

The above expression gives me null when I console.log() it. My guess is that the htmlDoc I pass to the regex is not in a format it can work with. I have no clue how to make it pull the value.

What I use to parse the HTML:

var html1 = httpGet(messages);

parser = new DOMParser();

htmlDoc = parser.parseFromString(html1,"text/html");

The result I want to get:

D. De: 
Information, Information. 
Information, Information
Para: Information
CC: Information
Alot of text here ............

I edited the above value to remove personal information.

html1 contains a full HTML page with the information required.

Thijs · Accepted Answer

New attempt. Since the td you need is commented out, remove all HTML comment delimiters from the loaded HTML string before parsing it. The td then becomes part of the parsed document, and you can use innerHTML to get the message content.

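const
  // Sample markup: eight divs with the same class, the 7th one commented out.
  documentString = `
    <div class="valorCampoSinTamFijoPeque">1</div>
    <div class="valorCampoSinTamFijoPeque">2</div>
    <div class="valorCampoSinTamFijoPeque">3</div>
    <div class="valorCampoSinTamFijoPeque">4</div>
    <div class="valorCampoSinTamFijoPeque">5</div>
    <div class="valorCampoSinTamFijoPeque">6</div>
    <!--<div class="valorCampoSinTamFijoPeque">7</div>-->
    <div class="valorCampoSinTamFijoPeque">8</div>
    `,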
  outputElement = document.getElementById('output');

const
  // Remove all comment delimiters (<!-- and -->) from the input string.
  cleanupDocString = documentString.replace(/<!--|-->/g, ''),
  // Create a parser and construct a document based on the string. The parsed
  // document should contain 8 divs.
  parser = new DOMParser(),
  htmlDoc = parser.parseFromString(cleanupDocString, 'text/html');

const
  // Get the 7th div with the class name from the parsed document.
  element = htmlDoc.getElementsByClassName('valorCampoSinTamFijoPeque')[6];

// Log the element found in the parsed document.
console.log(element);
// Log the content from the element.
console.log(element.innerHTML);
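
As an alternative that stays closer to the question, here is a minimal sketch that runs the regex against the raw markup string (html1 in the question) instead of the parsed document. RegExp.prototype.exec converts its argument to a string, so passing a Document object gives the pattern the object's string form rather than the page source. The function name extractCommentedCells and the sample markup are hypothetical, and the pattern assumes the message sits in a single td wrapped in one HTML comment.

```javascript
// Hypothetical helper: returns the inner HTML of every <td> that is wrapped
// in an HTML comment, e.g. <!--<td class="x">message</td>-->.
// Assumption: each commented-out cell is a single <td> inside one comment.
function extractCommentedCells(html) {
  // [\s\S]*? matches any character, newlines included, non-greedily.
  const pattern = /<!--\s*<td\b[^>]*>([\s\S]*?)<\/td>\s*-->/g;
  return Array.from(html.matchAll(pattern), (match) => match[1].trim());
}

// Made-up sample input standing in for the string returned by httpGet().
const sampleHtml = `
<table><tr>
  <td class="valorCampoSinTamFijoPeque">visible</td>
  <!--<td class="valorCampoSinTamFijoPeque">
    De: someone
    Para: someone else
  </td>-->
</tr></table>`;

const cells = extractCommentedCells(sampleHtml);
console.log(cells.length);
console.log(cells[0]);
```

In the question's code this would be called as extractCommentedCells(html1), before any DOMParser step. A regex like this is fragile against nested comments or unusual markup, so removing the delimiters and parsing, as shown above, is likely the more robust route.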
