Andy
Andy

Reputation: 50540

PHP string comparison between two different types of encoding

I am attempting to match a string I know to a web page title in php.

I have the following string:

ℛobinhood

I have a page title that looks like this:

ℛobinhood

but is actually encoded like this if I look at the <title> element:

&#8475;obinhood

I'm not sure how to compare these two strings. How do I either convert the &#8475; to ℛ or the other way so that I can do a proper comparison?

Upvotes: 4

Views: 1869

Answers (2)

Alex M
Alex M

Reputation: 3513

You can use htmlentites ( http://php.net/manual/en/function.htmlentities.php ) to convert to the entity version.

Upvotes: 1

Eugen Rieck
Eugen Rieck

Reputation: 65264

Use the multibyte string functions:

  • mb_convert_encoding() to convert both versions to something sane (UTF8, internal encoding)
  • mb_strstr() to do the comparison

mb_* will have an encoding for HTML entities and (most likely) whatever encoding your string is in.

Upvotes: 1

Related Questions