Haim Bender
Haim Bender

Reputation: 8187

How do I preg_match for words in Hebrew

I need a function that matches full words in hebrew in php.

Please help.

Upvotes: 5

Views: 2568

Answers (3)

Gumbo
Gumbo

Reputation: 655309

Try this regular expression describing Unicode character properties:

/\p{Hebrew}+/u

Upvotes: 8

Haim Bender
Haim Bender

Reputation: 8187

Thanks for all your answers,

The one that works for me is preg_match("/^\p{Hebrew}+$/u", "שלום");

Upvotes: 2

Peter Bailey
Peter Bailey

Reputation: 105898

Assuming your source data is UTF-8 encoded

$input = "ט״סת תעסתינג O״ת סOמע העברעו תעחת";

preg_match_all( "/[\\x{0590}-\\x{05FF}]+/u", $input, $matches );

echo '<pre>';
print_r( $matches );
echo '</pre>';

Yields

Array
(
    [0] => Array
        (
            [0] => ט״סת
            [1] => תעסתינג
            [2] => ״ת
            [3] => ס
            [4] => מע
            [5] => העברעו
            [6] => תעחת
        )

)

I based the range of 0590 through 05FF on this Unicode chart (edit: found more good hebrew/unicode info here). I used this to generate my sample input. Since I don't know hebrew I can't actually verify that the matched output is valid.

You may need to tweak it but hopefully this gets you headed in the right direction.

Upvotes: 5

Related Questions