JustAGuy
JustAGuy

Reputation: 5941

Invoke-WebRequest and Hebrew characters

I already tried the reghack for PS to support Hebrew characters. I can type Hebrew no problems but for some reasons characters containing Hebrew returned from Invoke-WebRequest are in gibberish (see the following screenshot).

enter image description here

Here's the site URL I'm attempting to query: https://www.hometheater.co.il/vt278553.html

Update: It looks like the content-type being returned is ALWAYS of charset Windows-1255 which is probably the issue.

Upvotes: 2

Views: 602

Answers (1)

bergmeister
bergmeister

Reputation: 969

This seems to be not only an issue of having to specify the encoding but also that the shell cannot display the encoding correctly. If you specify the encoding to a file and edit it with a decent text editor (not Notepad but e.g. Notepad++), then you will be see that it has parsed it correctly.

Invoke-WebRequest -Uri "https://www.hometheater.co.il/vt278553.html" -ContentType "text/plain; charset=Windows-1255" -OutFile content.txt

We can also test that the in-memory presentation is correct by reading it and writing it to another file:

Get-Content .\content.txt | Set-Content test.txt

Upvotes: 1

Related Questions